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Foreword to the 
Second English Edition 



The Mathematical Society of Japan takes 
pleasure in presenting this second edition of 
our Encyclopedic Dictionary of Mathematics 
to every researcher and user of mathematics. 

It is intended to be a compact, up-to-date 
source of information comprising, as com- 
pletely as possible, all significant results in all 
fields of our Science, pure and applied, from 
the elementary to the advanced level. The 
success of the first edition owed much to the 
kind assistance given by the American Mathe- 
matical Society. As described in the preface, 
the members of our Society have taken re- 
sponsibility for compiling this new edition. 

We hope that it will be as useful to the mathe- 
maticians of today as the first edition was 
to the mathematicians of yesterday. We also 
hope that this edition will be followed in 
years to come by subsequent ones incorporat- 
ing the future development of our Science. 

Hikosaburo Komatsu 
President 1985-1987 
Mathematical Society of Japan 



Foreword to the 
First English Edition 



The American Mathematical Society welcomes 
the publication of the Encyclopedic Dictionary 
of Mathematics. For many years we have been 
fascinated by the publication in Japanese, 
Iwanami Sugaku Ziten, because we saw that 
this was an encyclopedia that contained effec- 
tive and penetrating information about all 
the fields of advanced mathematical research. 
We were also frustrated because we could not 
read Japanese and so we could not really 
reach out to this expert and effective source 
of information. We now welcome the fact that 
the second Japanese edition has been trans- 
lated into English and we look forward to the 
fascination which we can now have in getting 
at this rich mine of information. 



Saunders MacLane 
President 1973-1974 
American Mathematical Society 




Preface to the 
Second English Edition 

This second edition of the Encyclopedic Dic- 
tionary of Mathematics is in substance an 
English version of the third edition of Iwanami 
Siigaku Ziten (in Japanese). We shall explain 
how these two versions are related to each 
other and how the present edition differs from 
the first English edition. For the sake of sim- 
plicity we abbreviate Encyclopedic Dictionary 
of Mathematics and Iwanami Siigaku Ziten to 
EDM and ISZ, respectively, and indicate the 
numbers of editions beyond the first by Arabic 
numerals. 

The prefaces of the previous editions will 
clarify how ISZ, its augumented edition, ISZ2, 
ISZ3, and EDM came into existence in 1954, 
1960, 1968, 1985, and 1977, respectively. EDM, 
published nine years later than ISZ2, consisted 
of its English translation and some new mate- 
rials added to update its contents. In retro- 
spect, it was felt that this proeedure was not 
adequate to cope with the recent rapid pro- 
gress of mathematics, and a suggestion was 
raised in the Mathematical Society of Japan 
that ISZ3 and EDM2 be produced simulta- 
neously. The favorable reaction of the mathe- 
matical public to EDM encouraged us greatly. 

In 1978 an agreement was made among the 
Society, Iwanami Shoten, Publishers, and The 
MIT Press for the publication of ISZ3 and 
EDM2. An editorial committee was estab- 
lished in the Society with the members named 
in the preface to 1SZ3. Manuscripts were pre- 
pared simultaneously in Japanese and English 
by each contributor, with Yuji Ito acting as 
linguistic consultant. 

I should mention that we benefited greatly 
from the kind comments on EDM by the 
following mathematicians: J. F. Adams, M. 
Atiyah, A. Borel, H. Cartan, K. Chandrase- 
kharan, S. S. Chern, J. Dieudonne, E. Hewitt, 

F. Hirzebruch, O. Lehto, J. L. Lions, L. Markus, 
and J. P. Serre. In particular, we are deeply 
grateful to J. Dieudonne for his many detailed 
comments. 

Compared with ISZ3 and EDM, EDM2 has 
the following characteristics: 

(1) EDM2 contains many more new mathe- 
matical results than EDM. For the details we 
refer the reader to the main points of revision 
mentioned in the preface to ISZ3, because 
EDM2 has the same mathematical content 

as ISZ3 and the additions made to ISZ2 and 
subsequently in EDM are of relatively minor 
importance. 

(2) The Japanese textbooks listed in the 
article references of ISZ3 are replaced in 
EDM2 by references to standard textbooks 



written in English. Such replacement was not 
done in compiling EDM. 

(3) Years of birth and death are included in 
the Name Index of EDM2 as far as possible. 
This information was given in ISZ2 but was 
removed in compiling EDM. 

(4) The Subject Index of EDM 2 is so de- 
signed that every concept consisting of two or 
more words can be traced from each of these 
component words. (This principle was adopted 
in ISZ2 and EDM but was not rigidly fol- 
lowed in ISZ3 due to the lack of space.) 

(5) While all editions of ISZ were in one 
volume and EDM was in two volumes, EDM2 
is in four volumes: I. Forewords, prefaces, 
introduction, text A-E; II. Text F N; III. 

Text O-Z; IV. Appendices, indexes and other 
backmatter. The Systematic List of Articles 
appears in volume IV, page 1857. 

While EDM2 is more voluminous than 
ISZ3, we hope that, being written in English 
and printed with generous margins, it can 
easily provide readers with information on 
every significant result of today’s mathematics 
and so will be useful to a large number of 
mathematicians. I am responsible for any 
shortcomings that may exist despite all our 
efforts, and I would appreciate any remarks 
from the readers. I hope to have occasion to 
remedy any such shortcomings in the future. 

Finally I would like to repeat my thanks to 
all the collaborators named in my preface to 
ISZ3. Also I express my gratitude to Yuji Ito 
for his tremendous effort to polish the English 
of all manuscripts, to Shigeru litaka for his 
laborious assistance in the final stage of the 
work, to Mrs. M. Nawata for her excellent 
secretarial work, and to the staff of the De- 
partment of Mathematics at Gakushuin Uni- 
versity for providing me with an office for 
compilation for eight years. 

Kiyosi Ito 
December 1986 
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Preface to the 
Third Japanese Edition 

The first edition of Iwanami Sugaku Ziten was 
published in 1954, and a revised and aug- 
mented edition appeared in 1960. Extensive 
revision of the work was carried out sub- 
sequently, and the result was published in 1968 
as the second edition, which has retained its 
valuable and useful life for the past 17 years. 
The English translation of this second edition 
published by The MIT Press has achieved 
international recognition. In the meantime, 
remarkable progress has been made in mathe- 
matical sciences: recognition of the interrela- 
tionships amongst various branches within 
mathematics has increased significantly, and 
the formulation of mathematics as a synthetic 
entity is in the making. Furthermore, advanced 
mathematical theories have been utilized fre- 
quently in the physical, biological, and social 
sciences, and expectations for mathematics 
to be the basis of all the sciences have been 
increasing. In order to cope with these devel- 
opments, it was decided that the second edition 
of Sugaku Ziten should be further revised and 
an updated version be published as the third 
edition. The English translation of this third 
edition will also be published by The MIT 
Press. 

The main points of the revision are as 
follows: 

(1) On the Size and Scope of the Encyclo- 
pedia: The total number of articles has in- 
creased only slightly from that of the second 
edition, from 436 to 450. However, in view 
of the rapid and extensive development in 
mathematics in recent years, many old articles 
have been reorganized and unified, and much 
new material has been added. Consequently, 
the mathematical content of the encyclopedia 
has increased a great deal over that of the 
second edition, and this, together with the 
authors’ efforts to render the articles more 
readable than before, necessitated a 50% in- 
erease in the number of pages devoted to the 
main text. The account of computer science 
has been taken out of the subject area of 
Numerical Analysis, and a new subject area 
has been created for it; thus mathematics is 
now classified into 21 subjeet areas altogether 
instead of 20. 

(2) On the Arrangement of the Articles: In 
the second edition, the title of each article 
was spelled out in Romanized Japanese, and 
articles were arranged alphabetically. In the 
third edition, however, the titles of the articles 
are arranged in the order of the Japanese syl- 
labary (katakana) and each article is numbered 



accordingly. To each article a parenthetical 
number is also attached, giving the subject 
area to which the article belongs and its place 
in that list. 

(3) On the Text: Sections within each arti- 
cle are headed by letters A, B, C, . . . ; the ini- 
tial section of each article gives an outline 
and is devoted to an introductory and general 
account of the topic with which the article 

is concerned. In the aetual exposition of 
the articles, every effort has been made to 
pay close attention to the interrelationships 
amongst various fields of mathematics. Even 
when there was no need to revise the original 
article, changes were made in the bibliography 
to list items that would be more accessible to 
the readers. 

(4) On the Appendices: The appendices have 
been revised and augmented along with the 
text. Numerical tables were deleted when- 
ever it was felt that scientific calculators or 
microcomputers could easily reproduce their 
contents. 

(5) On the Indexes: The listing of mathe- 
matical terms in Japanese has been changed, 
as in the case of the arrangement of articles, 
from the alphabetical order to the order of 
the Japanese syllabary. However, terms in 
Japanese that start with a European word 
have been separated out, and, in order to 
facilitate finding, have been listed indepen- 
dently in an Index for Mathematical Terms in 
Japanese with European Headings. The loca- 
tion of a term listed in the indexes is indicated 
by the article number and the letter heading 
of the section in the article where the term 
appears. 

We here outline briefly how the process of 
compilation of this edition was organized. In 
the summer of 1978 the Mathematical Society 
of Japan decided, following a proposal by 
Professors Shokichi lyanaga and Yukiyosi 
Kawada, the chief editors of the previous 
editions of Sugaku Ziten, to undertake the 
compilation of the third edition. Following 
a resolution by the members of the Soci- 
ety, Kunihiko Kodaira, Sigeru Mizohata, 

Itiro Tamura, Nagayoshi Iwahori, Tosihusa 
Kimura, and myself have joined lyanaga and 
Kawada to form an editorial committee and 
have formulated basic guidelines for the com- 
pilation of the third edition. The work of 
editing and compiling articles in each subject 
area was delegated to the editors designated 
for the areas, and I was asked to undertake the 
responsibility of putting the entire volume in 
order. 

The subject areas and their editors are as 
follows: 
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I Logic and Foundations; Shoji Maehara 

II Sets, General Topology, and Cate- 
gories: Shoji Maehara, Hikosaburo 
Komatsu, Masayoshi Nagata 

III Algebra: Masayoshi Nagata 

IV Group Theory: Nagayoshi Iwahori 

V Number Theory: Takayoshi Mitsui 

VI Euclidean and Projective Geometry: 
Itiro Tamura 

VII Differential Geometry: Morio Obata, 
Shigeru litaka, Itiro Tamura 

VIII Algebraic Geometry; Shigeru litaka 

IX Topology; Itiro Tamura 

X Analysis: Sigeru Mizohata, Kiyosi Ito 

XI Complex Analysis: Yukio Kusunoki, 
Shigeru litaka 

XII Functional Analysis: Hikosaburo 
Komatsu 

XIII Differential, Integral, and Functional 
Equations: Tosihusa Kimura, Sigeru 
Mizohata 

XIV Special Functions; Sin Hitotumatu 

XV Numerical Analysis; Masaya Yamaguti 

XVI Computer Science and Combinatorics: 
Sin Hitotumatu 

XVII Probability Theory: Kiyosi Ito 

XVIII Statistics: Kei Takeuchi 

XIX Mathematical Programming and 
Operations Research: Shigeru Furuya 

XX Mechanics and Theoretical Physics: 
Huzihiro Araki 

XXI History of Mathematics; Shokichi 
lyanaga 

Appendices: Sin Hitotumatu 

Before the actual process of compilation 
started, the editorial committee had met sev- 
eral times, and two meetings of all the subject- 
area editors had been called. Furthermore, 
editors for each area consulted with other 
experts in their fields as they selected the 
titles of articles to be included. In areas where 
remarkable development had taken place, such 
as differential geometry, functional analysis, 
probability theory, and applied mathemat- 
ics, extensive revision and insertion of many 
new articles were proposed by the respective 
editors. For this reason, the original articles 
were thoroughly reorganized and systematized. 
We resolved to keep the whole work in one 
volume, even though a number of new articles 
have had to be added. 

The selection of articles and of their re- 
spective authors was finally completed in the 
spring of 1980, and 197 colleagues were asked 
to write the articles. The names of these con- 
tributors and those of the previous editions are 
listed elsewhere in this volume. To all of the 
authors we express our sincere gratitude. 

Editing of the entire manuscript has been 
carried out since the summer of 1982, and we 
are deeply indebted to the following colleagues 
for their painstaking efforts in checking cross 



Preface to the 
Third Japanese Edition 

references and in proofreading; Nobuyosi Moto- 
hashi (Foundations, Set Theory), Takeo Yoko- 
numa (Algebra, Group Theory), Takayoshi 
Mitsui (Number Theory), Tetsuro Kawasaki 
(Geometry, Topology), Shigeru litaka, Isao 
Wakabayashi (Algebraic Geometry), Morio 
Obata, Koichi Ogiue, Osamu Kobayashi (Dif- 
ferential Geometry), Seize Ito, Hisashi Oka- 
moto (Analysis), Hiroaki Aikawa, Makoto 
Ohtsuka (Computer Analysis), Hikosaburo 
Komatsu, Akihiko Miyachi (Functional 
Analysis), Kazuo Okamoto, Daisuke Fujiwara 
(Differential Equations), Sin Hitotumatu 
(Special Functions), Teruo Ushijima (Nu- 
merical Analysis), Hideo Wada (Computer 
Science), Yasunori Okabe (Probability Theory), 
Mituaki Huzii, Yoshihiro Yajima (Statistics), 
Shigeru Furuya (Mathematical Programming), 
Koichi Nakamura (Theoretical Physics), 
Shuichi Okamoto (History of Mathematics), 
Kosaku Okutsu (History of Mathematics, 
Number Theory). 

On those occasions when it became neces- 
sary to rapidly revise manuscripts in order to 
unify the presentation, we were forced to go 
through the revision without consulting the 
authors of the manuscripts. I am responsible 
for all such revisions and hereby express my 
apologies to the authors concerned. 

As for the indexes, we received assistance 
from Takeo Yokonuma, Koichi Yano, Hiroaki 
Aikawa, and Hisashi Okamoto in all aspects 
of the work, which lasted for two years, 
through the final proofreading. Preliminary 
work on the Name Index was carried out by 
staff members of the University of Tokyo, and 
Seizo Ito supervised its final compilation. We 
also asked many other colleagues, and in par- 
ticular Nobuyuki Ikeda and Huzihiro Araki, 
to participate in the preparation of the Name 
Index, using resources available at different 
universities. To all of these people goes our 
sincere gratitude for their assistance. 

Last August, when the completion of the 
work was drawing near, it became necessary 
for me to leave Japan for some time, and this 
made it imperative that I entrust Shigeru 
litaka with responsibility for supervising the 
work at the final stage of editing. I am most 
grateful to litaka for having agreed to take 
over this responsibility and for having brought 
the work to completion under the pressure of 
time. 

From the beginning of this project, we have 
received an unlimited amount of assistance 
from the members of the Dictionary Depart- 
ment of Iwanami Shoten, Publishers. Messrs. 
Ikutaro Sasaki, Hirotomo Ushida, Kazuhiko 
Uetake, and Nagao Sato in particular, have 
made supreme efforts and have come up fre- 
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quently with innovative ideas to make this 
dictionary polished and perfect. To them and 
also to those at Dai-Nippon Printing Co., who 
typeset with the use of computers the entire 
text, including the complicated mathematical 
formulas, and who have cooperated with us 
fully in the process of editing the indexes, goes 
our deep gratitude. 

Kiyosi ltd 
October 1985 



Preface to the 
First English Edition 

The first and second editions of Iwanami 
Sugaku Ziten (in Japanese) were published, 
respectively, in April 1954 and June 1968 by 
Iwanami Shoten, Publishers, Tokyo. Begin- 
ning in the late 1950s, a number of unsuccess- 
ful attempts were made to arrange for translat- 
ing the Sugaku Ziten into European languages. 
Finally an agreement for an English transla- 
tion was made between The MIT Press and 
the Mathematical Society of Japan in July 
1968. The discussions were carried on first 
by Professor Kosaku Yosida, then presi- 
dent of the Mathematical Society of Japan, 
and later by Yukiyosi Kawada, who suc- 
ceeded him in April 1968. Throughout these 
initial negotiations, which lasted from 1966 to 
1968, we received the kindest assistance 
from Dr. Gordon Walker, Executive Director 
of the American Mathematical Society, 
and from Professors W. T. Martin and Shizuo 
Kakutani. 

The agreement for the project was shortly 
followed by the establishment of a commit- 
tee for the English edition of Sugaku Ziten 
within the Mathematical Society of Japan, 
with the following membership: Professors 
Yasuo Akizuki, Shigeru Furuya, Sin Hito- 
tumatu, Masuo Hukuhara, Isao Imai, Sho- 
kichi lyanaga, Yukiyosi Kawada, Kunihiko 
Kodaira, Atuo Komatu, Hirokichi Kudo, 

Shoji Maehara, Yukio Mimura, Kiyoshi 
Noshiro, Shigeo Sasaki, Shoji Ura, Nobuo 
Yoneda, and Kosaku Yosida. This committee 
requested the original authors of the articles 
and other members of the Society to translate 
the work. A list of translators will be found at 
the end of this work. 

In November 1968, an advisory committee 
for the project was formed with the following 
membership: Professor Edwin Hewitt (ehair- 
man). Dr. Sydney H. Gould, Professor Shizuo 
Kakutani, Professor Kenneth O. May, and 
Professor Isaac Namioka. 

As the translating began, we were immedi- 
ately faced with problems concerning unifica- 
tion of terminology and style, some of which 
were inherent in the differences between the 
structures of our two languages — for example, 
the fact that the Japanese language makes no 
distinction between singular and plural forms 
of nouns. 

In August 1969, Professor Hewitt kindly 
arranged a meeting at the University of Wash- 
ington, Seattle, that included the members 
of the Japanese and American committees 
and a representative of The MIT Press. It was 
agreed during this meeting that the transla- 
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tion should be faithful, with only a minimum 
number of changes, such as the correction of 
mathematical errors; whereas the references 
to each article might be augmented consider- 
ably for the convenience of Western readers. 
Professor Kenneth O. May volunteered to 
review the entire translated manuscript, and 
Professors Isaac Namioka and Shizuo Kaku- 
tani, who are proficient in both Japanese and 
English, proposed to read through some of the 
manuscript of the translated articles. It was 
also agreed that the Systematic List of Arti- 
cles should appear in French, German, and 
Russian, as well as in English. 

We owe very much to the American com- 
mittee: Professor Hewitt organized the whole 
work, and Professor May revised the entire 
manuscript and gave us important advice con- 
cerning the appendices, according to which we 
deleted some of the numerical tables which 
may be easily found in readily accessible West- 
ern books. Professor Namioka reviewed a 
great part of the manuscript, transmitting his 
views to Professor May, who forwarded them 
to us with his comments. All of this assistance 
helped us greatly in making our final decisions. 
Professor Kakutani gave us very detailed and 
important advice on the choice of reference 
works. 

We were also assisted concerning English 
terminology and reference books by the fol- 
lowing Japanese mathematicians working in 
American universities: Tadatoshi Akiba, Pro- 
fessors Kiyosi Ito, Tatsuji Kambayashi, Tosio 
Kato, Teruhisa Matsusaka, Katsumi Nomizu, 
Ichiro Satake, Michio Suzuki, and Gaisi 
Takeuti. 

The Mathematical Society of Japan estab- 
lished the following double reviewing system: 
group A with its twenty subgroups, each 
headed by the members listed below, reviewed 
their respective subjects; while group B re- 
viewed the whole manuscript, mainly from the 
linguistic standpoint. 

Group A 

1. Foundations of mathematics: Shoji 
Maehara 

2. Set theory: Atuo Komatu and Shoji 
Maehara 

3. Algebra: Akira Hattori, Masayoshi Nagata, 
and Hideyuki Matsumura 

4. Group theory: Shingo Murakami, Mitsuo 
Sugiura, and Reiji Takahashi 

5. Number theory: Yukiyosi Kawada and 
Tomio Kubota 

6. Geometry, 7. Differential geometry: Sho- 
kichi lyanaga, Shigeo Sasaki, and Kentaro 
Yano 

8. Algebraic geometry: Yasuo Akizuki and 
Kunihiko Kodaira 

9. Topology; Atuo Komatu 

10. Real analysis: Sin Hitotumatu, Shunji 
Kametani, and Shigeki Yano 
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1 1. Complex analysis: Kiyoshi Noshiro 

12. Functional analysis: Yukio Mimura and 
Kosaku Yosida 

13. Differential equations: Masuo Hukuhara 
and Sigeru Mizohata 

14. Special functions: Sin Hitotumatu 

15. Numerical analysis, 16. Probability 
theory: Kiyosi Ito 

17. Statistics: Hirokichi Kudo 

18. Information theory: Tosio Kitagawa and 
Hirofumi Uzawa 

19. Theoretical physics: Isao Imai and 
Kazuhiko Nishijima 

20. History of mathematics: Tamotsu Murata 
Group B 

Kenichi lyanaga and Mitsuyo lyanaga 

Professor Sin Hitotumatu also assisted us in 
translating the titles of Japanese books given 
in the references and the explanations attached 
to the lists of formulas and numerical tables in 
the appendices. We are also grateful for the 
generous cooperation offered to us by our 
colleagues in the Department of Mathemat- 
ics, Faculty of Science, University of Tokyo: 
Hiroshi Fujita, Shigeru Furuya, Akio Hat- 
tori, Seizo Ito, Nagayoshi Iwahori, Tosihusa 
Kimura, Kunihiko Kodaira, Hikosaburo 
Komatsu, Akihiro Nozaki, and Itiro Tamura. 
We are indebted as well to Professors Walter 
L. Bailey of the University of Chicago and 
Yuji Ito of Brown University for many valu- 
able consultations concerning both mathe- 
matical and linguistic questions. 

In translating the Systematic List of Articles 
into French, German, and Russian, we were 
assisted by Professor Hideya Matsumoto in 
Paris, Professor Emanuel Sperner in Hamburg, 
Professor Katsuhiro Chiba in Tokyo, and 
Professor Arkadi! Maltsev in Moscow. 

We began to send the manuscript to The 
MIT Press in March 1970 and finished send- 
ing it in July 1972. The manuscript was edited 
there, then sent to Professors May, Namioka, 
and Kakutani, and finally was sent back to us 
with their comments and questions. All of the 
references were carefully checked by Laura 
Platt. 

Iwanami Shoten, Publishers, have always 
been cooperative with us. In the office of 
the Mathematical Society of Japan, Yoko 
Endo, Reiko Nagase, and Chieko Sagawa 
helped us with their efficient secretarial 
work. 

The fruition of this project was made pos- 
sible only by the gracious assistance offered to 
us by many people, including those already 
mentioned. We should like to express our 
most sincere gratitude to all those who have 
helped us so kindly. 

Shokichi lyanaga, Yukiyosi Kawada 
August 1973 
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Addition made in January 1976 

After the procedures described above, the 
whole manuscript of this Encyclopedia was 
sent to The MIT Press in August 1973. It was, 
however, toward the end of November 1975 
that the final decision was made by The MIT 
Press to send the manuscript to composition 
in early 1976 in order that the work be pub- 
lished in 1977. 

At the same time, we were asked to review 
and update the manuscript up to the end of 
February 1976. We are now making our best 
effort to this effect with the kind help, especially 
from the linguistic viewpoint, of Dr. E. J. 
Brody. 

In so doing, we have noticed that perhaps 
too much emphasis has been given to results 
obtained by Japanese mathematicians and that 
there are still many things in this book which 
should be improved. 

We hope that ongoing revisions will be 
carried out in subsequent editions. 



Preface to the 
Second Japanese Edition 

Seven and a half years have passed since the 
revised and augmented edition of Iwanami 
Sugaku Ziten was published. The nature and 
purpose of this book remain the same as de- 
scribed in the preface to the first edition: It is 
an encyclopedic dictionary with articles of 
medium length aimed at presenting the whole 
of mathematics in a lucid system, giving exact 
definitions of important terms in both pure 
and applied mathematics, and describing the 
present state of research in each field, together 
with historical background and some per- 
spectives for the future. However, mathemat- 
ical science is in rapid motion, and the “pre- 
sent state of research” changes constantly. 

The present updated second edition has been 
published to remedy this situation as far as 
possible. 

The main points of revision are as follows: 

(1) On the Articles and the Size of the Ency- 
clopedia: From the articles of the last edition, 
we have removed those whose importance has 
diminished recently (e.g., Geometry of Tri- 
angles), while we have added new articles in 
domains of growing importance (e.g.. Cate- 
gories and Functors; /C-Theory). Many articles 
concerning applied mathematics in the first 
edition were short; in this edition, they have 
been combined into articles of medium length 
to save space and to systematize the presenta- 
tion. The number of articles, 593 in the first 
edition, has thus diminished to 436. We have 
made every effort to keep the size of the ency- 
clopedia as it was, but the substantial aug- 
mentation of content has necessarily brought 
about an enlargement of about 30%. 

(2) On the Text: When the title of an article 
has remained the same as in the first edition, 
we have reviewed the whole text and revised 
whenever necessary. Especially for the fun- 
damental ideas, we have endeavored to give 
thorough explanations. In the first edition we 
gave English, French, and German transla- 
tions of article titles; in the present edition, we 
have also added Russian. The bibliographies 
at the ends of articles have been updated. 

(3) On Terminology: In previous editions we 
endeavored to unify the terminology of the 
whole encyclopedia so that the reader would 
have no difficulties with cross-references. Here 
we have done this once again with the hope of 
attaining results more perfect than before. 

(4) On the Appendices: The appendices were 
designed to supplement the text efficiently. 
Some overlapping of the appendices with the 
text found in previous editions has been re- 
moved. Also deleted in this edition are elemen- 
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tary formulas in analytic geometry and tables 
of statistical distributions, which can be easily 
found in other books. However, we have 
added some formulas in topology, the theory 
of probability, and statistics, as well as tables 
of characters of finite groups, etc. 

(5) On the Indexes; Important terms are 
listed multiply in the indexes to facilitate find- 
ing, e.g., the term transcendental singularity 
appears under both transcendental and singu- 
larity. Both names in the text and those in the 
references are included in the Name Index of 
this edition. The numbers of items in the Index 
of Mathematical Terms in Japanese and in 
European Languages and in the Name Index in 
this edition are 17740, 10124, and 2438, com- 
pared with 8254, 8070, and 1279, respectively, in 
the previous edition. 

The compilation of this edition was organized 
as follows. In the spring of 1964 we began to 
select the titles of articles with the aid of the 
following colleagues: in set theory and foun- 
dations of mathematics, Shoji Maehara; in 
algebra and number theory, Yasuo Akizuki, 
Yukiyosi Kawada; in differential geometry, 
theory of Lie groups, and topology, Yozo 
Matsushima, Atuo Komatu; in analysis, 

Masuo Hukuhara, Kosaku Yosida, Shunji 
Kametani, Sin Hitotumatu; in probability 
theory, statistics, and mathematics for pro- 
gramming, Kiyosi ltd, Hirokichi Kudo, and 
Shigeru Furuya; in theoretical physics, Isao 
Imai; and for the appendices. Sin Hitotumatu. 

I have participated in compiling the articles 
on geometry and the history of mathematics. 
Kawada and Hitotumatu undertook the re- 
sponsibility of putting the volume in order. 

The work of selecting titles was completed 
in the summer of 1964. We then asked 173 
colleagues to contribute articles. The names of 
these contributors and those of the previous 
editions are listed elsewhere. To all of them 
goes our most sincere gratitude. 

In editing the manuscript, we were assisted 
by the following colleagues in addition to 
those mentioned already; in set theory and 
foundations of mathematics, Setsuya Seki and 
Tsurane Iwamura; in algebra and number 
theory, Masayosi Nagata, Akira Hattori, 
Hideyuki Matsumura, Ichiro Satake, Tikao 
Tatuzawa; in geometry, theory of Lie groups, 
and topology, Singo Murakami, Hideki Ozeki, 
Noboru Tanaka, Kiiti Morita, Hirosi Toda, 
Minoru Nakaoka, Masahiro Sugawara, Shoro 
Araki; in analysis, Kiyoshi Noshiro, Yusaku 
Komatu, Seizo Ito, Hiroshi Fujita, Shige- 
Toshi Kuroda, Sigeru Mizohata, Masaya 
Yamaguti, Tosiya Saito, Tosihusa Kimura, 
Masahiro Iwano; in probability theory, sta- 
tistics, and mathematics of programming. 
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Nobuyuki Ikeda, Tadashi Ueno, Masashi 
Okamoto, Haruki Morimoto, Kei Takeuchi, 
Goro Ishii, Tokitake Kusama, Hukukane 
Nikaido, Toshio Kitagawa; and in theoretical 
physics, Ryogo Kubo, Hironari Miyazawa, 
Yoshihide Kozai. 

After the summer of 1965, we entered into 
the period of finer technical editing, in which 
we were assisted by the following colleagues: 
in algebra, Keijiro Yamazaki, Shin-ichiro 
Ihara, Takeshi Kondo; in geometry, Tadashi 
Nagano, Mitsuo Sugiura, Ichiro Tamura, 
Kiyoshi Katase; in analysis, Nobuyuki Suita, 
Kotaro Oikawa, Kenkiti Kasahara, Tosinobu 
Muramatsu, Hikosaburo Komatsu, Setuzo 
Yosida, Hiroshi Tanaka; and in history, 
Tamotsu Murata. 

We were also assisted by Katsuhiro Chiba 
in the Russian-language translation of the 
article titles, and by Osamu Kota and Kiyoshi 
Katase in the indexing. 

Proof sheets began appearing in the spring 
of 1966. In proofreading, Kaoru Sekino, 
Osamu Kota, Kiyoshi Katase, and Teruo 
Ushijima helped us, as well as Mrs. Rieko 
Fujisaki. Miss Yoko Endo worked with all of 
us throughout the entire revision at the office 
of the Mathematical Society of Japan. She 
helped us especially in looking for and check- 
ing references and preparing the Name Index, 

Yukiyosi Kawada supervised the whole 
work, succeeding me in the role I had played 
in the compilation of the first and augmented 
editions. Sin Hitotumatu collaborated with 
him throughout, especially on the appendices. 
The second and the third proof sheets of the 
text were read by Kawada; the fourth proof by 
Hitotumatu; the proof sheets of appendices by 
both Kawada and Hitotumatu. 

The Editorial Committee of the Mathe- 
matical Society of Japan asked me to write 
this preface. Having edited the previous edi- 
tions and realizing full well the difficulty of the 
task, I would like to express my particular 
gratitude to Kawada. In the Dictionary De- 
partment of Iwanami Shoten, Publishers, 
Messrs. Hiroshi Horie, Tetsuo Misaka, Shi- 
geki Kobayashi, and Toshio Kouda were very 
cooperative in their collaboration with us. 

To them and also to those who typeset and 
printed this book at Dai-Nippon Printing Co. 
and Shaken Co. goes our gratitude. 

S. lyanaga 
March 1968 
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Preface to the Revised and 
Augmented Japanese Edition 

Six years have passed since the publication of 
the first edition of this encyclopedia. This re- 
vised and augmented edition incorporates the 
achievement of these years. Tt contains, to- 
gether with the correction of errors found in 
the first edition, some new articles such as 
Abelian Varieties, Automata, Sheaves, Homo- 
logical Algebra, Information Theory, and also 
supplements to articles in the first edition such 
as Complex Multiplication, Computers, and 
Manifolds. These additional items render the 
new edition 93 pages longer than the previous 
one. Each article has been thoroughly re- 
vised, and the indexes have been completely 
rewritten. 

We were assisted by the following colleagues 
in selecting articles, writing, and proofreading: 
In set theory and foundations of mathematics, 
Sigekatu Kuroda, Setsuya Seki; in algebra 
and number theory, Tadao Tannaka, Tsuneo 
Tamagawa; in real analysis, Shunji Kametani, 
Kosaku Yosida; in function theory, Kiyoshi 
Noshiro, Sin Hitotumatu; in theory of dif- 
ferential and functional equations, Masuo 
Hukuhara, Masahiro Iwano, Ken Yamanaka; 
in functional analysis, Kosaku Yosida, Seizo 
ltd; in geometry, Shigeo Sasaki, Nagayoshi 
Iwahori; in topology, Atuo Komatu, Itiro 
Tamura, Nobuo Yoneda; in theory of proba- 
bility, Kiyosi ltd, Seizo ltd; in statistics, Toshio 
Kitagawa, Sigeiti Moriguti, Tatsuo Kawata; 
in applied mathematics, Sigeiti Moriguti; and 
in mechanics and theoretical physics, Taka- 
hiko Yamanouchi, Isao Imai. The revision 
and augmentation of the articles concerning 
the history of mathematics was done by 
myself. Miles. Ydko Tao, Eiko Miyagawa, and 
Mutsuko Nogami worked in the office of the 
Society. 

The names of authors who contributed to 
the completion of this edition have been added 
to the original list of contributors. 

The project of editing this edition started 
in the summer of 1958. We acknowledge our 
deep gratitude to all those who have col- 
laborated with us since that time. 

S. lyanaga 
August 1960 
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This encyclopedia, Iwanami Sugaku Ziten, was 
compiled by the Mathematical Society of 
Japan at the request of the Iwanami Shoten, 
Publishers, who have hitherto published a 
series of scientific dictionaries such as Iwanami 
Rikagaku-Ziten {Iwanami Encyclopedia of 
Physics and Chemistry) and Iwanami Tetugaku- 
Ziten (Iwanami Encyclopedia of Philosophy). 

As mentioned in the prefaces to these volumes, 
the importance of such encyclopedias in clari- 
fying the present state of each science is obvious 
if we observe the rapid pace of contemporary 
research. Mathematics is also in rapid motion. 
As a fundamental part of exact science, it 
serves as a basis of all science and technology. 

It also retains its close contact with philo- 
sophy. Therefore, the significance of having 
an encyclopedia of mathematics cannot be 
overemphasized. 

Mathematics have made remarkable pro- 
gress in the 20th century. As for the situation 
toward the end of the 19th century, we quote 
the following passage from the article Mathe- 
matics in the 19th Century of this encyclo- 
pedia; “Toward the end of [the 19th] century, 
the subjects of mathematical research became 
highly differentiated. Branches split into more 
specialized areas of study, while unexpected 
relations were found between previously 
unconnected fields. The situation became 
so complicated that it was difficult to view 
mathematics as a whole. It was in these cir- 
cumstances that in 1898, at the suggestion of 
Franz Meyer and under the sponsorship of the 
Academies of Gottingen, Berlin, and Vienna, a 
project was initiated to compile an encyclo- 
pedia of the mathematical sciences. Entitled 
the Enzyklopddie der mathematischen Wissen- 
schaften, it was completed in 20 years. ...” 

One of the characteristics of 20th-century 
mathematics is the conscious utilization of the 
axiomatic method and of general concepts 
such as sets and mappings, which serve as 
foundations of different theories. Indeed, 
mathematics is being reorganized on the basis 
of topology and algebra. One such example of 
reorganization is found in Bourbaki’s Elements 
de mathematique; some fifteen volumes of this 
series have been published since 1939, and 
more are coming. This encyclopedia, with 
its limited size, cannot contain proofs for 
theorems. However, we intend to present a 
lucid view of the totality of mathematics, 
including its historical background and future 
possibilities. 

Each article of this encyclopedia is of me- 
dium length — sufficiently short to permit 
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the reader to find exact definitions of notions, 
and sufficiently long to contain explanations 
clarifying how important concepts in the same 
field are related to each other. The problem of 
choosing adequate titles required some de- 
liberation. The Systematic List of Articles, 
classified according to specific fields, shows 
those we have chosen. The Index of Terms 
contains detailed references for each notion. 

The appendices, including formulas and tables, 
supplement the text, and will be particularly 
useful for applied mathematicians. 

The project of compiling this encyclopedia 
was proposed in the spring of 1947 by the 
Steering Committee of the Mathematical 
Society of Japan. It was promptly adopted, 
and the selection of articles in specific fields 
was started by the sectional committees of the 
Society. After seven years, our encyclopedia 
is finally appearing. We shall not give a de- 
tailed description of how our work proceeded 
through all these years. We list simply the 
names of those who assisted us greatly and 
to whom we should like to express our deep 
gratitude. 

The president of the Society at the start of 
this project was the late Professor Tadahiko 
Kubota; but our work has been supported also 
by Professors Teiji Takagi, Zyoiti Suetuna, and 
Masatsugu Tsuji as well as by other leading 
members of the Society. 

At the stage of selecting articles, we were 
assisted by the following colleagues; in history 
and the foundations of mathematics, Sige- 
katu Kuroda, Motokiti Kondo; in algebra 
and number theory, Kenjiro Shoda, Tadasi 
Nakayama, Masao Sugawara, Yukiyosi 
Kawada, Kenkichi Iwasawa; in geometry, 
Kentaro Yano, Asajiro Ichida; in function 
theory, Kiyoshi Noshiro, Yusaku Komatu; in 
the theory of differential and functional equa- 
tions, Masuo Hukuhara, Shigeru Furuya; in 
topology, Atuo Komatu, Ryoji Shizuma; in 
functional analysis, Yukio Mimura, Shizuo 
Kakutani, Kosaku Yosida; in the theory of 
probability and statistics, Tatsuo Kawata, 
Toshio Kitagawa, Junjiro Ogawa; and in 
applied mathematics, Ayao Amemiya, Isao 
Imai, Kunihiko Kodaira, Shigeiti Moriguti. 

We asked 190 colleagues to contribute arti- 
cles, which were collected in 1949. Since then 
we have spent an unexpectedly long time edit- 
ing them. Terminology had to be unified 
throughout the encyclopedia so that the reader 
would have no trouble with cross-references. 
Repetitions had to be eliminated and gaps 
filled. Part of the manuscript thus had to be 
rewritten a number of times. We have made 
out utmost effort in this editing work, but we 
are not completely without apprehension that 
our result has still left something to be desired. 
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For any shortcomings in the work, I take 
complete responsibility, as I have acted as the 
editor-in-chief. Also, since we have rewritten 
the manuscript, as already mentioned, we have 
refrained from printing the name of the original 
author of each article; for this, I must request 
the understanding of the contributors. 

In the stage of editing and proofreading, 
we were assisted by the following colleagues; 
Yukio Mimura, Yukiyosi Kawada, Kazuo 
Matsuzaka, Sin Hitotumatu, Setsuo Fuku- 
tomi, Setsuya Seki, Shoji Irie, Shigeo Sasaki, 
Tatsuo Kawata, Sigekatu Kuroda, Yusaku Ko- 
matu, Ayao Amemiya, Isao lami, Tosio Kato, 
Tsurane Iwamura, Morikuni Goto, Kosaku 
Yosida, Jiro Tamura, Yasuo Akizuki, Kiyoshi 
Noshiro, Motosaburo Masuyama, Sigeiti 
Moriguti, Osamu Kota, Nobuo Yoneda, 
Tsuneo Tamagawa, Jun-ichi Hano; and more 
partieularly, in the foundations of mathematics, 
Sigekatu Kuroda, Tsurane Iwamura; in alge- 
bra and number theory, Kazuo Matsuzaka, 
Yukiyosi Kawada; in algebraic geometry, 
Yakuo Akizuki; in real analysis, Tatsuo Ka- 
wata; in complex analysis, Yusaku Komatu, 

Sin Hitotumatu, Jiro Tamura; in functional 
analysis, Kosaku Yosida; in topology, Setsuo 
Fukutomi, Nobuo Yoneda; in the theory of 
probability and statistics, Motosaburo Masu- 
yama, Sigeiti Moriguti; and in applied mathe- 
matics, Ayao Amemiya, Isao Imai, Tosio 
Kato, Sigeiti Moriguti. 

The portraits of Abel and Riemann were 
kindly loaned to us by Torataro Shimomura. 

The formulas in the appendices were com- 
piled by Isao Imai, Sin Hitotumatu, and Sigeiti 
Moriguti; the Subject Index (in Japanese and 
European languages) by Osamu Kota and 
Mrs. Hiroko Ide; the Name Index and Com- 
ments on Journals and Serials by Setsuo 
Fukutomi. Setsuo Fukutomi has taken an 
active part in our work ever since 1948 and 
given much effort to collecting and rewriting 
the manuscript and to unifying terminology. 
The editorial staff of Iwanami Shoten, Pub- 
lishers, has always been cooperative. Without 
their generous support, this encyclopedia 
could never have been published. 

I should like to express my sincere gratitude 
to all those who have collaborated with us 
directly or indirectly. 

S. lyanaga 
March 1954 




Introduction 

The text of this Encyclopedic Dictionary con- 
sists of 450 articles arranged alphabetically, 
beginning with 1 Abe! and ending with 450 
Zeta Functions. Most of these articles are 
divided into sections, indicated by A, B, C, 

A A, BB, .... Cross-references to articles, e.g., 
to the second article, are of the form: 2 

Abelian Groups) or 2 Abelian Groups A), 
according as the whole article or a specific 
section is being referred to. Citations in the 
indexes are also given in terms of article and 
section numbers. 

Key terms accompanied by their definitions 
in the text are printed in boldface. All of these 
terms are found in the Subject Index at the 
end of the volume. 

The sign + means that the term preceded by 
it can be found in the index. A list of special 
notations used throughout the work (with 
explanations of their meanings) appears after 
the appendices. 

A Systematic List of Articles, showing the 
general structure of the work, will be found on 
p. 1857 (vol. IV). The number in parentheses 
after each article title refers to this systematic 
classification into subject areas; e.g., “Abelian 
Varieties (VIII. 5)” means that the article on 
Abelian varieties is the fifth article in Section 
VIII of the systematic list. 

Books and articles in journals are cited in the 
text by numbers in brackets: [1], [2], .... At 
the end of each article there is a section of 
references in which, for books, the name of the 
author or authors, title, name of the publisher, 
year of publication, and the number of the 
edition are given; for journals, the name of the 
author, title of the article, name of the journal, 
and the volume numbers and inclusive page 
numbers are given in this order. (The names of 
journals and publishers are abbreviated as 
indicated in the lists at the end of the work.) 



The Cyrillic alphabet is transliterated as 
follows: 
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Abel, Niels Henrik 

Niels Henrik Abel (August 5, 1802- April 6, 
1829) was born a son of a poor pastor in the 
hamlet of Findo in Norway. In 1822, he en- 
tered the University of Christiania; however, 
he studied mathematics almost entirely on his 
own. He was recognized as a promising stu- 
dent by his senior, Holmboe, and after gradu- 
ation he studied abroad in Berlin and Paris. 

In Berlin he met and was aided by A. Crelle, 
the founder of the Journal fur die Reine und 
Angewandte Mathematik, and participated in 
the founding of this journal. Although he did 
brilliant work in Paris, he did not gain the 
fame he deserved. He returned to Norway in 
May 1827, but, unable to find a job, he was 
obliged to fight poverty while continuing his 
research. He died at twenty-six of tuberculosis. 

His best-known works are the result that 
algebraic equations of order five or above 
cannot generally be solved algebraically, the 
result that tAbelian equations can be solved 
algebraically, the theory of ^binomial series 
and of series in general, the theory of tellip- 
tic functions and more generally of ^alge- 
braic functions, the introduction of tAbelian 
integrals, and the establishment of tAbel’s 
theorem. His work in both algebra and analy- 
sis, written in a style conducive to easy com- 
prehension, reached the highest level of attain- 
ment of his time. 

References 

[1] N. H. Abel, Oeuvres eompletes 1, II, L. 
Sylow and S. Lie (eds.), Grondahl & Son, new 
edition, 1881 (Johnson, n.d.). 

[2] C. A. Bjerknes, Niels Henrik Abel, 
Gauthier-Villars, 1885. 

[3] F. Klein, Vorlesungen fiber die Entwick- 
lung der Mathematik im 19. Jahrhundert I, 
Springer, 1926 (Chelsea, 1956). 



2 (IV.2) 

Abelian Groups 

A. General Remarks 

A +group G is called an Abelian group (or 
commutative group) if G satisfies the commuta- 
tive law ab~ha for all a, be G. In this article, G 
always denotes an Abelian group. Every ^'sub- 
group of G is a ’''normal subgroup, and all the 
elements of finite order in G form a subgroup 



T, for which the ^factor group G/T has no 
elements of finite order except the identity e. 

T is called the (maximal) torsion subgroup of G. 
If G = T, then the Abelian group G is called a 
torsion group (or periodic group). On the other 
hand, if {e), then G is called torsion-free; if 
7/ G, {c}, then G is called mixed. If the 
order of every element of a torsion group G is 
a power of a fixed prime number p, then G is 
called an Abelian p-group (or primary Abelian 
group). An Abelian torsion group is the tdirect 
sum of primary Abelian groups. Thus the 
study of torsion groups is reduced to that of 
primary Abelian groups. 

B. Finite Abelian Groups 

The following fundamental theorem on finite 
Abelian groups was established by L. Kro- 
necker, G. Frobenius, and L. Stickelberger in 
the 1870s. An Abelian group G of order p", 
where p is a prime number, is a direct product 
of ^cyclic subgroups Z^, ... ,Z/. G = Zj x ... x 
Z^. If Z; is of order p”i, then n = ni + ...+n^, 
and we can assume that nj^nj+^. A direct 
product decomposition of G, as above, is not 
unique, but are determined uniquely 

by G. The system |p"‘, ...,p"’’} or {nj, is 

called the system of invariants (or type) of G, 
and a system of ^generators (z i , . . . , z,.} of 
Z, , . . . , Z,. is called a basis of G. An Abelian 
group of type (p, p, . . . , p) is called an elemen- 
tary Abelian group. The decomposition of a 
finite Abelian group into a ^direct sum of 
subsets (not necessarily of subgroups) was 
considered by G. Hajos (1942) and applied 
successfully to a problem of number theory 
(^ 151 Finite Groups). 

C. Finitely Generated Abelian Groups 

The theory of ^’finitely generated Abelian 
groups, i.e., Abelian groups generated by a 
finite number of elements, is as old as that of 
finite Abelian groups. The direct product of 
infinite cyclic groups is called a free Abelian 
group. A finitely generated Abelian group G is 
the direct product of a finite Abelian group 
and a free Abelian group. The finite factor is 
the torsion subgroup of G. The free factor of the 
group G is not necessarily unique; however, 
the number of infinite cyclic factors of the free 
factor is uniquely determined and is called the 
rank of G. Two finitely generated Abelian 
groups are isomorphic if they have isomorphic 
maximal torsion subgroups and the same 
rank. This theory can be extended to the 
theory of ^modules over a '’principal ideal 
domain (^ 67 Commutative Rings K). 
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D. Torsion Groups 

The structure of Abelian p-groups is relatively 
well known, compared with other infinitely 
generated Abelian torsion groups. In the 
1920s, H. Priifer made the first important 
contribution to the study of Abelian p-groups, 
and H. Ulm and L. Zippin completed the 
theory for countable groups in the 1930s. The 
uncountable case was first treated by L. Kuli- 
kov in the 1940s, but the study of this case is 
still in progress. 

An Abelian p-group G^{e} is called divis- 
ible (or complete) if for any aeG there is an 
element xeG satisfying x’’ = a. A divisible 
group is a ^direct sum of Abelian groups of 
type p® (Priifer). Here a group of type p“ is 
isomorphic to the tmultiplicative group of all 
the p"th roots of unity (« = 1 , 2, . . . ) in the 
complex number field. Let G be any Abelian 
p-group. The maximal divisible subgroup 
K of G is a direct product factor of G:G — 

Vx R, where R has no divisible subgroups. 

An Abelian p-group without a divisible sub- 
group is called a reduced Abelian group. 

An element x of an Abelian p-group G is 
said to have infinite height if for any n there is 
an element y„eG satisfying x = y;[’". The ele- 
ments of infinite height form a subgroup G' of 
G. If G‘ = {e} and G is countable, then G is 
decomposed uniquely into the direct sum of 
cyclic groups. This assertion fails if the hy- 
pothesis of countability is dropped. By ''‘trans- 
finite induction we can define G^ as follows. If 
li is an tisolated ordinal number, then G^ — 
(G^"*)^; if ^ is a tlimit ordinal number, then 
G^ = ordinal number t 

such that G^ = G^^\ G' is the maximal divis- 
ible subgroup of G. If G is reduced, then G^ = 
{e}. We call t the type of an Abelian p-group 
G. For a<T, G“ = G7G“+^ is called the Ulm 
factor of G, and the sequence G°, . . . , G®, . . . 

(a < t) is called the sequence of Ulm factors 
of G. Each Ulm factor G® has no element of 
infinite height, and if a < t — 1 , then G® has an 
element of arbitrarily large order. Let t be a 
countable ordinal number; assume that for 
any ordinal number a<i there is given a 
countable Abelian p-group such that 
has no element of infinite height, and that 
for a T — 1 has an element of arbitrarily 
large order. Then there is a reduced coun- 
table Abelian p-group which is of type t with 
a sequence of Ulm factors isomorphic to Aq, 
Aj, ..., Aj,, ... (a<i) (Zippin). Two reduced 
countable Abelian p-groups A and B are iso- 
morphic if they have the same type i; and for 
any a<x the Ulm factors A® and B* are iso- 
morphic. The assertion fails if the hypothesis 
of eountability is dropped. 
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Abelian Groups 
E. Torsion-Free Groups 

In Abelian groups, the group operation is 
often written a + b, using the additive notation; 
an additively written group, called an additive 
group, is generally assumed to be Abelian. In 
the rest of this article we consider exclusively 
additive Abelian groups, of which the additive 
group Z of rational integers is the most primi- 
tive example. In such a group the identity 
element is called the zero element and is de- 
noted by 0; the inverse of a is denoted by — a, 
and we write a-\-( — b) — a — b. The direct sum 
of additive groups A;i (/eA) is called a free 
additive group if each A^ is isomorphic to Z. 

An additive group G is regarded as a tmodule 
over the tring Z, to which the notion of linear 
independence is applicable (-^ 277 Modules). 
Elements a^, of G are linearly dependent 
if there are integers not all of which 

are zero such that aj -I- . . . -t- — 0. Those 

that are not linearly dependent are termed 
linearly independent. An infinite set of elements 
of G is called linearly independent if the ele- 
ments of any finite subset are linearly inde- 
pendent. If there are N elements of G that are 
linearly independent, but if any N-\-l elements 
of G are linearly dependent, then N is called 
the rank of G. Such a system of N linearly 
independent elements is called a maximal 
independent system. A torsion-free additive 
group G is not necessarily free if G is not 
finitely generated. 

The first important work on torsion-free 
additive groups was done by F. W. Levi (1917). 
A. G. Kurosh (1937) completed the theory in 
the case of finite rank. In the general case little 
is known, and I. Kaplansky, J. Rotman, and 
others are continuing the investigation. 

The additive group Q of rational numbers is 
of rank 1, and conversely any additive group 
of rank 1 is isomorphic to some subgroup of 
Q. An additive group G is called divisible (or 
complete) if for any aeG and for any integer n 
there is an element x„eG such that nx„ = a. A 
divisible torsion-free additive group is isomor- 
phic to a direct sum of some copies of Q. For 
any torsion-free additive group G there is a 
divisible torsion-free additive group containing 
G. A minimal additive group F among these 
groups is uniquely determined up to isomor- 
phism and has the rational number field Q as 
an toperator domain. Let = {a/b\(a,b) = 
l,pf/)} be the ring of p-integers in Q, and let 
Gp be the smallest Q*^*-subgroup of F contain- 
ing G. Let Qp be the ^'p-adic number field and 
Zp the ring of tp-adic integers. Extending the 
operator domain Q to Qp we obtain naturally 
a Qp-module Fp from F. Let Gp be the natural 
closure of Gp in Fp. Then Gp has Zp as an 
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operator domain and thus becomes a Z^- 
module. A Z^-module of rank N is isomorphic 
to the direct sum of Kp copies of and N — Kp 
copies ofZp:Gp^i:„QptJ„ + Z„ZpW„{n- 
1, ...,Kp\m— 1, ...,N — Kp). Here Kp is called 
the p-rank of G. As the invariants of G, Kurosh 
gives the rank, the p-ranks for all primes p, 
and a certain equivalence class of the sequence 
of the matrices 'ilRp. Here p ranges over all 
primes, and 9[Rp is the matrix of coefficients 
when the elements of a maximal independent 
system of F are written as linear combinations 
of Wj. 



F. General Abelian Groups 

An Abelian group is, in general, an textension 
of a torsion group by a torsion-free group. A 
torsion group T is called bounded if there is an 
integer n such that t"=l for all te T. Suppose 
there is a torsion group T. Then T is a direct 
summand of an Abelian group G which con- 
tains T as its maximal torsion subgroup if and 
only if T is the direct product of a divisible 
group and a bounded group (R. Baer and S. 
Fomin). 



G. Characters 

A character y of an Abelian group G is a func- 
tion which assigns to each aeG a complex 
number x{a) of absolute value 1 and satisfies 
y{ab) ~ y{a)y{b) for all a,heG. The product y~ 
X 1 X 2 of two characters y^ and ^2 is defined 
= y is also a character 

of G. Thus all the characters of G form an 
Abelian group C(G), which is called the char- 
acter group of G. The identity element of the 
character group is the identity character (or 
principal character) y such that y{a) = 1 for all 
aeG. If G is finite, then G^C(G). This implies 
the duality G = C[C(G)]. This fact was ex- 
tended by L. S. Pontryagin to docally compact 
ttopological Abelian groups 422 Topo- 
logical Abelian Groups B-D). For additive 
Abelian groups with operator domains ^ 277 
Modules. 
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Abelian Varieties 



A. History 



Except for C. F. Gauss, whose work on this 
subject saw the light many years after his 
death, N. H. Abel was the first to consider 
talgebraic functions as functions of complex 
variables and to discover double periods of 
telliptic functions such as x — x{u), which is the 
inverse function of an elliptic integral 

dx 

vSw 

where f 4 .{x) is a polynomial in x of degree 4. C. 
G. Jacobi expressed elliptic functions explicitly 
as ratios of theta series. As a natural generali- 
zation of elliptic functions, which are tAbelian 
functions of genus 1, Jacobi and his successors 
studied the inverse function of the thyperellip- 
tic integral, or more generally, the inverse 
function of the Abelian integral. By investigat- 
ing hyperelliptic integrals of the first kind of 
genus 2, 

dxi X2dX2 

^JfeiXl) J y/Mx^) 

where f^{x) is a polynomial of degree 6 in x, 
Jacobi obtained multiple-valued functions 
with quadruple periods. He then discovered 
the remarkable fact that if we consider sums of 
two integrals 

Wi = 






the elementary symmetric functions 
-I-X 2 and S 2 = XiX 2 of Xj and X 2 are single- 
valued functions of Wj and U 2 with quadruple 
periods. He also conjectured that these func- 
tions Si and S 2 might be expressed explicitly in 
terms of theta series of and Wji this conjec- 
ture was later confirmed by J. G. Rosenhain 
and A. Gdpel. 

In the latter half of the 19th century, the 
general theory of Abelian funetions was es- 
tablished. The central subject was Jacobi’s 
inverse problem (^ Section L), which is a 
natural generalization of the above results. B. 
Riemann solved this problem by expressing 
elementary Abelian functions (-^ Section M) 
as rational functions of theta functions. 

The theory of functions with multiple 
periods was developed further by H. Poincare, 
G. Frobenius, and E. Picard. In the 20th cen- 
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tury, the importance of the theories of Abelian 
functions and Abelian varieties has become 
more obvious with the development of the 
theory of functions of several complex vari- 
ables and algebraic geometry. In particular, 
problems intimately related to number theory 
have given rise to the development of a purely 
algebraic theory of Abelian varieties. Valuable 
contributions have been made by S. Lefschetz, 
C. L. Siegel (tSiegel modular forms), A. Weil 
(algebraic theory of Abelian varieties, syste- 
matization of the theory of theta functions), 
and by D. Mumford (tmoduli theory, algebraic 
theory of theta functions). 

B. Algebraic Theory 

When a tgroup variety is ^complete as a 
variety, the group law is commutative; such a 
group variety is called an Abelian variety (-^ 

12 Algebraic Varieties H.). Let B be a sub- 
variety of an Abelian variety A, and assume 
that 6 is a subgroup of A as an abstract group. 
Then B has the structure of an Abelian variety 
whose law of composition is induced by that 
of A, and B is called an Abelian subvariety of 
A. More generally, when an algebraic subset ® 
is a subgroup of A, then the component B of S 
containing the identity element is an Abelian 
subvariety, and ® is a union of B and a finite 
number of cosets of B. When A is defined over 
a field k, then any Abelian subvariety of A is 
defined over a finite ^separable extension of k 
(W. L. Chow’s theorem). An Abelian variety 
A is called simple if A has no Abelian sub- 
varieties other than A itself and 0. 

Every trational mapping of an algebraic 
variety V into an Abelian variety is defined at 
each simple point of V. This implies that an 
Abelian variety is ^absolutely minimal. 

C. Homomorphisms 

A rational mapping of an Abelian variety A 
into an Abelian variety B is called a rational 
homomorphism (or simply homomorphism) if / 
is a group homomorphism. Let F be a rational 
mapping of A into B; then F can be uniquely 
expressed as follows: F{x) = Fq{x) + F{0) (xeT), 
where Fq is a homomorphism and F(0) is the 
image of the unit element 0 of A. Hence the 
structure of an Abelian variety (as a group 
variety) is uniquely determined by the underly- 
ing algebraic- variety structure. 

When a rational homomorphism / is bira- 
tional, / is called a birational isomorphism (or 
simply isomorphism). It is clear that a rational 
isomorphism is an abstract isomorphism, but 
the converse is not necessarily true. Let A, B 
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be two Abelian varieties. We say that A is 
isogenous to B if the dimension of A is equal to 
that of B and there exists a surjective homo- 
morphism of A onto B, or equivalently, if there 
exists a surjective homomorphism of A onto B 
whose kernel is finite. The relation of isogeny 
is an equivalence relation. For an Abelian 
variety A and an Abelian subvariety X of A 
there is an Abelian subvariety Y of A such 
that the natural homomorphism V x 7^ 

A\{x, y)^x + y is an isogeny (Poincare’s com- 
plete reducibility theorem). In particular, every 
Abelian variety is isogenous to a product of 
simple Abelian varieties that are determined 
uniquely up to isogeny and order. 

Let A, B be two Abelian varieties; we denote 
by Hom(A, B) the additive group of rational 
homomorphisms of A into B. When a rational 
homomorphism I is surjective, then the degree 
v(A) of A is defined by /l(T) = v(2)B as talgebraic 
cycles. If /I is an isogeny, then v(A)#0, and the 
order of the kernel { 1 1 f e A , = 0} is at most 

v{X) and equal to v{X) if and only if X is tsepa- 
rable. The additive group Hom(A, B) is shown 
to be free of rank ^ 4 dim A ■ dim B by the /-adic 
representation (^ Section E). If A = B, then 
Hom(T, A) has a ring structure; it is called the 
ring of endomorphisms (or endomorphism ring) 
of A and is denoted by 91(A). The tensor prod- 
uct 9 Iq(A) 91(A) ® Q, where Q is the field 
of rational numbers, is an tassociative alge- 
bra over Q. If A is simple, then 91 q(A) is a 
tdivision algebra. More generally, 9Io(A) is iso- 
morphic to a direct product of some ttotal 
matrix algebras over division algebras; thus 
9lo(A) is tsemisimple. In particular, if A is 1- 
dimensional (in other words, if A is an telliptic 
curve), the types of 9to(A) are well known; 
when the characteristic p = 0, then 91 q(A) is 
either the field of rational numbers or an 
imaginary quadratic field. When p>0, aside 
from these two fields, we have a ^quaternion 
algebra over Q as a possible type of 9Io(A). 

Let /c be a finite field with q elements. An 
algebraic integer is called a Weil number for q 
if every conjugate of it has absolute value ^/q. 
If A is an Abelian variety defined and simple 
over k, the qth power endomorphism of A : 
determines a conjugacy class of Weil 
numbers for q, as Weil showed (-^ 450 Zeta 
Functions). Moreover, we have the following 
classification theorem (J. Tate, T. Honda): 
There is a one-to-one correspondence between 
the set of all k-isogeny classes of k-simple 
Abelian varieties over k and the set of all con- 
jugacy classes of Weil numbers for q. Tate also 
determined the structure of the division alge- 
bra 9Io(A) over Q, which is described in terms 
of the decomposition of the ^th power endo- 
morphism into prime ideals. 
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D. Divisors 

Let ® be the additive group of Mivisors on an 
Abelian variety A and ©„ be the subgroup of 
divisors that are talgebraically equivalent to 0. 
Then the factor group ©/©a has no ttorsion 
part; this implies that for an Abelian variety 
tnumerical equivalence coincides with talge- 
braic equivalence. We denote this relation 
by = . Given an element a of A, the translation 
T„:Aax^x + aeA gives a tbirational trans- 
formation, which is everywhere tbiregular, on 
the underlying variety of A; we denote by 
the image of a divisor X on A. Then V = 0 if 
and only if X^ is tlinearly equivalent to X for 
each point a of A. The tAlbanese variety of an 
Abelian variety A is A itself, and the t Picard 
variety A of A is isogenous to A. In particular, 
for the tJacobian variety J of an algebraic 
curve, J is isomorphic to J itself. The Picard 
variety A of A is isomorphic to A (duality 
theorem). Let A be a divisor on A; the map- 
ping fl^the ^linear equivalence class of the 
divisor — AT, ae A, is a rational homomor- 
phism of A into A, and we denote it by (p^. If 
q>x — 0, then AT s 0, and vice versa; hence we 
have an additive map of ©/©„ into Hom(A, A). 
If is surjective, we say that X is nondegener- 
ate. A tpositive divisor X is nondegenerate if 
and only if X is ^ample, and then nX is tvery 
ample for n^3. There always exist positive 
nondegenerate divisors on an Abelian variety; 
therefore an Abelian variety is a tprojective 
variety. For a given divisor X on A, we can 
find n suitable points Mi , . . . , u„, where n is the 
dimension of A, so that the intersection prod- 
uct ... • is defined. We denote by (A‘"’) 
the tdegree of the zero cycle • . . . ■ A„ . If X 
is positive nondegenerate, then the dimension 
1{X) of the tdefining module of the tcomplete 
linear system determined by X is equal to 
(X^"’)/nl (Poincarfe’s theorem). Furthermore, the 
degree v{(px) of <Px, where X is any divisor on 
A, is given by the formula v(^;f) = ((A*"V«!)^ 
(Frobenius’s theorem). 

For a nondegenerate divisor A on A there is 
a unique integer i = i{X), 0 ^ i ^ dim A, called 
the index of A, such that //^(A, C^(A)) = 0 for 
p i and H‘(A, A)) ^ 0. We have i{ - A) = 

dim A — i(A) and /(A) = 0 if and only if A is 
ample. 

E. /-adic Representations 

Let A be an Abelian variety of dimension n. 
For a given prime number I, let ©,(A) denote 
the group of points on A whose order is a 
power of /. If I is different from the character- 
istic of the base field of A, then the group 
©,(A) is isomorphic to the direct product of 2n 



factor groups Qi/Zi, where Q, is the field of l- 
adic numbers and Z, is the group of /-adic 
integers 439 Valuations). We call such an 
isomorphism the /-adic coordinate system of 
©,(A). Now let A be a rational homomorphism 
of A into an Abelian variety B of dimension m. 
Then we can see that /. induces a homomor- 
phism of ©,(A) into ©i(5). This shows that by 
placing /-adic coordinate systems on ©;(A) and 
©,(B) respectively, we get a matrix representa- 
tion M,(A) of A with 2m rows and 2n columns. 
The representation /l^M,{/l) is faithful, and 
Mi{X) is called the /-adic representation of 2. 
When A — B, then /l-^M;(A) is a faithful repre- 
sentation of the ring of endomorphisms 91(A). 
This representation can be naturally extended 
to the representation of the algebra 9lo(A); 
the characteristic polynomial of the /-adic 
representation M,{X) (where 2 is an element 
of 9 Iq(A)) is a polynomial with coefficients in 
Q. Moreover, the polynomial does not depend 
on the choice of the prime number /. When 
/S.6 91(A), then v{2) is equal to detM/(A). The 
trace of M;(a) is usually written as ct(A). 

Let A be a rational homomorphism of A 
into B and F be a divisor on B. Then by the 
correspondence cl(F)->cl(/“^(F)), we obtain a 
rational homomorphism of B into A, called the 
transpose of A and denoted by 'A, where cl 
means the linear equivalence class (^16 Alge- 
braic Varieties M). If A is a nondegenerate 
divisor on A, then the composition map /?: A^ 
A o^'(px'■A-^A and the canonical isomorphism 
A ^ A satisfy the equality po(px — v(cpj-)(5 
(5 = the identity mapping of A). We denote by 
(Px'^ the element {\/v{(px)) (i in Hom(A, A) ® Q. 
The correspondence a^a', cl' — (px^ o^ci.o(px 
(ae9Io(A)) is an ^involution of 9Io(A) and 
is of order 1 or 2. If a ^ 0, then o{a' o a) > 0 
(Castelnuovo’s lemma). A. Weil was the first to 
recognize the importance of this theorem in 
connection with tRiemann’s hypothesis on 
■^congruence zeta functions. 

F. Differential Forms 

A ^differential form co on an Abelian variety A 
of dimension n is called invariant if 7'*<u = m 
for every point aeA, where 7^ is the trans- 
lation by a (^ Section D). The differential form 
^of the first kind is invariant, and conversely, 
every invariant differential form is of the first 
kind. Let K be the ^universal domain and 
K{A) be the tfunction field of A. The set of the 
linear differential forms of the first kind on A 
is a linear space over K of dimension n, and its 
basis becomes a basis over K{A) of the linear 
space consisting of all linear differential forms 
on A. An invariant derivation on A is a deriva- 
tion D in K(A) satisfying (Df) oT^ = D{ f o TJ 




7 



for any element / of K{A) and every point a of 

For a linear differential form co — Zfdu, 
and a derivation D, we put (^co,Dy — 'ZifiDUj. 
Then <cu, D} is a bilinear form in a> and D. A 
derivation D is invariant if and only if <(to, D) 
is a constant function for every invariant linear 
differential form co. Similarly, a linear differen- 
tial form CO is invariant if and only if <(U,D> is 
a constant function for every invariant deriva- 
tion D. The linear space of invariant linear 
differential forms and that of invariant deriva- 
tions are dual to each other with respect to 
the bilinear form {to, D}. 

Now consider the case when the character- 
istic p of the universal domain is positive. The 
automorphism a->a^, aeK, of the universal 
domain K induces a group isomorphism of A; 
we denote by A'^ the image of A and by the 
image of a point x of A. The image A” is an 
Abelian variety, and the group isomorphism 
7i:x^x^, xeA, is an isogeny of A onto A”. Let 
B be another Abelian variety and let ^ be an 
isogeny of A onto B. If there is an isogeny p: 
B-*A’^ such that n — poX, then we say that X is 
of height 1. The function field K(B) of B can be 
considered as a subfield of the function field 
K(A) of A by the mapping X. If X is of height 1 
and v{X) — p, there exists an invariant deriva- 
tion D of K{A) with the constant field K{B), 
uniquely determined up to constants. More- 
over, we can choose D so that D‘’ — DorD’’ — 0. 
In the first case X is said to be of type (ij); in 
the second case it is said to be of type (j^). An 
isogeny whose degree is a prime different from 
the characteristic p is said to be of type (sj, 
and a tseparable isogeny whose degree is p 
is said to be of type (S 2 ). Any isogeny can be 
written as a product of isogenies of these four 
types. 

G. Polarized Abelian Varieties 

Let X be a divisor on an Abelian variety A; we 
denote by X the class of divisors X' such that 
mX = m'X' for suitably chosen positive inte- 
gers m, in'. When the class X contains positive 
nondegenerate divisors, we say that X deter- 
mines a polarization on A, and the couple 
{A, X) is called a polarized Abelian variety. In 
particular, if A is a tJacobian variety whose 
polarization X is determined by a theta divi- 
sor, we call {A, X) the canonically polarized 
Jacobian variety. If an endomorphism a of A 
keeps the polarization invariant, i.e., if the 
class determined by a “HA") coincides with the 
class X, then a is called an endomorphism of 
the polarized Abelian variety (A, X). In partic- 
ular, if a is an automorphism of A, then we 
say that a is an automorphism of {A, X). The 
group of all automorphisms of a polarized 
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Abelian variety is finite. In particular, the 
group of automorphisms of a canonically 
polarized Jacobian variety is finite. Hence 
follows the famous theorem concerning the 
finiteness of the group of automorphisms of an 
^algebraic curve of genus not less than 2. 

On the other hand, the algebraic equiva- 
lence class of a nondegenerate divisor is called 
an inhomogeneous polarization. (The above 
polarization is then sometimes called a homo- 
geneous polarization.) An inhomogeneous 
polarization X determines an isogeny cpx : 
A^A uniquely. If cp^ is an isomorphism, 
the polarization is called principal. An endo- 
morphism of an inhomogeneously polarized 
Abelian variety can be defined similarly. 

H, Analytic Theory 

For the rest of this article we take the complex 
number field C as the universal domain, and in 
this case we can utilize analytic and topolog- 
ical methods. 

Let C" be an n-dimensional vector space 
over C. In a natural way, the space C" be- 
comes a 2n-dimensional vector space R^" 
over the real field R, and the mapping J :z~* 
z, zeC", is an R-linear automorphism 
of R^" such that — 1. Conversely, if for an 
even-dimensional R-vector space R^" such 
a mapping J is given, then by putting (a -t- 
b)x — ax + bJx (xeR^"; a,beR), we can 
introduce an n-dimensional complex linear 
structure into R^". We then say that J deter- 
mines a complex structure on R^"; we denote 
by C" = (R^”, J) the space having the complex 
structure determined by J. Let (Ui , . . . , oj 2 „ be 
2n R linearly independent points on C" = 

(R^", J). Then the lattice D generated by 
these points is discrete, and the factor group 
T"=:C7D is a complex torus of dimension n. 
We fix a basis of C" and introduce a complex 
coordinate system on C". Utilizing the basis 
oji , . . . , ( 02 „ of R^", we also introduce a real 
coordinate system on R^". We then obtain an 
nx2n matrix Q = (o;y), where the (coif, ...,cu„;) 
are the complex coordinates of cop, the ma- 
trix Q is called the period matrix of T". Let 
(z, , ... ,z„) be the complex coordinates of 
a point z e C" and (x j , . . . , X 2 „) be the real 
coordinates. Then we have ‘(zj , . . . , z„) = 

Q‘(xi, ...,X 2 „). If we let the same symbol J 
stand for the representation matrix of the 
linear transformation J with respect to the 
basis coi , . . . , C 02 „, then we have ^ 

The underlying real Lie group of T" is a 2n- 
dimensional (real) ttorus group (R/Z)^". Hence 
the tPoincare polynomial of T" is (1 -l-x)^". 

Any tHermitian metric on C" (as a vector 
space) induces a +flat tKahler metric; hence the 
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Hodge numbers Kahler Manifolds C) are 

I. Theta Functions 

A holomorphic function f{z) on C" = (R", J) is 
called a theta function if for every deD we 
have/(z + if)=/(z)exp(/j(z) + cJ, where /^(z) is 
a linear form on C" which, as for q, depends 
on d. The set of zeros of a theta function /, 
which we write as (/), determines an analytic 
divisor on T". Conversely, for every effective 
analytic divisor X on T", there exists a theta 
function / such that { f) = X. With respect to 
the real coordinate system x^, deter- 

mined by the basis co^, co 2 „, we can find In 
X 2n matrices A, and a 1 x 2« matrix b, 
with elements in C, such that the transforma- 
tion formula f{x + a) = /(x)exp(27r,y — 

+ jaAQa + ^ba)) (where A = Aq (modZ), ‘Aq 
= A q) holds for every 1 x 2n matrix a whose 
elements are rational integers. Moreover, if 
we put £ = A — ‘A, then E is an talternating 
matrix whose elements are rational integers, 
and S = £J is a ^positive semidefmite sym- 
metric matrix. Conversely, if there exists such 
an alternating matrix £ we can find a theta 
function. (There does exist, however, a com- 
plex torus on which no theta function exists 
other than trivial ones, i.e., ones of the form 
exp(<p(z)), where <p(z) is a polynomial of degree 
at most 2.) 

A theta function / is called nondegenerate if 
it cannot be a function of n — 1 complex vari- 
ables, and / is nondegenerate if and only if the 
matrix S = £J is tpositive definite. A complex 
torus has the structure of an Abelian variety if 
and only if there exists a nondegenerate theta 
function, i.e., if and only if there exists an 
alternating matrix £ whose elements are ra- 
tional integers such that EJ is a positive de- 
finite symmetric matrix. The latter condition 
is satisfied if and only if there exists an alter- 
nating matrix £ whose elements are rational 
integers such that Q‘£"“Q = 0, 

>0 (positive definite Hermitian matrix). In 
particular, a period matrix Q satisfying these 
conditions is called a Riemann matrix, and the 
rational matrix *£“' is called the principal 
matrix belonging to Q. 

Determining a polarization on an Abelian 
variety can be reduced to designating a class of 
principal matrices obtained from a principal 
matrix by multiplying by positive integers. Let 
A be a positive divisor on an Abelian variety 
T" = C"/D, and let / be a theta function such 
that {f) — X. Then the divisor X is nondegen- 
erate if and only if the theta function / is non- 
degenerate, and the latter statement holds if 
and only if the alternating matrix £ obtained 



from / is hnvertible. For a given principal 
matrix ‘£~^ we can choose suitable coordinate 
systems of C" and D so that = F) and E~ 
/ 0 A\ 

I ^ ^ I, where /„ is a unit matrix and A is 

a diagonal matrix whose elements are telemen- 
tary divisors of £. In these situations, AF is 
symmetric and its imaginary part positive 
definite, i.e., AF is a point of the tSiegel upper 
half-space ®„. Thus to the polarized Abelian 
variety (where the polarization is determined 
by ’£“^) there corresponds a point of S„. This 
gives a one-to-one correspondence between 
the isomorphism classes of Abelian varieties 
inhomogeneously polarized by a principal 
matrix with given elementary divisors and the 
points of the factor space S„/T„(A), where 
r„(A) is a subgroup that is ^commensurable 
to the Siegel tmodular group of degree n and 
operates on discontinuously. The case 
of principal polarization corresponds to the 
case when A^/„, and then F„(A) is the Siegel 
modular group itself. S„/T„(A) is the coarse 
tmoduli space of Abelian varieties, polarized 
as above; the projective embedding of S„/T„(A) 
is given by means of +Siegel modular forms. 

J. Abelian Functions 

Meromorphic periodic functions on C" with 
periods (Ui , . . . , C 02 „ (i.e., meromorphic func- 
tions on T = C"/D, D = Zo)j 0 . . . ® Zo> 2 „) are 
called Abelian functions with periods D (or on 
T). The quotient of two theta functions having 
the same periods and the same transforma- 
tion formulas is an Abelian function, and con- 
versely any Abelian function can be written as 
such. All Abelian functions on T form a field 
C(T) called the Abelian function field. If T has 
the structure of an Abelian variety, then C(T) 
coincides with the field of rational functions on 
T. In general, for any complex torus T there 
are an Abelian variety T' (possibly of dimO) 
and a surjective homomorphism A:T->T' 
which induces an isomorphism 2*;C(T')z:i 
C(T). Such a pair (T',/l:T->T') is unique up 
to isomorphism, and is called the talgebraic 
reduction of T. 

K. Homomorphisms 

Let C"‘ = (R^"', Jj) (i = 1, 2) be complex linear 
spaces; an R-linear mapping /:R^"i^R ^"2 is 
C-linear if and only if the relation foJ^ — 

J 2 o/ holds. Let £), be tlattice groups of C"‘ for 
i — 1, 2. If a C-linear mapping A:C”i->^C"2 
satisfies A(Di) cz Dj, then A induces a com- 
plex analytic homomorphism of Tj — C"VDj 
to Tj — C"VD 2 - Conversely, any complex 
analytic homomorphism of Tj to T 2 is ob- 
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tained in this way. Let T, and Tj be Abelian 
varieties, and let =(co*i'*, and Qj 

= (cop*, . . . , be their Riemann matrices. 
Then for a homomorphism a:T^-*T2, we 
can find a representation matrix W{k) with 
complex coefficients; and with respect to the 
real coordinate systems . . . , CO2V,) 

we can find a representation 
matrix M{a) with coefficients in Z such that 
lV(/l)Qj =0.2^ W- Conversely, if for a complex 
matrix W there is a matrix M, with coefficients 
in Z, satisfying the relation given above, then 
W gives a homomorphism of to T2. The 
above equation is called Hurwitz’s relation. 

The notion of /-adic coordinate system, which 
is valid for a general characteristic, corre- 
sponds to that of the lattice group, and the l- 
adic representation Mj{a) of a is the abstrac- 
tion of the integral representation M(/l). 

L. Abelian Integrals 

Let 91 be a compact tRiemann surface of genus 
11 Algebraic Functions) and let co be 
a sum of ^Abelian differentials of the first kind 
or of the second kind. Then the '’'period of to 
along a cycle 7 depends only on the thomology 
class of 7. The set of all differentials of the first 
kind forms a complex linear space of dimen- 
sion g; we denote it by Let P be a point 
and Fq a fixed point of R; then we denote by 
u(P) the vector integral (JJ^„tOi, ...,Jp^fOg), 
where (to, , . . . , to^) is a basis of l)o and the path 
from Fq to F is common to every integral. The 
correspondence F-yu(F) is not a single- valued 
mapping; the totality of differences of values 
of u(F) coincides with the group D consisting 
of periods ( tu, , . . . , J,, to,), where 7 varies over 
all cycles. Let a set of cycles {7, , . . . , 72^} be a 
basis of the homology group, with coefficients 
in Z; then Ig column vectors of the gx2g 
matrix Q — (to,^), to^ are linearly inde- 
pendent over R. Since the group D coincides 
with the set of linear combinations, with co- 
efficients in Z, of column vectors of fi, this D 
is a tlattice of rank 2g, and the matrix Q is a 
period matrix of the complex torus T^ = C^/D 
of dimension g. 

For a basis of the homology group with 
coefficients in Z, we take ^normal sections 
a,, ...,0Cg, oCg+j, ...,cc 2 gO[ 91, and let the same 
symbol Q stand for the period matrix (cUy), 

Wy = jj.a;,-; then we have 

QE^Q = 0, ^^QE^Q>0 

(positive definite Hermitian matrix), where 

/ 0 

£ = ^ ^ (with 7g the unit matrix). 

This implies that £ is a principal matrix be- 



longing to Q; we call the equality and inequal- 
ity just given Riemann’s period relation and 
Riemann’s period inequality, respectively. Fur- 
thermore, we can choose a suitable basis of Dq 
so that the period matrix Q is of the form 
(Ig, F) with £e S3 (^ Section T). We consider 
the function 9(u), u = (u , , . . . , Ug), defined by an 
infinite series 

'^(w) — ^exp(27iA“l (m‘M-f ^mf‘m)), 

m 

where the sum is taken over all row vectors 
m = (m 1 , . . . , rUg) with coefficients in Z. If u is in 
a bounded region, then the series for 9{u) is 
uniformly and absolutely convergent. Hence 
9{u) is a holomorphic function of u. This is 
a theta function corresponding to the princi- 
pal matrix £ = ( ® ) and is called the 

Riemann theta function. As i9(u) is nondegen- 
erate, the complex torus T® = C®/D has the 
structure of an Abelian variety. If we regard 
the Riemann surface 91 as an algebraic curve, 
this Abelian variety T® is precisely the Jaco- 
bian variety of the curve 91. The collection of 
zeros of i9(w), a divisor on T®, defines the ca- 
nonical polarization on the Jacobian variety 
(^ Section G). 

The correspondence F-^u{F), pe'H induces 
a well-defined mapping of 91 into T® = 

C®/D. Moreover, if we set <p(/4) = Zj=i (p{Pj) — 
<p(Qj) for any divisor A F, . . . FJQ , . . .Q, 
of degree 0, then (p{A) is a point on T® = C®/D, 
which is represented by the vector (Lj=i Jq.cj,, 

• - • of C®, and the mapping A ->■ 

cp{A) is a homomorphism of the group (6 q of 
divisors of degree 0 onto T® = C®/D. The kernel 
of this homomorphism coincides with the 
group of tprincipal divisors (Abel’s theo- 
rem). Hence a divisor A ^ £, . . . FJQ , . . . Q,. of 
degree 0 is a divisor of some function if and 
only if we have to,- = 0 (mod D) (/ = 

...,g), or equivalently, the left-hand side is 
0 for a suitable path. 

Given g fixed points P, , . . . , on 91 and 
given (w, , . .. , Ug) as any vector of C®, the prob- 
lem of finding g points Q\,--,Qg satisfying 
relations jpjTo, = u,- (mod D), j = 1, . . . , is 
called Jacobi’s inverse problem. To solve the 
problem, we take a divisor A of degree 0 such 
that the class (p{A) (modD) is represented by 
(u , , . . . , Ug); then, by virtue of the tRiemann- 
Roch theorem, there exists a divisor Q, ...Qg 
satisfying 

A = G,...0g/P,...Pg(mod®,). 

Abel’s theorem implies that the set of points 
{Q 1 , • • • , Qg} is a solution of Jacobi’s problem. 
Moreover, for general (u,, ...,Ug), the solution 
is unique; i.e., there exists a subvariety S of 
dimension t/ — 2 on C® such that the solution is 
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unique up to order if and only if (uj , . . . , 
does not lie on In particular, if every point 
Pi coincides with the fixed point Pq that ap- 
peared in the definition of u(P), the subvariety 
on T® = CVD determined by 5 is obtained in 
the following way: Let W^ + ... + \V 2 g -2 be a 
canonical divisor on 91, and put c = + ... 

+ ^(^29-2)- Then the locus of points c — (p{R^) 
— ... — (p{Rg_ 2 ) (where 0 — 2 points R^,...,Rg 2 
are taken independently over all points of 91) is 
the desired subvariety. 

In terms of the theory of complex manifolds 
the above result can be rewritten as follows: Let 
S^(91) be the symmetric product of 91 and 3E the 
subvariety of T®=:CVD induced from Then 
the holomorphic map S®((/)):S®(91)^T® induced 
from 0?:91^T^ is bimeromorphic and isomor- 
phic outside that 3£ whose elements corre- 
spond to effective divisor classes of degree g 
contained in canonical divisors on 91. 

M. Elementary Abelian Functions 

Let z be a nonconstant meromorphic func- 
tion on 91. Then for any u~{ui, ...,Ug)eC^ 
that does not lie on 5, there exist g points 
Qi,...,Qg (uniquely determined up to order) as 
the solution of Jacobi’s problem. Therefore 
the elementary symmetric functions 

si(«;z)= Z ziQjl 

j=i 

.S2 (m; z) = ^ z{Q'i)z{Qj), . . . , .Sg(u; z) - 0 ^(Qj) 

i<j 9=1 

are well defined if u lies outside the variety 
of dimension g — 2. Each function s,(u; z), re- 
garded as a function of u, can be extended 
uniquely to an Abelian function in the whole 
space C®; the function is denoted by the same 
symbol Sj(u;z). The Abelian functions Si(u;z), 

. . . , Sg(u; z) are called the elementary Abelian 
functions obtained from z. 

Now let K be the field of Abelian functions 
on C7D and k the field of meromorphic func- 
tions on 91; then the dimension of K over C is 
g, and [X : C(si (u; z), . . . , Sg(u; z))] = r^, where r 
is the degree of the function z that is given by 
[k:C(z)]. Moreover, if we take any function w 
such that k = C(z, w), then we have 

X = C(s, (u; z), . . . , Sg{u; z); .S j (a; w), . . . , Sg{u; w)), 

where s, (u; w), . . . , Sg{u; w) are the elementary 
Abelian functions obtained from w. 

We can write any elementary Abelian func- 
tion as a rational function of Riemann theta 
functions; therefore any Abelian function can 
be written as a rational function of Riemann 
theta functions. Furthermore, if u and v are 
variable vectors, then Sj(u -I- v; z) can be repre- 



sented as an algebraic function of s, (u; z), . . . , 
Sg(u; z), Si(v; z), ..., Sg(v; z); i.e., we can choose 
a suitable polynomial //,(Z; Xj , . . . , X,; Yj , 
...,Yg) with coefficients in C so that //i(s,(u -I- 
v; z); Si (u; z), ..., Sg(u; z); Si (v; z), . . . , 5^(1;; z)) — 0. 
This algebraic addition formula with respect 
to the elementary Abelian functions Sj(u;z), i = 

I, ...,0, is a function-theoretic interpretation 
of the fact that the addition map X x X-> 

X :(x,y)-+x + y is a tmorphism of algebraic 
varieties. 

As the study of ^^Abelian integrals of the first 
kind led us to the theory of Jacobian varieties, 
tAbelian integrals of the second and the third 
kind give rise to the theory of tgeneralized 
Jacobian varieties (^ 9 Algebraic Curves). 

The theory of Abelian varieties has signifi- 
cant applications to number theory, as shown 
by the following examples: the theory of tun- 
ramified Abelian extensions with respect to a 
function field of several variables defined over 
a finite field (S. Lang), the theory of heights of 
points on an Abelian variety (Weil, A. Neron, 

J. Tate), and the theory of complex multipli- 
cation (^ 73 Complex Multiplication) in the 
case of higher dimensions (Y. Taniyama, G. 
Shimura). 

N. Some Recent Results 

(1) Level Structure, Moduli of Abelian 
Varieties. Let A be an Abelian variety over k 
of dimension g and n a positive integer which 
the characteristic of k does not divide. A level 
n structure on A is defined to be a set of 2g 
points (Tj , . . . , fTjg on A which form a basis for 
the group of points of order n on A. 

Let A(g, d, n; k) be the set of triples: (i) an 
Abelian variety A over k of dimension g, 

(ii) an inhomogeneous polarization X on A 
with v{cpx) = d^, and (iii) a level n structure 
(Ti ,..., (Tjg of A, all up to isomorphisms. Simi- 
larly we can define A{g,d, n; S) for Abelian 
schemes over a scheme S. The correspondence 
S-^A(g, d, n\ S) defines a functor ,^{g,d, n). 

D. Mumford has shown that there exists 
the tcoarse moduli scheme A(g,d, n) quasi- 
projective over Spec(Z[l/n]), and that it is even 
fine if n^3[6]. He used the technique of tHil- 
bert schemes and Ntable points (^16 Alge- 
braic Varieties). One of the key steps of his 
proof is to show that for an embedding <j)\A-^ 
P"* of degree r (i.e., the degree of (j)(A) in P"*) 
over an algebraically closed field k and a posi- 
tive integer n such that char(k) does not divide 

n and n > l)r, the point (<^(x,)),=j „i<, 

in (P™)"^® is stable with respect to the action 
of PGL{m), where the x,- are the points of order 
n on A (with an arbitrary order). Mumford 
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later showed another method of constructing 
the moduli of polarized Abelian varieties by 
using algebraic theta constants [7]. 

(2) Neron Minimal Models, Good and Stable 
Reduction. Let K be a discrete valuation ring 
with residue field k and quotient field K. For 
an Abelian variety A over K, there exists a 
smooth group scheme .a/ of finite type over S 
= Spec(R), called the Neron minimal model of 
A, such that for every scheme S' smooth over S 
there is a canonical isomorphism 

Homs(S', j/) = Homj;,{SK, A), 

where is the pullback of S' by Spec(K)-> 
Spec(R) (A. Neron, M. Raynaud). In particu- 
lar, we have = A. Denote by Aq the fiber of 
.a/ over the closed point of S. 

If is proper over S, we say that A has a 
good reduction at R. If the connected compo- 
nent Aq of Aq containing 0 has no unipotent 
radical (or equivalently, Aq is an extension of 
an Abelian variety by an talgebraic torus over 
a finite algebraic extension of k), we say that A 
has a stable reduction. If there is a finite sepa- 
rable extension K' of K with a prolongation R' 
of R to K' such that AXf^K' has good (stable) 
reduction, we say that A has a potential good 
(stable) reduction at R. Let be a separable 
closure of K and R a prolongation of R to 
K^. For a prime number l^ch'dvik), we have 
a canonical homomorphism p:Gal(KJX)-^ 
Aut(Cfi,(/l)), called a monodromy. Then A 
has potential good reduction if and only if the 
image of the tinertia group HR) by p is a finite 
group (J.-P. Serre and Tate). Every Abelian 
variety A over K has potential stable reduc- 
tion at R (stable reduction theorem, A. Gro- 
thendieck [16]). 

(3) Graded Ring of Theta Functions. If / is a 

theta function with a period system D, /" is 
also a theta function with the same period 
system for every positive n. We denote by S„ 
the vector space of the theta functions with the 
period system D subject to the same trans- 
formation law as /". If we denote by X the 
effective divisor on T = C7D defined by /, then 
S„ can be naturally considered as the tdefining 
module of the ^complete linear system of nX, 
and the dimension of S„ { — l{nX)) is equal to 
the product of the nonzero diagonal elements 
of uA (^ Section I) (Frobenius). If peS„, and 
heS„, then gheS^+„', hence S — 0„>o\ is a 
tgraded ring, which is normal and finitely 
generated. For m>2 and n^3, the product 
map S„ X is surjective (D. Mumford, 

S. Koizumi). If the elementary divisors of E 
can be divided by an integer ^ 4, the kernel of 
the natural graded mapping S(S,)->-S (where 
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S(Sj) denotes the tsymmetric algebra over Sj) 
is generated by the quadratic relations (i.e., the 
part of degree 2) for sufficiently large degrees 
(Mumford). Geometrically this means that if 
X = (/), /e Si , is nondegenerate, then, with the 
projective embedding i:T-^P''' defined by the 
complete linear system of X, i(T) is an inter- 
section of quadrics in containing i(T). 

Mumford has developed a theory of alge- 
braic theta functions that also works for the 
positive characteristic case [7] and has proved 
the above results in general. 
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4 (V.5) 

Additive Number Theory 

A. General Remarks 

Let N be the set of natural numbers, and let A, 

B, ... be subsets of N. The sum C = A -I- fi is 
defined as the set {c\ ce A, ceB, or c = a + b, 
aeA,beBj.A finite sum of subsets of N is 
defined similarly. If N is the sum A + ... + A 

{r times), then we say that A is a basis of order 
r in N. Let A(x) denote the number of integers 
in A that do not exceed x. The density of A 
is defined as infA(x)/x. If A(n)^an, B{n)^ 
(0<a, 1) for all neN, then we have 

(A + B)(n)^(min(l,a + /i))n. This result was 
stated by E. Landau (1937) without proof; A. J. 
Khinchin had given a proof (1932) for the case 
oc — P, and H. B. Mann (1942) and E. Artin and 
P. Scherk (1943) succeeded in proving the 
statement for the general case. Suppose that 
the densities of A and B are, respectively, a 
and 0. If B is a basis of finite order in N, then 
the density of A -I- B is greater than that of A 
(Khinchin and P. Erdos, 1936). Let P be the 
set of all prime numbers. Though the density 
of P is 0, the density of B -f- B is positive (L. G. 
Shnirel’man, 1930). Hence B is a basis of finite 
order in N; in other words, there exists a posi- 
tive integer r such that every natural number 
can be expressed as a sum of at most r primes. 
Though the density of the set Q of the kth 
powers of natural numbers is 0, there exists a 
positive integer s(/c) such that the sum Q + ... 



+ Q (s(k) times) is a set of positive density. L. 
K. Hua (1956) gave a simple proof of this fact, 
based on Yu. V. Linnik’s idea. It follows, 
therefore, that any natural number n can be 
expressed as a sum n = a j -h . . . + af , where 
UjeN, t^s{k). This result had already been 
shown by D. Hilbert (1909). 

An ancient method of finding prime num- 
bers is ^Eratosthenes’ sieve. V. Brun (1920) 
devised a new sieve method to express an 
arbitrary integer n as the sum of two integers 
n = a + b, where the number of prime factors 
of a and b is as small as possible. This method 
was improved by H. A. Rademacher (1924), 
Landau (1931), A. Bustab (1937), and others. 
Among these the method found by A. Selberg 
(1952) is notable (-^123 Distribution of Prime 
Numbers E). 



B. Farey Sequences 

Let I be a positive integer. We arrange in 
increasing order the set of all positive irreduc- 
ible fractions lying between 0 and 1 whose 
denominators do not exceed r. This sequence 
is called the Farey sequence of order x. For 
example, the Farey sequence of order 5 con- 
sists of 

0 1 1 1 2 1 3 2 3 4 1 
I’ 5’ 4’ 3’ 5’ 2’ 5’ 3’ 4’ 5’ I' 

A necessary and sufficient condition that a 
fraction ajb be directly followed by a fraction 
c/d in the Farey sequence of order n\sb + d^ 
n+\,bc — ad — 1. In this case the fraction 
(a + c)/{b + d) is called the mediant of ajb and 
c/d. Interpolating the Farey sequence of order 
n with such mediants {a + c)l{b + d) satisfying 
/)-|-<i = n+l,we obtain the Farey sequence 
of order n-|- 1. 

Let a/q be a fraction in the Farey sequence 
of order x, and a'/q', a"/q" be adjacent mem- 
bers of a/q in the sequence such that a'/q' < 
a/q < a" /q" . The interval \_{a' + a)/{q' + q), 

{a + a")/{q + q")'] is known as the Farey arc 
surrounding a/q. In particular, if a/^ — 0/1, 
then we set [ — 1/n, 1/n] to be the Farey arc 
surrounding 0/1, where « = [x] -h 1 ([ ] is the 
tGauss symbol). We can thus decompose the 
interval [ — 1/n, 1 — 1/n] into a disjoint union 
of Farey arcs. If a is contained in the Farey 
arc surrounding a/q, then \a — a/q\ < 1/gx. 
Therefore, for a given x ^ 1 and a real a, we 
can prove the existence of a/q such that 

(|u|,g)=l, 0<(j^x, \c(. — a/q\<\/qx. 

Let /(C) = S“=on„C'’ be a power series which 
is convergent in the disk |C| ^ 1. Then, by 
Cauchy’s integral formula, 
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a„=— I :^dC=\ 

27n-J„=ir^ ' Jo-"' 

To estimate such an integral, we sometimes 
utilize the decomposition of the interval [0, 1] 
into a disjoint union of Farey arcs as men- 
tioned above. This method is called the circle 
method, and the subdivision of the interval is 
known as the Farey dissection. 

Given a positive number c, a Farey arc 
around ajq is usually called a major arc (or 
basic interval) if q does not exceed the given 
bound c; otherwise, it is called a minor arc (or 
supplementary interval). Usually, the principal 
part of the previously mentioned integral is 
derived from the integral over the major arcs, 
and the residual part is provided by the in- 
tegral over the minor arcs. 



C. Goldbach’s Problem 



Goldbach’s problem is found in letters (1742) 
he exchanged with L. Euler. In them he stated 
that every positive integer can be expressed as 
the sum of primes. More precisely, he conjec- 
tured that any even integer not smaller than 6 
can be expressed as the sum of two odd primes 
and that any odd integer not smaller than 9 
can be expressed as the sum of three odd 
primes. 

1. M. Vinogradov (1937) proved that every 
sufficiently large odd integer can be expressed 
as the sum of three primes. Let V be a suffi- 
ciently large odd integer. If we write 



A{q,N) = 



<P'^(<?) 



Z exp 



(a,q)=l 



a 

— Ini-N 

q 



then the series S{N) = Y,qLi A{q, N) is ab- 
solutely convergent and is equal to 

n('+(7:rDT)n('-p2_3p+3)' 

It is known that S{N)>6/n^ for all N. If we 
denote by r{N) the number of solutions of 
N = Pi + P 2 + P 3 , then r(iV)~(iVV2(logiV)^)- 
S{N). To prove this, Vinogradov used the cir- 
cle method. He employed the ^'prime number 
theorem for arithmetic progressions to esti- 
mate the integrals over the major arcs and 
devised an ingenious method to estimate the 
series Sp^A-exp(27n'ap) in the computation of 
the integrals on the minor arcs. A finite or 
infinite sum of exponential functions such as 
this is called a trigonometric sum. More gener- 
ally, we consider trigonometric sums of several 
variables. Vinogradov provided detailed re- 
marks and calculations [6]. 

In the case of even integers, the problem is 
still unsolved, although J. G. van der Corput, 
T. Estermann, and N. G. Chudanov proved 



simultaneously (1938) that almost all even 
integers (i.e., except a set of density 0) can be 
expressed as the sum of two primes. For these 
problems, Linnik (1946) and Chudanov (1947) 
introduced function-theoretic methods. They 
obtained the density theorem concerning the 
zeros of L-series, and A. Zulauf (1952) con- 
tinued along the same lines. These methods 
had been suggested by G. H. Hardy and J. E. 
Littlewood (c. 1919), although they had as- 
sumed that the '•'extended Riemann hypothesis 
held. 

We denote by Pj a number having at most j 
prime factors counted with their multiplicity. 
Using his sieve method, Brun (1919) proved 
that every sufficiently large even integer 2n can 
be expressed in the form 2n — Pg + Pg. Radema- 
cher (1924) improved this result and obtained 
2n = P-j + P-j. Applying his new sieve method to 
this problem, Selberg (1950) proved that 2n = 
P 2 + Pi - On the other hand, using the '•'large 
sieve, A. Renyi proved that 2n = Pi+ P^ for 
some k. Afterwards, combining Richert’s sieve 
with-a large sieve, Chen Jing-Run (1973) 
proved that 2n = Pi+ P 2 for large 2n [1 1]. 



D. Polygonal Numbers 

Let m be an integer greater than 3, and let 

= 1, a„+i—a„ — {m — 2)n + l (n= 1,2, ... ). 

The sequence {a„} forms the system of polyg- 
onal numbers of order m. The genera! term 
of is given by n + j{m-2){n^-~n) (n = 
1,2,...). Such a„ are said to be triangular 
numbers if m = 3, square numbers if m — 4, and 
pentagonal numbers if m — 5. 

P. Fermat (1636) stated that every natural 
number can be expressed as the sum of m 
polygonal numbers of order m. This conjecture 
was proved by A. M. Legendre (1798) for m = 

3, by J. L. Lagrange (1772) for m — 4, and by 
A. L. Cauchy (1813) for the general case. With 
regard to Lagrange’s result, Legendre noticed 
that in order that a positive integer n be ex- 
pressed as the sum of three squares, it is neces- 
sary and sufficient that n not be of the form 
4‘’(8m + 7). 

Given a positive integer n, the number of 
integral solutions of the equation x^ + xl + ... 
-^-x^ — n is denoted by r^{n). For example, T2(5) 
^8. The ^generating function r 2 {n)n~^ 
can be expressed as ^ where 

the term corresponding to m = n = 0 is omitted; 
the function is equal to 4 Ck(.s), where Ck(^) is 
the tDedekind zeta function of the Gaussian 
number field K = Q(,y — 1). The equation 
4C/i:(s)==4C(s)L(.s, x) (where x(u) = ( — 4/n)) leads 
to 

GW-4Z'(-l)'"'■^"^ 
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where means the sum over all odd factors 
m of n. This result was obtained by C. G. J. 
Jacobi (1829). He also obtained the following 
formula: 

Mm) = 8 m, 

ni\nA/ni 

where Z' means the sum over all divisors m of 
n not divisible by 4 (Hardy and E. M. Wright, 
C. L. Siegel, 1964). Let ^ = exp(27i:iT) (lmi>0), 
and 



tt=i 

-(1 +2q + 2q‘^ + 2q‘^+ ...y. 

Hardy (1920) considered the variation of /(^) 
for q — aexp{2nih/k) (with 0<a<l)asa— >l;he 
obtained 

iog(^i) ' , 



where ^ = exp{2ni(h/k)j^). Furthermore, 
he constructed the singular series 



Ps(n) = 



_.v /2 (s/ 2 )-l oc. 

— I 

r(s/2) h ' 



Ai = l, 



= * Y. (\*)*exp( -27nJ”« 

(h.k)=l 

and showed that r^(n) = ps(n) for s = 5, 6, 7, and 
8. P. T. Bateman (1951) proved the same equa- 
tion for s = 3 and 4. If s > 9 then r^n) = pjn) + 
0(«M (Hardy, Littlewood, and S. Rama- 
nujan, c. 1919). The detailed exposition of this 
result is in the notes of A. Z. Walfisz (1952) 
and Rademacher [4]. H. D. Kloostermann 
(1926) and Estermann (1962) studied the equa- 
tion axf + bx 2 -hcx 2 + dxl — n, which led to a 
new field of study concerning the tKlooster- 
mann sum. For instance, estimates such as 



V cx + dix 

> exp 2m 

;c = l V P 






are obtained by using the theory of zeta func- 
tions in algebraic function fields in one vari- 
able (A. Weil, 1948) (^ 450 Zeta Functions). 



E. Waring’s Problem 

The first formulation of Waring’s problem is 
found in E. Waring, Meditationes algebraicae 
(1770), in which he discusses the problem of 
expressing an arbitrary positive integer as the 
sum of at most nine cubes or as the sum of at 
most nineteen biquadratics. Hilbert proved (^ 
Section A) that there exists a positive integer 
s(k) such that, for any integer N, the equation 

x'^ + X2 + ...+X^= N 



has a nonnegative integral solution if s^s{k). 
We denote by g{k) the least value of s{k), and 
by G(/c) the least value of s{k) for which the 
equation is solvable with at most finitely many 
exceptions of N. Research concerning g{k) and 
G(k) received its initial impetus from the circle 
method considered by Hardy and Littlewood, 
and it underwent considerable development in 
the works of H. Weyl and Vinogradov. 

Let /*s(A) be the number of solutions of the 
above equation. We then have 

f.(A)= I ( Z exp(27u'ax*) ) 

Jo / 

X exp( — 27a'Aa)i/a. 

If we make the Farey dissection, translating 
the interval [0, 1] slightly, then the main term 
of rJN) is provided by the integrals over major 
arcs, and the residual term is derived from the 
integrals over minor arcs. Let p be a prime, 
and let M(N,p') denote the number of solu- 
tions of the congruence equation 

-H xM • • ■ + = V(mod p'). 

Then limj^o^, M(V, p^)/p''*“‘' = Xp(N) is not zero, 
and the infinite product = con- 

verges for s^4^, where S{N) is larger than a 
positive constant which is determined inde- 
pendently of the choice of N. On the other 
hand, let 



S{a,q)= Z exp I 2m-x^ ), 
a:=o V q 



A(q,N) = q-’‘ Z Sia^qfoxp 

1 < rt ^ <7 

(a. 9) = ! 



, a 
2ni - N 
(I 



Then A{q,N) is absolutely convergent, 
and the sum is equal to S{N). According to 
Hua (1959) we have 



r,(N)~S(A) 



r(\ + i/ky 

r{s/k) 






provided that s > 2k^{2 log k + log log k + c). 
Next, if we denote by V{N,S) the volume of the 
closed region satisfying N ^x\ + x^ + ■■■ + 
x^^N +S \n the s-dimensional Euclidean 
space, then lim^^o V(N,d)/^ = Xao{^) exists and 
is equal to (T(l -i- l/k)Vr(s//c))/V*^'‘"L Hence we 
can show that the principal part of r^{N) is 
equal to the infinite product IlpXp(M» where p 
runs over all finite and infinite prime spots in 
Q. This is a generalization of the singular 
series studied by Hardy. 

With regard to g{k), there are studies by L. 

E. Dickson (1936) and others. It is easy to see 
that g(k) ^ 2M (3/2)'“ - 2 and that G(k)^k+\. 
It has been shown that j^(3) = 9, G(3)<7 (G. L. 
Watson, Linnik, 1947) and that g(4X37, 
G(4)= 16 (H. Davenport, 1939). More gener- 
ally, Vinogradov (1959) proved that 
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G(k) ^ 2k log k + 4k log log k 

+ 2k log log log k + ck. 



To prove this, Vinogradov (1934) introduced 
the following integral, which is closely related 
to the tprime number theorem: 



m- 



exp(27u’(ai X + a.2X^ 






Is 

dec ^ . . . dcCf^ . 



Hua (1949) improved Vinogradov’s result and 
proved that if s^\k{k+ l) + //c, then 

/(P)^(5s)5"'(logP)^'P^"''‘‘'‘+^'/2+^ 

where S==jk{k+ 1)(1 — \/k)’~K Concerning 
/(P), another notable approach was made by 
A. A. Karacuba and N. M. Korobov (1963). 
Further investigation proved that f(P) = 

+ if s^ck^logk 

(Vinogradov, Hua, 1959). The result is called 
the Vinogradov mean value theorem. 

There are many variations and generaliza- 
tions of this theorem. Vinogradov and Hua 
(1944) studied the problem of representing an 
arbitrary N as N = p\ + P 2 + ...+ (with p, 

prime). Hua (1937) and others also considered 
the problem of representing N as V=/(xi) + 
/(X 2 ) + . . . +f(x,), where /(x) is a given poly- 
nomial. Also, let 

n n n 

C(x,,X2,...,x„)=X Y Y QjkX.XjX^ 

1 = 1 J=1 fc = l 

be a homogeneous polynomial of degree 3 
with integral coefficients. Davenport (1963) 
proved that if 16, then the equation 
C(xj,X 2 , ...,x„) = 0 has at least one nontrivial 
integral solution. There are further develop- 
ments of the theory of representations of in- 
tegers by forms in many variables by V. A. 
Tartakovskii, H. Davenport, B. J. Birch, and 
D. J. Lewis. 



F. Additive Problems in an Algebraic Number 
Field 

Siegel (1922) considered the generalization of 
Hardy’s square sum problem to the case of an 
algebraic number field. He later (1945) studied 
the generalized Waring’s problem in an alge- 
braic number field K of finite degree: Let I 
be the principal order of K, and let be the 
subring of / generated by the kth powers of 
integers in K. It is easily seen that the index 
{I :Jk) is finite. Hence our concern regarding 
s{k) must be restricted to integers contained in 
Jj^. Another question is how to extend the 
Farey dissection to an algebraic number field. 
Siegel succeeded in solving these difficulties. 



His ingenuity is seen in his way of dealing with 
the minor arcs, which provided a stimulus to 
the research of T. Mitsui (1960). 

A generalization of Goldbach’s problem to 
the case of an algebraic number field was 
obtained by Mitsui (1960) and O. Korner 
(1961). 

As another extension of the Vinogradov 
three primes theorem, Mitsui (1971) proved 
the following theorem: Let K be an algebraic 
number field of degree n. Let C be the prin- 
cipal ideal class generated by a totally positive 
number in K, and P be the set of prime ideals 
of degree 1 contained in C. Let iV be a positive 
integer and I^iN) be the number of representa- 
tions of N as the sum of the norms of s prime 
ideals belonging to P, 

IAN)= Y 1’ (l^Ks). 

JV = (Vp J 4- ... + iVp_^ 

If N is sufficiently large and 3, we have the 
asymptotic formula 



h{N)-A,S{N) 



(logiV)* 



+ 0 



( _, \og\ogN \ 

\ (logiV)^"V 



where A, is a positive constant depending on s 
and K independent of N, and S{N) denotes the 
singular series. If s = N (mod 2D), where D is 
the discriminant of K, then S{N)^c>0, where 
c is a constant. Later this problem was ex- 
tended by Mitsui and T. Tatuzawa (1981). 
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Additive Processes 

A. General Remarks 

The study of sums of ^independent random 
variables has been one of the main topics in 
modern probability theory (^ 250 Limit 
Theorems in Probability Theory). Combin- 
ing the ideas of this study with the considera- 
tion of tstochastic processes with continuous 
time parameter, we get the notion of additive 
processes. 

B. Definitions and Fundamental Properties 

A real- valued ^stochastic process 
denoted by X(t){Q^t< oc) for the rest of this 
article, where for simplicity we assume that 
X (0) = 0, is called an additive process (or pro- 
cess with independent increments) if for any Iq < 
t] < ... <t„, A(t;) — A"(f;_i) (i= 1,2, ...,n) are 
^independent. An additive process is essentially 
the same as a tspatially homogeneous tMar- 
kov process (i.e., a Markov process on R* that 
is invariant under translations). When, for any 
h>0 and t>s, X{t + h}-- X{s + h) and X (t) — 
A(s) have the same law, i.e., the distribution 
law of 2f(f) — X(s) depends only on t — s, we 
call the additive process X(t) temporally homo- 
geneous. This is essentially the same notion as 
a temporally and spatially homogeneous Mar- 
kov process. 

Let X (t) be a given additive process. If f{t) is 
a function of t only, then clearly Yit) = X{t)~ 
f{t) is also an additive process, and we can 
choose /(t) in such a way that for every t>0 
and for every sequence (s„J.t), F(s„) con- 
verges with probability one. Here, lim T(s„) 
is independent of a particular choice of s„, 
and we denote it by F(t-) (F(t -(-)). We call 
such T(f) a central process and also say that 
Y(t) is obtained from X(t) by centering. This 
f{t) is given, for example, by the condition 
£(arc tan(X(f)— /({))) = 0. 



Let T(t) be a centered additive process. 

Then y(t — ) = T(r) = T (t -t- ) for all r > 0, except 
on an at most countable f-set S, and fsS is 
called a fixed point of discontinuity of Y(t)- 
Then Yi(f) — lim„^^ U„{t) exists with proba- 
bility 1, where 

U„{t)= X (y(s, + )-Y(s,-)) 

0 ^ i < n 
s,<( 

+ Y{t)-Y{t-)-C", 

C" being a constant determined by £(arctan 
V„(t)) = 0, andS = {s^} rj=l,2,...). Y,(0 = 
Y(r)— Yi(r) is a eentered additive process 
without any fixed point of discontinuity. Fur- 
thermore, Yi(f) and Y 2 (t) are independent. 

Thus we have a decomposition of Y(f): Y(r) = 
Yi(r)-l- Y 2 (r), where Yi(r) and Y 2 (t) are mutu- 
ally independent additive processes. The struc- 
ture of Yi(f) is simple, and it is not worthwhile 
to study its behavior in more detail. On the 
other hand, since Y 2 (r) is a centered additive 
process without any fixed point of discontinu- 
ity, it is an additive process that is '^continuous 
in probability. Let Yjit) be a ^separable modi- 
fication of Y2(0- Then the discontinuities of 
almost all sample functions of Y 2 (t) are of the 
first kind. If we set Y 2 *(t)= Y 2 (f-j-), Y 2 *(t) is a 
■•■modification of Y 2 (t), and almost all sample 
functions of Y 2 *(t) are right continuous and 
have a left-hand limit at every t. In the study 
of the process Yjft), it is always convenient to 
take such a modification. Thus we give the 
following general definition: An additive pro- 
cess is called a Levy process if it is continuous 
in probability and almost all sample functions 
are right continuous and have a left-hand limit 
at every t e [0, go) [3, 9]. 

The notions of additive processes and Levy 
processes can also be considered for R^-valued 
processes. 



C. Additive Processes and Infinitely Divisible 
Distributions 

Let X (t) be a Levy process and <t>s, (s < t) be 
the tdistribution of X(t) — X(s). Then <t>sj is an 
■•■infinitely divisible distribution (-^341 Proba- 
bility Measures G). Conversely, for a given 
infinitely divisible distribution fl> we can con- 
struct an essentially unique temporally homo- 
geneous Levy process X(f) such that coin- 
cides with the distribution of X(l). If X(f) is 
temporally homogeneous, the ^characteristic 
function (p^i(z) = of the distri- 

bution is given in the form — exp((t — 
s)t//(z)); hence the law of the process X(t) is 
completely determined by the function i/^(z). 

By the ■''Levy-Khinchin canonical form, i//(z) is 
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written in the form 



ij/(z) = imz — -z^ + 



IZU 

e'-"— 1 — r T ]n{du). 



1 “h 



( 1 ) 



where m, veR,v'^0, and n{du) is a nonnegative 



-n{du) 



measure on R— (0} such that , 

'-col+w" 

< 00. These m, v, and n(du) are uniquely deter- 
mined by tf/{z). 



D. Basic Additive Processes 

Wiener Process. When almost all sample func- 
tions of a Levy process A(t) are continuous, 
the distribution of — X(s) is a ^normal 
distribution. If, further, A(t) is temporally 
homogeneous, i//(z) has the form \j/{z) — imz — 
jvz^. In particular, if m = 0 and y= 1, then 
A(t) is called a Wiener process or Brownian 
motion. This stochastic process was introduced 
by N. Wiener (1923) as a mathematical model 
for the random movement of colloidal par- 
ticles first observed by a British botanist, R. 
Brown. This is one of the most fundamental 
and important stochastic processes in modern 
probability theory {-- 45 Brownian Motion). 

Poisson Processes. When almost all sample 
functions of a Levy process are increasing step 
functions with only jumps of size 1, the distri- 
bution of A'(t) — 2f(s) is a tPoisson distribution. 
If, further, X{t) is temporally homogeneous, 
i/^(z) in (1) has the form i//(z) = 2(e'^ — 1) (2>0), 
and X{t) is called a Poisson process. Let X{t) 
be a Poisson process, and let Tq, T^+T^, Tq-I- 
Ti + Tj,... be successive jumping times of a 
sample function X(t). Then Tq, 7j, T 2 , ... is 
a sequence of mutually independent ran- 
dom variables with the common exponential 
distribution P{Tedt) = dt. Conversely, 

given such a sequence {Tj,}, if we define X{t) = 
inf{n| Tq + Ti + ... + T„>t}, then A"(t) is a 
Poisson process. Thus, for example, the num- 
ber of telephone calls at a switchboard is a 
Poisson process when the intervals between 
successive calls can be regarded as inde- 
pendent and having a common exponential 
distribution. 



E. The Structure of the General Levy Process 

[8,9] 

In this section we restrict ourselves for sim- 
plicity to temporally homogeneous Levy pro- 
cesses. As we noted, the probability law of the 
process X{t) is determined by the function 
i//(z). The Levy-Khinchin formula (1) in a 



certain sense shows that i/^ is a combination of 
a Wiener process and Poisson processes. This 
fact can be seen more clearly from the Levy-Ito 
theorem, which states that the sample function 
of A(t) itself can be expressed as a composite 
of those of a Wiener process and Poisson 
processes. The Levy-Ito theorem actually 
implies formula (1) and, moreover, clarifies its 
probabilistic meaning. 

The Levy-Ito theorem can be summarized 
as follows; Let t/ be a Borel subset of R which 
has a positive distance from the origin, and let 
N{t, U) be the number of s such that A(s) — 

X(s — )eU, s^t. Then N{t, U) is a Poisson 
process. The expectation E{N{t, U)) can be 
written in the form tn(U), where n(U) defines 
a nonnegative Borel measure on R — {0}; fur- 
thermore, it satisfies 



1 -|- U' 



■n(du)< 00. 



Next, we set 






uN{t,du) 

u|>£ 

X (X(s)-A(s-)). 

\X{s)-X(s-)\^c 



Generally, S^{t) diverges as e|0. However, with 
probability one, a centered process 



S,(t)=-S,(t)-t —^n{du} 

Jn>£1+w 

converges uniformly in t on every finite inter- 
val as fij.0. Furthermore, A(t) — lim^ioS^jf) is 
continuous with probability one. However, a 
Levy process X{t) of which almost all sample 
functions are continuous has the form mt + ^ 
B{t), where m,v^0 are constants and B{t) is a 
Wiener process. Hence we have 

X{t) — mt + ^ B{t) 

uN{t,du) :rtn{du)Y (2) 

1 -l-u^ / 

Furthermore, we can show that if 




are disjoint, then 

B{t), N{t, L,), N{t, U 2 I U„) 

are mutually independent Levy processes. In 
particular, in (2) the terms are mutually inde- 
pendent. The m, v, and n{du) in (2) correspond, 
of course, to those in (1). Conversely, given m, 
V, and n{du), we can construct B{t) and N{t, U) 
with the properties above, and then (2) defines 
a Levy process which corresponds to if/{z) 
given by (1). The measure n{du) is called the 
Levy measure of A'(f). 
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F. Examples of L6vy Processes 



Compound Poisson Processes. A temporally 
homogeneous Levy process is called a com- 
pound Poisson process if almost all sample 
functions are step functions, namely, if i//(z) in 
(1) is given by 






J - 00 



n{du)<cc. 



If we set 0(du) — {l/A)n{du), then <h(du) is a 
probability distribution on R. <I) is the distri- 
bution of the size of jumps when they occur. 
A compound Poisson process is constructed 
in the following way: Let Tq, T, , T 2 , ; t/, , 
L 2 , ... be mutually independent random vari- 
ables such that 



P{T„edt) = Ae ^'dt, t>0, 

P((7„edu)-a)(du), 

and let 



^(0 — Cl + 1/2 -t- ... -h 
where 

n = inf(n | Tq + 7j -I- . . . -h T„ > t}. 

Then X{t) is a compound Poisson process. 
Thus the number of jumps of X{t) follows a 
Poisson process, and the size of each jump 
obeys the distribution O. 



Stable Processes. A temporally homogeneous 
Levy process X(t) is called a stable process if 
for every a > 0 we can find b>0 and c real 
such that the processes Xi{t) = X{at) and Ajjt) 
= hX{t) + ct are equivalent in law. It is called a 
strictly stable process if in the above c can be 
chosen to be 0. X (t) is a stable process (resp. a 
strictly stable process) if and only if the corre- 
sponding infinitely divisible distribution is a 
■^quasistable distribution (resp. ^stable distri- 
bution). The ■''exponent a (0<a^2) of the 
quasistable distribution is called the exponent 
(or index) of the stable process. in (1) cor- 
responding to a stable process is given as 
follows: 



ij/{z) = imz + C^ 






+ c_ 



ij/{z) = imz + C^ 



+ C_ 



du 

0<oc<l, 



|n|' 



1 — 



izu \ du 



1 +u^/u 






izu \ du 



\-\-u^ )\u 



r, OL- 



ij/{z) — imz + C^ 



du 



du 

+ C_ (e‘^‘^-l-izu)--^, \<a<2. 



V 

ij/{z) = imz — -z^, a — 2. 

Here m, C+ , C_ , and v are real constants such 
that C+ >0, C_ >0 with C+ + C^ >0 and y>0. 
ij/{z) corresponds to a strictly stable process if 
and only if m = 0 when a # 1 and C+ = C 
when a~l. The above ij/{z) is also expressed as 
follows: 

(i) if0<a^2, 1, then 



^(z) = zmz — Cq|z|“( 1 — i^tan 



2 |z| 



where 



Co = - if a = 2 
2 

and 



Cq— — (C+ +C_)r( — a)cos— , 



n C+-C_ 

— if a ^2; 

C++C_ 

(ii) if a = 1, then 

/ 2 z 

i/z(z) = zyz~Co|z| l+z-)3— log|z| , 
V 7t |z| 

where 

y — ct{C ^ -f- C_) 

u \du\ 

1 +u^Ju^J 

n C+ -C 

Co = -(C+-C_), ^ 

2 C,-hC_ 



When a = 2, it is thus essentially a Wiener 
process: X{t) = mt + y/v B{t), where B(t) is a 
Wiener process. When a = 1, it is called a 
Cauchy process, a symmetric Cauchy process if 
m — 0 and C+ = C_ or equivalently y = ^ = 0, or 
an asymmetric Cauchy process if /i 0. Gener- 
ally it is called a symmetric stable process if 
m — 0 and C+ =C_ or a = 2 and m = 0. In par- 
ticular, for the symmetric Cauchy process cor- 
responding to ij/(z)= —\z\, we have 



P(X{t)<x) = - 

K 



.t +y 



:dy. 



Next, when 0 < a < 1 and m — C^—0, almost 
all sample functions of Alt) are purely dis- 
continuous increasing functions (i.e., sums of 
positive jumps). In this case, X{t) is called a 

one-sided stable process of the exponent a (or 
subordinator of the exponent a). Now, if X{t) is 
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a symmetric stable process of the exponent 

(0</5^2) and F(t) is a subordinator of the 
exponent a which is independent of X (t), then 
Z{t) = X(Y{t)) is a symmetric stable process 
of the exponent a^. This operation is called 
a subordination and is closely related to the 
theory of tfractional powers of ^infinitesimal 
generators of semigroups (^261 Markov 
Processes) [2], 

A stable process X (t) is defined in a simi- 
lar way when A(t) takes values in an N- 
dimensional space In particular, if X{t) is 
a symmetric stable process of the exponent a 
(0<a^2) given by 



then for every bounded measurable function 
f{x) with compact support, we have 



f{x + X{t))dt 



r((N-«)/2) 

2^n^l^r{a/2) 







The right-hand side is the tRiesz potential of 
order a 338 Potential Theory). This fact 
is a generalization of a well-known rela- 
tion between Brownian motion and Newton- 
ian potential (-^ 45 Brownian Motion), and 
through this relation we can study several 
properties of sample functions and also com- 
pute various quantities related to stable pro- 
cesses [1]. 



G. Sample Path Properties of Levy Processes 

Let X{t) be a temporally homogeneous Levy 
process. For a Borel set B in R, the hitting 
time (Tg is defined by 

<TB = inf{t >0] A(t)eB}. 

Recurrence. X (t) is called recurrent if ng < oo 
a.s. for every nonempty open set B. Otherwise 
it is called transient. A(t) is recurrent if and 

r 1 

only if Re — ryrdz = 00 (Port and Stone, 

J-i -Mz) 

Ornstein). If, in particular, £(AT(1)) exists, it is 
recurrent if and only if £(A(1)) = 0. When X{t) 
is a stable process, it is recurrent if and only if 
a> 1, or a = 1 and /? = 0. 



Hitting Probabilities for Single Points. If B = 

{a}, Gg is denoted by OeR is said to be 
regular for A (t) if a,, = 0 a.s. Set C = (x e R | 

oo)>0}. The following result is due to 
Kesten [10]: 

(i) If f 0, then C = R and 0 is regular; 

(ii) If y = 0 and jli \u\n(du)^ oo, then either C 
= R and 0 is regular, or C = 0 and 0 is not 



regular according to whether 



R ^ 



1 



>(z) 



dz 



< 00 for all 2 > 0 or = 00 for some 2 > 0; 

(iii) If r = 0 and [Lj |u|n(du)< qo, then the 
following four cases are possible, where 
fl — m — Jru(1 + u^)~'-n{du) and S(n) is the sup- 
port of the Levy measure n: 

(a) C = 0 when a = 0, 

(b) C = (0, oo) when a>0 and S(n)c:(0, oo), 

(c) C = (— oo,0) when a<0 and S(n)c(— oo,0), 
and 

(d) C = R in the remaining case. 

For further properties of sample functions 

-[4]. 



H. Generalization of Additive Processes 

A temporally homogeneous Levy process is, as 
we have seen, essentially a temporally homo- 
geneous Markov process on R which is homo- 
geneous in space (i.e., invariant under transla- 
tions of the space). Thus, on a homogeneous 
space when homogeneity in space makes sense, 
we can generalize the notion of additive pro- 
cesses. Let M be a thomogeneous space with 
transformation group G. A temporally homo- 
geneous Markov process A(t) is called an 
invariant Markov process, (or homogeneous 
Markov process) if its system of ttransition 
probabilities {P{t,x,E)} satisfies P{t,x, E) = 
P{t,gx,gE) for all geG. Thus an additive pro- 
cess is exactly an invariant Markov process 
on R^ when G is the group of translations. G. 
A. Hunt determined all invariant Markov 
processes when M is a tLie group or a factor 
space of a Lie group [6]. 

Let G be a Lie group and A — A(G) be the 
(left-invariant) tLie algebra of G. Let G^ — 

GU {co} be a tone-point compactification of 
G, and C be the set of all continuous functions 
on Gf. We can define Yf{YeA,feC) as usual by 

T(/) = lim ^"“^'^~-^ , 
rio t 

rj{t)^exptY RJ{T)^ f{xa), 

when the limit exists uniformly. Let C 2 = 
|/eC| Y(Xf) exists for every X, FgA}. Let 
Ai, A 2 , ..., Aj be a basis of A(G), and let Xj, 

X 2 , ...,x^ be functions in C 2 such that x,(e) = 

0 and A;(x^) (e) = {ij = 1, 2, . . . , d; e is the 
unit element of G). Take a neighborhood of e, 
and define a function (p(tif) = L?=i xfig) for g 
contained in the neighborhood, and extend 
this function to G, in such a way that (peC 2 
and (p^k>0 {k — constant) outside of the 
neighborhood of e. Then geG defines a trans- 
formation of G^ by z^G — go, Xg(D = oi, and in 
this way G^ is supplied with the structure of a 
thomogeneous space with the transformation 
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group G. Now let X{t) be an invariant Markov 
process on which is '^continuous in proba- 
bility. Then the tsemigroup T, (which is a 
^strongly continuous semigroup on C) of the 
process X (t) is characterized as follows; The 
domain of the infinitesimal generator A of 
contains C 2 , and for feC 2 

1 = 1 i.J=l 

+ [ |/M-/(t) 

JCc-{e} ( 

-Z 

i = l 

where a^, are real numbers {ij =1,2, ...,d) 
such that (aij) is a symmetric nonnegative 
definite matrix, and n(dcr) is a nonnegative 
measure on G^ — {e} such that 
n(d(j]<oo. Conversely, given such a^, a^j and 
n{da), there exists one and only one invariant 
Markov process on G^ whose infinitesimal 
generator is given as above. 

A similar result is obtained when M is a 
factor space of a Lie group by its compact 
subgroup. Furthermore, for more concrete 
homogeneous spaces such as spheres or Loba- 
chevskil spaces (more generally, tsymmetric 
Riemannian spaces) the eanonieal form of the 
invariant Markov processes and infinitely 
divisible laws is obtained by making use of 
harmonic analysis [5], 
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Adeles and Ideles 



A. Introduction 

The concept of idele was first introduced by C. 
Chevalley {J. Math. Pares AppL, (9) 15 (1936); 
Ann. Math., (2) 41 (1940)), for talgebraic num- 
ber fields. Later on, this concept and the allied 
concept of adele were defined for tsimple alge- 
bras and also for talgebraic groups over 
algebraic number fields, and the two concepts 
became important in the arithmetical theory of 
these objects. We shall first explain the general 
concept of restricted direct product, by means 
of which adeles and ideles will be defined. 



B. Restricted Direct Product 

Let I be an index set. Suppose we are given, 
for each pel, a tlocally compact group Gp, and 
for each p except for a given finite set, say 
pi,P 2 ,...,Pr,a compact open subgroup of 
Gp. Let G be the subgroup of the direct prod- 
uct ripe/ Gp consisting of elements (^p) whose 
Gp-components lie in U^, except for a finite 
number of p. Put U = nriGp.xnp^p.Up. 
Then G is a locally compact group with re- 
spect to the tproduct topology. The group G 
can be supplied naturally with a topology with 
respect to which G is a locally compact group 
and the quotient space G/U is discrete. The 
group G together with this topology is called 
the restricted direct product of {Gp} with re- 
spect to {Gp}. 

C. Adeles and Ideles 

Let k be an talgebraic number field of finite 
degree and / be the totality of finite and in- 
finite tprime divisors of k. For each pel we 
denote by k^ and /Cp’“ the tcompletion of k with 
respect to p and the multiplicative group of 
nonzero elements of k^, respectively. Further- 
more, for each finite prime divisor p, we de- 
note by Op and Up the ring of tp-adic integers 
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of k and the multiplicative group of '•'units of 
Op, respectively. 

(1) Since is a compact open subgroup of 
/Cp as an additive group, we can construct the 
restricted direct product of {k^} with re- 
spect to {Op}. Then A^ is a locally compact ring 
with respect to the componentwise ring oper- 
ations. We call A^ the adele ring (or ring of 
valuation vectors) of k, and an element of A^ an 
adele (or valuation vector) of k. The element of 
the direct product IIp k^ whose p-component 
is a fixed element of k for all p is an adele. We 
call such an adele a principal adele. Since Up is a 
compact open subgroup of k^’' for each finite 
prime p, we can construct the restricted direct 
product J;; of {/Cp’'} with respect to {Up}. We 
call Jfc the idele group of k and an element of 
an idele of k. The element of the direct product 
n/cp whose p-component is a fixed element 
of k for all p is an idele. We call such an idele a 
principal idele. Each element b of induces an 
automorphism of the additive group A^ 
defined by /t,(o) = b • a (a e A^). Thus can be 
regarded as a subgroup of the automorphism 
group Aut(A,^) of the additive group A^. The 
topology of coincides with the relative 
topology of as a subgroup of Aut(A^). We 
note, however, that the topology of is dif- 
ferent from the relative topology of as a 
subspace of A^, and the former is stronger 
than the latter. Finally, for a tfunction field 

in one variable over a '•'finite field, the adele 
ring and the idele group can be defined 
similarly. 

(2) Let 91 be a '^normal simple algebra over k 
and O be a tmaximal order of 91. For each 

p e / put 91p = 91 ® fcfcp, and for each finite 
prime divisor p put Op == Op • O. Then Op is a 
compact open additive subgroup of 91p. By the 
adele ring A,j, of 91 we mean the restricted 
direct product of {91p} with respect to {Op}. 

Let 91p’‘ and Up be the multiplicative group of 
nonzero divisors of 91p and the multiplicative 
group of the units of Op, respectively. (Up can 
be defined only if p is a finite prime divisor.) 

By the idele group J,^j of 91 we mean the re- 
stricted direct product of |91p’'} with respect to 
(Up). The notion of principal adele (or idele) of 
91 can be defined similarly, as in (1). The struc- 
tures, as topological groups, of Aj, and Jy, do 
not depend on the choice of a maximal order 
O. The adele ring A^ and the idele group 
described in (1) are special cases of Ay, and J„, 
respectively. 

(3) Let G be a linear ^algebraic group de- 
fined over k, and let Gp be the set of k^- 
trational points of the group for each pel. For 
each finite prime divisor p, let Gp be the set of 
elements a of Gp such that the coordinates of 
both a and are p-adic integers. We can 
then construct the restricted direct product of 



(Gpl with respect to {Gp}, which is called the 
idele group (or adele group) of G. 

In the following section we focus on describ- 
ing the fundamental properties of adeles and 
ideles of an algebraic number field k. We shall 
start, however, by observing more generally 
those adeles and ideles of a normal simple 
algebra 91 over k. (For the properties of the 
adele group of algebraic groups ^ [7]; 13 
Algebraic Groups.) 

D. The Structures of the Adele Ring and Idele 
Group 

Let 9i be a normal simple algebra over an 
algebraic number field of finite degree /c. We 
identify the totality of principal adeles of 91 
(principal ideles of 9?) with 9? (9I'‘), and denote 
it by the same letter 9i (91''). Then 9i (9I'‘) is a 
discrete subgroup of Ay, (Jy,). The quotient 
group Ay,/9l is compact. Denoting by [otplp 
(apC/Cp) and (ape9?p) the tnormalized 

valuation of k^ and the treduced norm from 9?p 
to kp, respectively, we define, for ae Jy,, a posi- 
tive number: F(a) = Ilpe/l-^p(«p)lp. where a = 
(«p). We call F(a) the volume of a. If a is a prin- 
cipal idele, we have K(a) — 1 by the tproduct 
formula on valuations. Denote by J,^.the set 
of ideles o with F(a) — 1 and put C,^ = J,^/9I . 
Then has finite volume with respect to 
the +Haar measure of J„. Furthermore, C° is 
compact if and only if 91 is a tdivision algebra. 
In particular, C° is compact. Let Q be the field 
of rational numbers. For each rational prime 
p, we define a character Ap of the completion 
Qp of Q with respect to the p-adic topology 
(by a character of Qp, we mean a continuous 
homomorphism from Qp to the 1 -dimensional 
torus R/Z): If p = p^ is the infinite prime of Q, 
then we put Ap^(x)= — xmodZ (xeQp). If p is 
finite, then we let Ap be the composite of the 
following three canonical homomorphisms, 
namely, the one from Qp to Qp/Zp, the one 
from Qp/Zp to Q/Z, and the one from Q/Z to 
R/Z. We define a character Ap of 9Ip as follows: 
Ap = ApOtr(9Ip/Qp), where p is the rational 
prime divisible by p and tr(9Ip/Qp) denotes the 
treduced trace from 9Ip to Qp. For x,ye9lp, 
put (x,y)p = exp(27n'Ap(xy)). Then the additive 
group 9lp is self-dual relative to (x, y)p. Fur- 
thermore, if we put < 0 , b> = rip(ap, hp)p for 
a = (Up) and b = (hp)eAy, then Ay, is self-dual 
relative to <a,b>. The tannihilator of the 
group of principal adeles with respect to <a, b> 
is 9?. Hence it follows from tPontryagin’s 
duality theorem that Ay,/9? is compact. Hence- 
forth let 91 = fc. We call the quotient group 
= Jfc//c'' (an element of CJ the idele class 
group of k (an idele class). If a character x of J^ 
satisfies the condition x(°^)= 1 for all aek (i.e.. 
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if X is a character of Q), we call such a charac- 
ter a Grdssencharakter {or Hecke character). 
Grossencharakters were introduced by E. 
Hecke as characters of a certain type of the 
^ideal group of k {Math. Z., 1 (1918), 5 (1920)), 
but they are essentially the same as the ones 
defined above [1]. Let be the connected 
component of the identity element of C^. Then 
Cfc/D* is totally disconnected and compact. 
Hence a Grdssencharakter x is of finite order if 
and only if ;^(D^)= 1. We can prove by tclass 
field theory that CJD^ is canonically isomor- 
phic to the Galois group over k of the maxi- 
mal Abelian extension of k (-^ 59 Class Field 
Theory). For the structure of the following 
fact is known; Let rj and rj be the number of 
treal infinite prime divisors and timaginary 
infinite prime divisors of k, respectively. Then 
the dual group of is isomorphic to R x 
Q'-i+'-j - 1 X where R is the additive group of 

real numbers with the usual topology and 
Q (Z) is the additive group of rational numbers 
(rational integers) with the discrete topology. 
Let F be a function field in one variable over a 
finite field Fq. The properties of the adele ring 
and idele group of F are similar to the prop- 
erties of Aj and J^, while the group Cp has 
a simpler structure than C;<. To explain the 
structure of Cp, let F be the maximal Abelian 
extension of F, G be the Galois group of F/F, 
and Gp be the subgroup of G consisting of the 
elements <j such that ff(a) = a®" for all aeFo 
( = the talgebraic closure of Fp), where q is the 
number of elements of the finite field Fp and n 
is a given rational integer. Also, let be the 
subgroup of Gp consisting of the elements 
inducing the identity map on Fp. G® is a com- 
pact group with respect to the tKrull topology. 
Gp can be naturally supplied with a topology 
such that the group Gp is a locally compact 
group and the quotient group GpjGp is dis- 
crete. Then class field theory implies that Gp is 
isomorphic to Cf as a topological group. 

The following characterization of the adele 
ring of a number field or function field in one 
variable over a finite field is the work of K. 
Iwasawa {Ann. Math., (2) 57 (1953)). Let A be a 
tsemisimple commutative and locally compact 
topological ring with unity 1. Assume that A is 
neither discrete nor compact, and moreover 
that A contains a discrete subfield k3l and 
A/k is compact. Then k is an algebraic number 
field or a function field in one variable over a 
finite field, and A is isomorphic to the adele 
ring of k as a topological ring. 



E. Ideles and Cohomology 

Let K be a Galois extension of finite degree of 
an algebraic number field k, and © be the 



Galois group of the extension K/k. © operates 
naturally on the idele group and the idele 
class group of K. The structures of the 
^cohomology groups of © with the coefficient 
groups and Cj^ were investigated by G. 
Hochschild, T. Nakayama, E. Artin, J. Tate, 
and others. In particular, we have H^(©,Ck) = 
{0} and H^(©,Ck)^ Z/nZ (cyclic group of 
order n), where n = [K;k]. These facts play an 
important role in one of the proofs of class 
field theory (^ [3]; 59 Class Field Theory). 
Furthermore, A. Wei) introduced the so-called 
Weil group, which is a tgroup extension of a 
certain type of by ©. He defined the most 
general L-functions, which include both 
t Artin L-functions and tHecke L-functions 
with Grossencharakters (^ [2]; 450 Zeta 
Functions). 



F. Fourier Analysis on the Adele Group 



Dedekind zeta functions and Hecke L- 
functions are tmeromorphic on the whole 
complex plane and satisfy functional equa- 
tions of certain types. This can be proved by 
methods of Fourier analysis on the adele 
group Afc (Artin, Iwasawa, Tate [1, 8, 9]). For a 
continuous complex-valued function q)(o) on 
Afc satisfying suitable conditions, we define the 
Fourier transform of cp{a) as follows: 



(p{a)— <p(a)<a, b>db, 

hk 

where dh denotes the Haar measure on A^. 
By normalizing dh suitably and applying 
+ Poisson’s summation formula, we get, for 
each idele a of k, 

^ (p{acc)= F(a)"^ ^ 0{a~^cc). 

aek xek 



This is called the ©-formula. Consider the 
following integral on 






r 



F(a)®x(ct)<p(a)d*a, 






where d*a denotes the Haar measure on 
s is a complex number, and ;( is a Grossen- 
charakter of k, namely, a character of This 
integral converges if s > 1, and by using the 
©-formula one can show that (^(s) is meromor- 
phic on the whole complex plane and satis- 
fies a functional equation of a certain type. 
When the function (p is of special type, then 
the above integral can be explicitly expressed 
as the product of L-functions, F-functions, and 
exponential functions. This method of express- 
ing L-functions by integrals on J;^ and apply- 
ing the ©-formula ean be applied to investi- 
gate tHey zeta functions and L-functions of 
various types defined for a simple algebra (-^ 
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450 Zeta Functions) (G. Fujisaki [6]; T. Tama- 
gawa, Ann. Math., 77 (1963)). 
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Affine Geometry 

A. Construction of Affine Spaces 

An affine space A is constructed as follows: Let 
F be a tvector space over a tfield K, and let A 
be a nonempty set. For any vector a e V and 
any element p of A, suppose that an addition p 
+ ae/l is defined so as to satisfy the following 
three conditions: (i) p -I- 0 = p (0 being a zero 
vector); (ii) (p + a) + b = p -h (a -I- b) (a, b e F); and 
(iii) for any qeA there exists a unique vector 
a e F such that q — p + a. (Condition (i) follows 
from (ii) and (iii).) Then we call A an affine 
space, F the standard vector space of A, and K 
the coefficient field of A. Each element of A is 
called a point. 

If we fix an arbitrary point oeA, there is a 
one-to-one correspondence between A and F 
given by the mapping sending peA to aeV 
such that p = o4-a. Such an element a of F is 
called a position vector of p with the initial 
point 0 and is denoted by dp. We say that r-f 1 
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points p„ (0 ^ a < r) of A are independent if r 
vectors a,- = p^; (1 ^ i < r) are linearly inde- 
pendent in F; otherwise, they are said to be 
dependent. This definition of dependence of 
points Pj, is independent of the choice of the 
initial point among them. If F is of dimension 
n, we say that A is of dimension n, dim A — n; in 
this case, we sometimes write A" instead of A 
and F" instead of F. The affine space A is of 
dimension n if and only if the maximum num- 
ber of independent points in zl is n-l- 1. 

Next, for any vector subspace F'^ of F" and 
an arbitrary point peA", we put Al = {qeA"\ 
qf = p4-x,xe F*} and call it a subspace of A". It 
is an affine space of dimension /c. Conversely, 
every subset of A" that is an affine space under 
the affine space structure of A can be expressed 
in this form. A ^ A^, and in A” are called 
a line, plane, and byperplane, respectively. A set 
that consists of only one point is also consid- 
ered as a subspace A^. For subspaces A' and 
A^ of A", we denote by A' n A* the intersection 
(i.e., the set-theoretic intersection) of zF and zl*, 
and by A''U A® the join of A' and (i.e., the 
intersection of all subspaces that contain both 
zl'' and zl^). Then zl'' fl zl* is the affine space of 
highest dimension contained in A' and A\ and 
zl' U zl^ is the affine space of lowest dimension 
that contains zl' and zl^. If r-|- 1 points are 
given in zl”, there always exists a subspace zl' 
that contains all these points. In particular, if 
the points are independent, then such an A"" is 
unique. Moreover, if zl'rizl“#0 (0 is the 
empty set), then we have r-f-s = dim(zl'Uzl*)-t- 
dim(zl' n .4*). This is called the dimension 
theorem (or intersection theorem) of affine 
geometry. 

Next suppose that r+ 1 points p^iO^a^r) 
in zl" are independent, and put — Po- Let 
q^ be an arbitrary point on that differs 

from p^ and p^+i • If ^^e elements of K such 
that A* • - q;p;;i , then qo,...,qr are de- 

pendent if and only if A°A’ ... A' = (— And 
if r ^ 2 and = g, U p „+2 U . . . U p^+, (p;+, = 
if 1 ^ 1 ), then ao, ...,c,. have a point in common 
if and only if A° AU . . A' = 1. The former is 
called Menelaus’s theorem, and the latter 
Ceva’s theorem. 

The set L(zl) of all subspaces (including 0 
considered as an affine space of dimension — 1) 
constitutes a ^lattice with respect to the inclu- 
sion relation. 



B. Parallelism in Affine Spaces 

Let zl' and zl* be subspaces of A". We say 
that A' and zl* are parallel in the wider sense 
if either of the following conditions holds: 

(i) zl' 3 zl* or A* 3 zl'; or (ii) zl' fl zl* = 0 
and dim(zl' U zl*) ^ r -l- s. Next, let zl' and 
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B' be subspaces of A" of the same dimension. 

If A' and B' coincide, or A*" fl B'' = 0 and 
dim(A''UB'') = J'+ 1 , then they are said to be 
parallel in the narrower sense (or simply paral- 
lel), and we denote the relation by A'' H B'. If 
1 , the definitions of parallelism in the 
narrower sense and wider sense are equivalent. 
If r> 1, parallelism in the narrower sense im- 
plies parallelism in the wider sense. For two 
sets (n J and (bj (0 < a ^ r) of r -I - 1 independent 
points, let V' and W be vector spaces with 
bases and hob,- (1 respectively. 

Then A'' = Uq U . . . U < 3 ,. and B'' = bg U . . . U are 
parallel if and only if V = W; and for an 
arbitrary point p, there exists a unique r- 
dimensional subspace that is parallel to A'' and 
passes through point p. If A'' and B* are paral- 
lel in the wider sense, then there exist sub- 
spaces A* and B' (t > 1) of /F and B* that are 
parallel to each other. Moreover, if neither A'' 
nor B^ is contained in the other and if t is the 
largest integer with the property just given, 
then we have t = r + s+\— dim(/T U B*). 

Parallelism between subspaces of A" is an 
equivalence relation. Specifically, the equiva- 
lence class of a 1 -dimensional subspace A^ is 
called a point at infinity and is denoted by . 
Given a subspace A' of A”, the set of points at 
infinity represented by lines A ‘ con- 
tained in A’’ is denoted by A'^^; we have 
A' // B' if and only if —B''~^. A'~^ is 

called a space at infinity. In particular, the set 
A^“^ is called the hyperplane at infinity. The 
set-theoretic sum A"U A^' — A" is supplied 
with the structure of a tprojective space; the 
“points” in A" are elements of A", and the 
“lines” in A" are A' U A^ and Aj^. 

C. Coordinates of Affine Spaces 

If we fix a point o in A” and a basis (cj , . . . , e„} 
of the standard vector space F", then any 
point p in A" is uniquely expressed as 

n 

P — o+Y^x'-Ci, x'eK. ( 1 ) 

1=1 

The system 5 = (o; ej , . . . , e„) is called an affine 
frame (simply the frame) of A"; the point o 
is called its origin, and e, is called the ith 
unit vector. The mapping sending p to (x^, 

. . . , x") gives a ^bijection of A” to K"; we call 
(xS ... ,x") affine coordinates of p with respect 
to and x' the I'th affine coordinate. In partic- 
ular, if X is a topological field (e.g., the real 
number field R or the complex number field 
C), this bijection A”->JC” induces a topology of 
A", which can be shown to be independent of 
the choice of For the remainder of this 
article, by “coordinates” we mean affine co- 
ordinates unless otherwise stated. Putting 



«; = o -f e,- ( 1 ^ i ^ n), we sometimes call (o; , 

an affine frame. Further, putting 
a,-, 7 !:,- = oUai U ... Ua,-_i (JUj+j U ... Ua„, we call 
( 3 ;, Ij, and n,- the ith unit point, the ith coordi- 
nate axis, and the ith coordinate hyperplane, 
respectively. 

Assume that subspaces A’’ and A* (r, s > 0, r -f 
s = n) are not parallel in the wider sense. For 
a point p of A", denote by A''(p) the subspace 
that passes through p and is parallel to A', and 
put q - A'ip) n A\ A mapping 9 : A"-» A^ de- 
fined by (p{p) — q is called a parallel projection 
on A* with respect to A^ In particular, if A'' = 

7 C; and A'^ = /j (r = n — 1 , s = 1 ), we write (p{p) — p,-. 
Then the ith coordinate x' of p is an element 
of K such that opi = x‘Mi. Hence such coordi- 
nates are also called parallel coordinates (or 
Cartesian coordinates). 

Suppose that we are given r+ 1 points 
bo, . . • , b, of A" and r -|- 1 elements 2®, . . . , 0 of 
K such that 1- We fix a point 0 of A". 

If a point p in A" satisfies bp — then 
boP — 'LUi ^'bobp, hence p is contained in the 
subspace bo U . . . U b,. Conversely, if a point p is 
contained in the latter subspace, then there 
exists a system (2®, . . . , X') such that 

= and 

The system (A®, . . . , has a geometric mean- 
ing since we also have o'p = Z«=oA“o'ba if we 
replace the point o by any other point o' of A". 
The elements A®, . . . , A"" are called barycentric 
coordinates of p with respect to {bo, . . . , bj. In 
particular, if {bo, . . . , b,.} are independent, 
then the barycentric coordinates (A®, . . . , A*") 
are uniquely determined by the point p on 
bo U . . . U br- F urthermore, let (y *,..., y") be 
affine coordinates of p with respect to an affine 
frame and let (x^ , . . . , x") be affine coordi- 
nates of bj, (oe = 0, . . . , r). Then p belongs to the 
subspace A*” = bo U . . . U b0f and only if y' = 
SUoA“x' (i= !,..,,«). In this case we say 
that the system of the linear equations y' = 

Z A*x‘ (22®=!) gives a parametric represen- 
tation of the subspace A' (by parameters A®). 
Specifically, if r = n— 1, the solvability of the 
system of equations y ‘ = S"=o A®x„ (i = 1, . . . , n), 

1 = £ A® implies the equation 2"=i y'Pi = Bo ^^or 
some nontrivial constants Hence the 

latter equation represents the hyperplane n = 
A"“'. If a point p has baryeentrie coordi- 
nates A® = A^ = ... = A'' = (r-l- 1)“‘ with respect 
to {bo, ...,b,.}, it is called the barycenter of bp, 
bi , . . . , b,.. The barycenter is uniquely deter- 
mined by the set {bo, ... , b^} and is denoted 
by B (bo, ... , b,). Specifically, the barycenter 
of two points bo and bj is called the midpoint 
(or middle point) of bo and bj. If we divide 
B = {bo, . . . , b^} into two sets of points and if 

and Q 2 are barycenters of these two sets 
of points, respectively, then Ugj passes 
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through the barycenter of B. More generally, a 
point with barycentric coordinates (A®, . . . , A'') 
with respect to {bo,...,b,} is called a barycen- 
ter of /?o? ••• > K with weights A*^, . . . , A^ 

D. Affine Spaces over Ordered Fields 

Suppose the coefficient field K is an ^^ordered 
field (e.g., the real number field R). Given a 
hyperplane n of A", we take an affine frame for 
which n is the nth coordinate hyperplane. If we 
denote coordinates of points with respect to 
this frame by (xS . . . , x"), then the equation of 
7t is given by x" = 0. Let A"+ and .4"_ be sets of 
points whose nth coordinates are positive and 
negative, respectively. They are called half- 
spaces of A" divided by n. The union of n and 
a half-space is called a closed half-space. A 
half-space of a subspace A' of A" (divided by 
some /F ^ on A') is called a half-space of di- 
mension r. For a point p of A" that does not lie 
on K, the half-space containing p is called the 
side of p with respect to n. In particular, when 
n = 1, let p and q be two points on a line /. The 
closed side of q with respect to p is called the 
(closed) half-line (or ray) from p to q. The inter- 
section of the closed half-lines emanating from 
ptoq and from ^ to p is called the segment 
joining p and q and is denoted by Clearly 
pq^^. A subset C of A" is called a convex set 
if the segment joining two arbitrary points of 
C is contained in C. Each half-space of each 
dimension is convex. For any family of 
convex sets, is also convex. Therefore, 
for any subset D in A" there exists a minimal 
convex set that contains D. It is called the 
convex closure (or convex hull) of D. The con- 
vex closure C(F) of a finite set of points P = 

{po, ... ,Pjj) in A" is called a convex cell, and 
dim(po U ... Upt) is called the dimension of the 
convex cell. In particular, when Po, ... ,Pk are 
independent, C{P) is called a fc-dimensional 
simplex with vertices Po,---^Pk- 1 -dimen- 

sional simplex having two distinct points p 
and q as vertices is the segment pg, and the 
vertices p and q are called ends of the segment. 
A point is regarded as a 0-dimensional sim- 
plex. Each 2-dimensional or 3-dimensional 
simplex is called a triangle or tetrahedron, 
respectively. A /c-dimensional simplex S with 
vertices po, ... ,Pfc is a set of points whose 
barycentric coordinates A“ (0 ^ a ^ /c, Z A* = 1 ) 
with respect to the vertices satisfy A“ ^ 0. On 
the other hand, if we put A'‘ = Po U . . . U p* and 
= Po U . . . U p,_i U p,+, U . . . U Pfc, and denote 
by the side of p^ in and by A J the closed 
side of Pa in A'^ with respect to n^, then the 
simplex S is given by p|a=oAa, and f)a=oAa is 
called an open simplex. A” has the structure of 
a ttopological space in which the set of open n- 
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dimensional simplexes forms a ^base of +open 
sets. In particular, if K is R, the topology of A" 
thus defined is compatible with the one that is 
naturally induced by the topology of R. With 
respect to this topology. A" is a "t^Hausdorff 
space. The terms open and closed used before 
for n-dimensional simplexes agree with the 
corresponding notions with respect to this 
topology. 

A subset of A" is said to be bounded if it is 
contained in some simplex. A bounded set 
obtained through a finite process of construct- 
ing intersections and unions from a finite 
number of closed half-spaces is called a poly- 
hedron. The points of a convex polyhedron are 
characterized by several linear inequalities 
satisfied by their coordinates. A set of points 
whose coordinates (x\ ...,x") satisfy /I'^x'^ 
k' for k\ h‘eK is called a parallelotope; it is a 
polyhedron whose ^interior is called an open 
parallelotope. A simplex is a polyhedron, and 
polyhedra admit tsimplicial decompositions. 

A polyhedron can also be defined as the set- 
theoretic union of a finite number of simplexes. 

Let E be a finite set of points, and let its 
convex closure C{P) be a convex cell of dimen- 
sion m. Then C{P) is contained in an m- 
dimensional subspace A"*. The ^boundary of 
C{P) in the topological space A"' is called the 
boundary of a convex cell C(P). We can take a 
subset Q of P so that dim C(Q)^m—l and 
C{Q) is the intersection of the boundary of 
C{P) and an (m— l)-dimensional subspace. 
Such a C{Q) is called a face of C(F), and 
we denote this relation by C{P))^C{Q). If 
C(E)>C(Pi)>-...>-C(Ps), then C(PJ is called 
an (m — s)-dimensional face of C(P). A 0- 
dimensional face is called a vertex, and a 1- 
dimensional face is called an edge. Suppose 
that C{P)>C{Q) ior P = {po, ...,p^} and Q 

= { Pio ’ • • ■ ’ P'. - , } ■ ^ = P'o ^ ^ Pi- - . ® 

hyperplane of £ — Po U . • . U p^, and C(P) is 
contained in a closed side of E divided by F. 
Therefore, if C(P) has d (m — l)-dimensional 
faces, then C(P) is expressed as the intersection 
of d m-dimensional closed half-spaces. This 
shows that any convex cell is a polyhedron. 

E. Affine Transformations 

A mapping (p: A"^A'" is an affine mapping if 
there is a linear mapping (p: F"-)- F'" of the 
standard vector spaces of A" and A”' such that 
(p(p-l-x) = (p(p)-|-<p(x) holds for any pe A" and 
any xe F". An affine mapping of A" into itself 
is called an affine transformation (or affinity) 
of A". Specifically, a bijective affine transforma- 
tion is called a regular (or proper) affine trans- 
formation. An affine transformation <p of A" is 
characterized by each one of the following 
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properties: (i) Let oe A" be a fixed point. Then 

is a mapping of A ” onto itself that can be 
expressed as 

(p{o + \) = o + a+f{\), (2) 

where a is a fixed vector of V” and / is a 
linear transformation of V". (ii) The mapping 
(p: is a mapping such that (f){a)(p(h)^ 

/ • (p(p)(p{q) if ab = A- {/.eK). Moreover, if 
the ^characteristic of K is not equal to 2, an 
affine transformation is also characterized as 
follows: (hi) ^ is a mapping that sends lines 
into lines and preserves the ratio of each pair 
of parallel segments. 

The set ?l(A”) of all regular affine transfor- 
mations of A ” constitutes a group that we call 
the group of affine transformations. If the 
linear mapping / associated with a regular 
affine transformation (p is the identity map- 
ping, then (p is called a translation. The set 
%(A”) of all translations is a normal subgroup 
of 9f(/4") and is called the group of translations. 
The group of translations is isomorphic to V 
regarded as an additive group. The vector 
group S(/4") (i.e., an additive group of a linear 
space) acts '^simply transitively on A”. We see 
that '2l{.4")/®(/l")SGL(n, iif), where GL{«, K) 
denotes the '''general linear group. The set of 
all regular affine transformations that leave a 
point 0 of A ” invariant constitutes a subgroup 
©(A") of it is called an tjsotropy group 

at 0 and is isomorphic to GL(n, K). Let ^ = 

(o; e, , , e,) be an affine frame of A ” with 
origin o, and let ^ be a regular affine trans- 
formation of A ” given by (2); put x 
(p(o -h x) = o -h Z 3c'e,, a = Z and /(e,) = 

Z Then (p is expressed with respect to 
by the following equation: 

n 

x‘ = a'+ ^ alx\ det(u|)?^0, 1 (3) 

k=\ 

Conversely, a transformation that is given by 
(3) is a regular affine transformation. Elements 
of and ©(A") are expressed with respect 

to by 

x‘ = x'-ha‘, (4) 

and 

x' = X det(aj) ^0, (5) 

k = I 

respectively. HenCC )M(A") is represented as a 
■''semidirect product group of ®(A") and (F>(A”). 
In particular, a regular affine transformation 
that is represented by x' = ax' (1 ^ n) for 
some ae K (a 7 ^ 0 ) is called a similarity with the 
origin 0 as center. 

According to F. Klein, the objects we deal 
with in affine geometry are the properties 
(parallelism, barycenters, etc.) that are invar- 
iant under regular affine transformations. 



Subsets S, and S2 of A” are called affinely 
congruent if there exists a regular affine trans- 
formation (p sending 5, OlltO S,. For a fixed 

k, two k-dimensional simplexes are affinely 
congruent. Now we fix an affine frame J in A” 
and denote by x^ the coordinates of n -|- 1 
points Pj (0 ^ a ^ u) in A ”. Then the quantity 

1 

( 6 ) 

is called the volume with respect to of the ii- 
dimensional simplex with vertices Po, , P„- If 
(p is a regular affine transformation given by 
(3), we have V=det{aj) V. Hence the ratio of 
volumes of two n-dimensional simplexes is 
independent of the choice of coordinate sys- 
tems, and is invariant under regular affine 
transformations. 

A regular affine transformation given by (3) 
satisfying det(aj) = 1 is called an equivalent 
affnity. The set of all equivalent affinities 
constitutes a subgroup of '>21 (A"). The geometry 
belonging to this group is called affine geome- 
try in the narrower sense. For instance, the 
concept of volume is an invariant in affine 
geometry in the narrower sense. 

F. Relation to Projective Geometry 

Fet P" be a projective space over a coefficient 
field K 343 Projective Geometry). If we fix 
a hyperplane Ti^ in P", then the set of projec- 
tive transformations that leave n.^ invariant 
constitutes a subgroup of the group of '•'projec- 
tive transformations of P"; this subgroup is 
isomorphic to a group of regular affine trans- 
formations. Actually, if we use a iprojective 
frame [a,), a,, ,a„ u] such that a„ , a, are 
points on 7:^ , then each projective transfor- 
mation leaving n,j- invariant is expressed by 
equations of the same form as (3) with respect 
to the tinhomogeneous projective coordinates. 
The point set A ” complementary to 71^ in P” 
is an affine space, and coincides with the 
hyperplane at infinity. Moreover, two distinct 
lines in P” are parallel in A ” if they meet on the 
hyperplane at infinity. Hence, denoting by 
(0, l\. , /") the -i-homogeneous projective co- 
ordinates of the intersection of a line / in A” 
and we call (/\ , /") the direction ratio of 

l . A projective transformation leaving each 
point of invariant induces a translation. 

The ^principle of duality that holds in projec- 
tive geometry does not hold in affine geome- 
try. The +pole of the hyperplane at infinity, 
with respect to a quadric hypersurface, is 
called the center of the quadric hypersurface. A 
regular quadric hypersurface is called central 
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or noncentral according as its center belongs to 
A” or is a point at inlinity. Quadric hypersur- 
faces in an affine space are elassilied in several 
ways, by taking account of their relations with 
the hyperplane at infinity (^78 Conic Sec- 
tions, 350 Quadric Surfaces). 
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Algebra 

Though the word “algebra” usually refers to 
a field of mathematics as will be explained 
below, the word may also denote specific 
mathematical stmctures such as tassociative 
algebras, tjordan algebras, ■’'Clifford algebras, 
etc. The first concepts concerning “unknowns” 
in algebra originated in India, whence came 
also our decimal positional system of numer- 
ation. These ideas were introduced to Europe 
through Arabia in the Renaissance period. E. 
tViete systematized them into a symbolic 
method, called algebra, representing numbers 
by letters. The first problem of algebra was 
solving equations. Before Viete, G. Cardano 
and E. Eerrari had solved algebraic equations 
of degrees 3 and 4; the solution of equations of 
lower degree had been known from antiquity. 
The effort to solve equations of higher degree 
remained unresolved until the middle of the 
19th Century, when N. H. tAbcl and E. tGalois 
proved the nonexistence of algebraic solutions 
of such equations. They considered not only 
individual roots of these equations but also 
any rational transforms of their roots at the 
same time, and thus were led to the concept of 
Ifields. They also noticed that the problem of 
algebraic solution could be characterized by 
properties of permutation groups of the roots. 



After the discovery of the Galois group, group 
theory and group-theoretical considerations 
maintained the central position in algebra for 
some time (-^ 172 Galois Theory). These de- 
veloped into the “abstract algebra” of this 
Century in the general atmosphere of arithme- 
tization and of axiomatization of mathematics. 
At the turn of the century the monumental 
textbook in three volumes by H. Weber [ 1 ] 
was considered the standard work on algebra. 
Then there appeared in 1910 an epoch-making 
paper [2] by E. Steinitz on the abstract theory 
of fields. 

The main objects of algebra today are talge- 
braic systems of various kinds, such as fgroups, 
’Tings, tfields, and ■’'modules. Another fun- 
damental concept of algebra is that of tiso- 
morphism or of thomomorphism. The col- 
lection of algebraic systems of a given kind, 
together with the homomorphisms among 
them, gives rise to the notion of '’'category; a 
functor is a sort of homomorphism between 
categories (-^ 52 Categories and Eunctors). 
These notions were first used in thomological 
algebra, created in the 1940s by methods 
transferred from topology to algebra; now 
they are of basic significance to the whole of 
mathematics. 

An important branch of algebra with wide 
applications is the theory of t vector spaces, or 
more generally that of tmodules over a ring. 

This branch is called linear algebra. Homo- 
morphisms between finitely generated modules 
can be represented by tmatrices. Another 
branch of algebra, called trepresentation 
theory, is concerned with representations of 
groups or rings by matrices. The methods of 
modem algebra provide useful and powerful 
tools for the whole of mathematics, in partic- 
ular for the theory of numbers and algebraic 
geometry. 

The present development of algebra owes 
much to the activity of the German school in 
the late 1920s represented by E. Noether, E. 
Artin, W. Krull, and B. E. van der Waerden. 
The book by van der Waerden [3] has had a 
great impact on mathematics. N. Bourbaki [4] 
has been influenced by van der Waerden but 
gives accounts of more recent developments, 
particularly in linear algebra. In Japan, M. 
Sono, who worked at about the same period 
as E. Noether, was a forerunner in this field; 
after him, algebraists of the Kyoto School, Y. 
Akizuki, M. Nagata, and their followers, did 
notable research, especially in algebraic geom- 
etry. On the other hand, K. Shoda studied 
with E. Noether toward 1930 in Germany; his 
school includes such algebraists as T. Naka- 
yama, K. Asano, and G. Azumaya. Einally 
K. Morita and his disciples have made signif- 
icant contributions to homological algebra. 
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Algebraic Curves 

A. General Remarks 

An ^algebraic variety of dimension 1 is called 
an algebraic curve (for analytic theory ^11 
Algebraic Eunctions). The theory of algebraic 
curves has two aspects, the geometry of 1- 
dimensional complex manifolds in projective 
spaces and the theory of function fields of 
transcendence degree 1 (-^ 3 Abelian Varieties, 
16 Algebraic Varieties). The number-theoretic 
study of algebraic function fields concerns the 
latter theory (-73 Complex Multiplication, 
450 Zeta Eunctions). In this article, the geo- 
metric aspect of the theory is emphasized. 

We denote the ^universal domain by K. 

B. Classical Results on Plane Algebraic Curves 

Eet f(X, Y) be a polynomial of degree m in 
two variables X and Y. A point set in an affine 
two-space defined by f[X, Y) = 0 is called the 
plane algebraic curve of degree m. If we set 
f (Yo, Y, , Y^) = XlfiXJXo, Y^/Yo), the 
homogeneous polynomial F defines an alge- 
braic curve of degree m in a projective plane 
P^. The curve is called irreducible if the poly- 
nomial /(Y, Y) is irreducible. A curve of degree 
1 is said to be a line. Some results of this sec- 
tion are valid only in the case where the char- 
acteristic of K is zero. 

Eet C be a plane curve defined by the equa- 
tion f{X, Y) = 0. A point F = (a, b) on C is 
called an r-ple point if /(Y + a, Y -)- 6) has no 
term of degree < r in X and Y. At an r-ple 
point there are r tangent straight lines (count- 
ing multiplicity). An r-ple point with r > 1 is 



called a multiple point (or singular point). If 
these tangents are distinct, P is called ordi- 
nary. An ordinary double point is called a 
node; e.g., the origin for X^ -f -3X Y = 0. 

An algebraic curve can be transformed into a 
plane curve that has only ordinary multiple 
points by a finite number of plane tCremona 
transformations (tquadratic transformations of 
the projective plane into itself). 

Eet C be an-irreducible plane curve of de- 
gree m > 1 in a projective plane. The set of all 
tangent lines at nonsingular points of C deter- 
mines a set of points in the dual projective 
plane, and its closure is an algebraic curve C, 
that is called the dual curve of C. The dual 
curve of C becomes the original C. The degree 
m’ of C is said to be the class of C, which is 
equal to the number of tangent lines to C 
drawn from a general point. A nonsingular 
point P is called a point of inflection (or a flex) 
if the tangent line at P has contact of order 
> 2. If C is defined by an irreducible homoge- 
neous polynomial F{Xq, Yj, X,), the curve 
defined by det(d^F/5X,dX^)=^0 is said to be the 
Hessian of C. A nonsingular point P is a point 
of inflection if and only if P is contained in the 
Hessian. A singular point P is said to be a cusp 
if C is detined by an equation =X^ + higher 
terms, in terms of suitable affine coordinates X 
and Y where P = (0, 0). Whenever the singular 
points of C are only v nodes and y cusps, the 
effective genus g is given by the formula g = 

(m — 1) (m — 2)/2 — v -y. In addition, suppose 
that the dual curve has only v’ nodes and y’ 
cusps as singularities. Then C has y’ points 
of inflection, and m’ and y’ are given by = 
m(m- !) — 2v — 3y and y' = 3m(m — 2) — 6v — 8y. 
Moreover, m=m’(m’- 1) — 2v'— 3y' and y = 

3m' {m' — 2) 6v' 8y' hold. These formulas are 

called PIucRer formulas. 

Eor example, a nonsingular plane curve of 
degree 3 is an elliptic curve, i.e., g = 1, of class 
6, which has 9 points of inflection. A non- 
singular plane curve of degree 4 has, in gen- 
eral, 24 points of inflection and 28 bitangents. 
This results from the Plucker formulas. 



C. Fundamental Notions 

In what follows, by a curve we mean an alge- 
braic variety of dimension 1. Eet E be a notl- 
singular complete irreducible curve. An ele- 
ment of the free Abelian group generated by 
points of r is called a divisor. A divisor is 
written in the form a = J] with «,• £ Z. The 
integer n = Z,n, is called the degree of a and is 
denoted by deg a. The expression for a divisor a 
is said to be reduced if P^ Pj for i A divisor 
whose reduced expression has only positive 
coefficients is called a positive divisor (or effec- 
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tive divisor, or integral divisor), and this is 
denoted by a>0. The group of divisors on F is 
denoted by G(r), and the subgroup consisting 
of divisors of degree 0 is denoted by Go(r). Let 
F be a point of F. The subset of the function 
field K(F) of F consisting of functions regular 
at F forms a valuation ring Rp for a tdiscrete 
valuation of K(T). A prime element t of Rp is 
called a local parameter at P. Let Vp be the 
tnormalized valuation of K(F) defined by Rp; 
the integer Vp{f) is called the order off at P. 
The point F is a zero off if Vp{f) > 0; it is a 
pole off if Vp( f ) < 0. There are only a finite 
number of poles and zeros of a given function 
f_ The divisor '^Vpif) P is called the divisor of 
the function f and is denoted by (t). The set of 
divisors of functions forms a subgroup Gj of 
G,. Any divisor a in G, is called a principal 
divisor (we also say that a is tfinearly equiva- 
lent to zero and write a ~ 0). 

Let a be an arbitrary divisor. The set of all 
positive divisors that are linearly equivalent to 
a forms a complete linear system |a| deter- 
mined by a. We set L(a) = { /e K(r) (f) -I- 
a>0} U {0}. Then L(a) is a finite-dimensional 
vector space over K, and 1-dimensional sub- 
spaces of L(a) correspond bijectively to the 
elements of aj. We set J(a) = dim, L(a) and 
dim|a| = /(a)- 1. Then dim|a|is called the 
dimension of a|. For any divisor a, the integer 
deg a — dim I a is nonnegative and bounded. 

The supremum g of such integers is called the 
genus of F. The nonnegative integer i(a) = 
g deg a -h dim a is called the speciality index 
of a. 

Let CO be a +differential form on F, F be a 
point of F, and f be a local parameter at P . 

Then co can be written in the form co =fdt. We 
now set Vp(co) = Vp{f) and (w) = S Vp(o))P. Then 
(w) is a well-defined divisor, and the class of 
(w) in G/G; is independent of the choice of w. 
This divisor class is called the canonical class; 
any divisor in this class is called a canonical 
divisor (or differential divisor) and is denoted 
by 1. We have J(f) = y, deg f = 2g-^2. Given a 
divisor a, the index i(a) is equal to the number 
of linearly independent differentials oj such 
that (co)>a, i{q) = /(! — a). The equality J(a)= 
deg a —g -j- 1 + i(a) is called the Riemann-Roch 
theorem. 

For any irreducible algebraic curve F, there 
exists a birationally equivalent nonsingular 
curve r that is unique up to isomorphism. The 
genus of r is called the effective genus of F. A 
curve whose effective genus is zero is called a 
rational (or unicursal) curve. An elliptic curve is 
a curve whose effective genus is 1. An irreduc- 
ible curve F with genus ^ 2 is called a hyper- 
elliptic curve if K(F) is a quadratic extension of 
a field K(t) for some t. 

A positive divisor a on a nonsingular com- 
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plete irreducible curve F is called a special 
divisor if i(a) > 0. In this case, 2 (J(a) ~ 1) ^ deg a, 
where equality holds if and only if a = 0 or a ^ 

1, or F is a hyperelliptic curve (Clifford’s 
theorem). Let F be a nonsingular complete 
irreducible cufve and k be a sublield of K such 
that r is defined ovcr k. Denoting by k the 
algebraic closure of k, we call a divisor p = 

Z n,F; on F a prime rational divisor over k if 
p satisfies the following three conditions: (i) p 
is invariant under any automorphism c of k/k; 
(ii) for any j, there exists an automorphism 
0 ) of k/k such that Pj = Pp; (in) Hi= . =n,= 
[_k{Pi):k/\i. An element in the subgroup of 
G(F) generated by prime rational k-divisors is 
called a k-rational divisor. Let k{F) be the 
subset of K(F) consisting of functions f defined 
over k. Then k(F) is a subfield of K(Fj, and the 
quotient field of k(F) ® = K(T). k(F) is 

called the function field of F over k. Let p be a 
prime rational k-divisor, and let F be a point 
of p. Then Rp fl /c(F) is a valuation ring of k(F) 
uniquely determined by p and independent of 
the choice of the point F in p. We call this 
valuation ring the valuation ring determined 
by p- 

D. Algebraic Function Fields 

Let k be a field, and let K be a finite separable 
extension of a purely transcendental extension 
k(x) of k such that k is maximally algebraic in 
K. Then K is called an algebraic function field 
over k of dimension 1 (or of transcendence 
degree 1). The equivalence class of texponen- 
tial valuations of K that are trivial over k is 
called a prime divisor of K/k. An element of the 
free Abelian group generated by prime divisors 
is called a divisor of K/k. The group opera- 
tion in the divisor group of K/k is usually 
denoted multiplicatively. Let Rp be the 
valuation ring of the prime divisor F, and 
let Mp be the maximal ideal of R,. The de- 
gree deg F of the prime divisor F is defined by 
[(Fp/Mp): k]. If we replace the terms: cUfVC F 
by function field F/k; K(F) by K; K by k; and 
points on F by prime divisors of K/k, we can 
develop the theory of the function field K/k, 
which is similar to the theory of nonsingular 
curves F (- Sections B, C). Thus we define the 
genus of the function field K/k. 

Suppose we are given an algebraic function 
field K/k of dimension 1. An algebraic curve F 
defined over k is called a model of K/k if k(F) 
and K are k-isomorphic. For any function field 
of dimension 1, there always exist two ele- 
ments X and y in F such that K= k{x, y). Let 
f{X, Y) be an irreducible polynomial such that 
f{x, y) = 0. Then the plane curve defined by the 
equation f{X, Y) = 0 is a model of K/k. Among 
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the models of K/k there exists a tnormal model 
Fq over k that is unique up to isomorphism 
(and the uniqueness of the normal model of 
the function field within the birational equiva- 
lence class of varieties holds only for curves). In 
particular, if k is the complex number tield, the 
normal model Fq is the tRiemann surface of 
the function lield K/k. If Fq has no singular 
point, the theory of the curve Fq and the 
theory of the function tield K/k are essentially 
identical. (This occurs, for example, when k is 
^perfect.) In that case the genus of Fq is equal 
to that of K /k. In general, the genus of the 
function field is not less than the genus of the 
normal model Fq, and it is greater than the 
latter if Fq has a singular point. If the genus of 
K/k is zero, we can take a plane quadratic 
curve as a model of K/k. Moreover, K/k has a 
prime divisor of degree 1 if and only if iC is a 
purely transcendental extension of k. A func- 
tion lield K/k of genus 1 is called an elliptic 
function field. If K has a prime divisor of de- 
gree 1, an elliptic function field K has a model 
of a plane cubic curve. Moreover, if the char- 
acteristic of the lield k is different from 2, we 
can take as the model Fq the curve defined by 
anequationoftheform = —Q 2 X — g^- 
This is called Weierstrass’s canonical form. 

The number j = (6(2^—2703^)''\92^ (#0)is a 

birational invariant of Fq. 

E. Jacobian Varieties 

Let F be a nonsingular curve. A Tgroup variety 
J is called the Jacobian variety of F if it has the 
following four properties (we fix an algebrai- 
cally closed tfield k of definition for F and J): 
(i) There exists an isomorphism (of abstract 
groups) of Gq(F)/G,(F) into J. (ii) 0 is contin- 
uous in the following sense: Let ii, b be elements 
of Go(F)/G,(F) represented by a, b. If b is a 
specialization of a over a field K{z:> k), then 
<I)(b) is also a specialization of (I>(d) over K. (iii) 
If there exists a K-rational divisor in the class 
Q, then the point d)(d) is also K-rational. (iv) 
For any J, there exists a k{(^)-rational divi- 
sor a in Gq such that 0(a mod G,) = <^. A group 
variety J satisfying these conditions is neces- 
sarily a complete variety, hence an tAbelian 
variety, and is determined uniquely up to 
isomorphism. The construction of Jacobian 
varieties over a lield of arbitrary characteristic 
is due to A. Weil [27] (for analytic construc- 
tion -^11 Algebraic Functions C). 

Let P be a tgeneric point of F over k, and let 
Pq be a fixed k-rational point. Then (p{P} = 
0(P— Fq) defines a rational mapping of F into 
J, and (p, which is an isomorphism of F and its 
image <p(F), is determined uniquely by $ up to 
translation on J. This mapping (p is called the 



canonical function on F. The dimension of J is 
equal to the genus g of F. If Fj , . . . , are 
independent generic points of F over k, then 
k(Fj , . . . , Pg)s is the function field of J over k, 
where k(Fi , . , Pg)s is the subfield invariant 
under the group of g! automorphisms (P, , . . , 
Fg)-^(Fj, , , Fj ). The Jacobian variety of F is 
also the -i-Picard variety of F, and it is equal to 
the tAlbanese variety of F (- 16 Algebraic 
Varieties P). Hence for any function / on F 
with values in an Abelian variety A, there 
exists a unique homomorphism 2 of J into A 
such that f <p + const. This /J, is called the 
linear extension of f. 

Let 0 be the set of points on J that can be 
written as (p{Pi) + + (p{Pg-i). Then 0 is an 
irreducible subvariety of codimension 1. The 
divisor 0 is called the canonical divisor of J. 
The Jacobian variety that is polarized by the 
divisor 0 is called the canonically polarized 
Jacobian variety (-^ 3 Abelian Varieties G). If 
two curves F and F' are birationally equiva- 
lent, the canonically polarized Jacobians of F 
and F' are isomorphic. Conversely, if the 
canonically polarized Jacobian varieties J of F 
and J' of F' are isomorphic, then F and F' are 
birationally equivalent (Torelli’s theorem). Let 
T be any integer such that 1 ^g, and let VF, 

be the set of points that are written in the 
form 

(p{Pi)+... + (p{P,) 

(W^ = (p{r), Wg-i = @,Wg = J). Then we have 
Wg_^ (tnumerically equivalent) and 
(0(9)) = y!, where 0*6 ig the class of intersec- 
tions of r copies of 0. The existence of a divi- 
sor 0 is characteristic for Jacobian varieties. 
Actually, if A is an Abelian variety of dimen- 
sion n that has an irreducible subvariety X"~^ 
of codimension 1 and a positive 1-cycle C such 
that (V'"b = n! and = (n 1)! C, then C 
is a nonsingular irreducible curve, A is the 
Jacobian variety of C, and X is the canonical 
divisor. The canonical divisor 0 is defined 
by a ttheta function in the classical case. 

For a divisor a = Z n,P;, we define cp{a) to be 
En,<p(F,)e J. For lixed numbers l^r^g, 0<d, 
welet IF,'' denote {(p(a)G7 1 /(a) 1, deg a — 

Y } . Then IF,'' ^ IJ/ and is a tsubscheme of J. 
Wecall the number p = g—{d-f- \){g — r + d) 
the Brill-Noether number. Then dim IF,'' J: p. 
Moreover, if F is a general curve, then dim IF,'' 
= p. In particular, if p < 0, then IF,^ = 0. This 
result has been verified recently by S. Kleiman, 
D. Laksov, P. Griffiths, and J. Harris [9, 14]. 

Let F be a nonsingular curve, and let Cd be a 
differential form on F. If the divisor (w) is >0, 
the (p is called a regular 1-form or differential 
form of the first kind. Let Q be tbe tsheaf of 
germs of regular differential forms. A differen- 
tial form of the first kind is an element of 
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Q), and vice versa. Let f be a canonical 
divisor. Then we have a natural isomorphism 
Q) ^ L(f), and the number of linearly 
independent differential forms of the first kind 
is equal to the genus g of L. The ^residue of a 
differential can be defined as in the classical 
case. A differential that has nonzero residues is 
called a differential of the third kind. The re- 
sidue theorem Z Res^co = 0 holds for any dif- 
ferential to, The form Q) is called a differential 
form of the second kind if for any Pe T there 
exists a rational function fp such that OJ — dfp is 
regular at P. The set of differential forms of the 
second kind forms a linear space G 2 over the 
universal domain and contains the subspace 
G, consisting of the differential forms of the 
first kind. The quotient space Gj/dkiV) has 
dimension 29 or y according as the character- 
istic of the universal domain is 0 or not. 

When the characteristic p of the universal 
domain is positive, we have what is called the 
Cartier operator. Let F be a curve defined over 
a perfect tield k, let L = /c(F), and let t be an 
element of L that is transcendental over k and 
such that L/k{t) is separable. Then any dif- 
ferential w of L/k is written uniquely as O) 

+ where e L. 

Then the Cartier operator C given by Cco 
= fp-i dt is well defined and independent of 
the choice of t and leaves Gj invariant. Hcncc 
given a basis tUj, , of Gj, we obtain a 
matrix (a,) with coefficient in L by Co, = 

Z CiijOOji 1 ^ 1 ^ y). This gxg matrix A is called 
the Hasse-Witt matrix of F. The class of 
A modulo the transformations of the form 
S“'’/4S is a birational invariant of F and plays 
an important role in the theory of unramified 
cyclic p-extensions of the algebraic function 
field. 



F. Generalized Jacobian Varieties 

The notion of linear equivalence of divisors on 
a nonsingular curve can be extended to a more 
general situation. Such attempts have been 
made by M. Noether, F. Severi, and M. Roscn- 
licht, who succeeded in obtaining such a gen- 
eralization [22], 

Let r be an algebraic curve, and let 
P, , , Pj be singular points of F. Let Op. be 
the tlocal ring of Pj. We set 0 = P|J=, Op. and 
F' = r — |p^, , pj. An element of the free 
Abelian group G(F) generated by points of F' 
is called a F-di visor. Let a be a F-di visor and 
set L(a) = {/6O|(/)-|-a>-0}U{0}. Then L(a) 
is a finite-dimensional linear space (over the 
universal domain). The dimension of L(a) is 
denoted by 1(a), and we setdim|a|=7(a)— 1. 

The upper bound n of deg(a) - dim| a| is a 
nonnegative integer and is called the 0-genus 



of F. We call i(a) = 7i — dega-l-dim|a| the O- 
speciality index of the divisor a. Let C be a 
nonsingular curve birationally equivalent 
to F, and let Q j , . . . , be points of C that 
correspond to singular points of F. An O- 
differential at is a differential form on C (of 
K(F) = K(C)) such that Zi=i ReSg./o) = 0 for 
any /eO. Then i(a) is equal to the number of 
linearly independent 0-differentials w such 
that (at)>a in F'. The equality /(a) = deg a — 

TT + 1 + i(a) is called the generalized Riemann- 
Roch theorem. An O-differential U) is called an 
0-differential of the first kind if o) is regular 
everywhere on F'. The number of linearly 
independent 0-differentials of the first kind is 
equal to the 0-genus 7i. Let g be the effective 
genus of F, i.e., the genus of C. Then we have 
the equality n~g= dim^fO/O) = where 0 IS 
the integral closure of 8 in K(T). The set of O- 
differentials forms an D-module that is in 
general not of rank 1. Hence in this case, we 
do not have the “canonical divisor.” Let c be 
the conductor of 0/0. Then c determines a 
divisor in a natural way. If we denote the de- 
gree of this divisor by d, we have the inequal- 
ity ^ -|- 1 ^ d ^ 2(5. We have d = 2d if and only 
if the set of 0-differentials forms an D-module 
of rank 1. This case occurs, for example, if F is 
a curve on a nonsingular surface or a complete 
intersection. Two F-divisors a and b are said 
to be 0-linearly equivalent if there exists a unit 
/ of 0 such that a-b=(f). The set of F- 
divisors that are 0-linearly equivalent to zero 
forms a subgroup G,(F) of G(T). There exists a 
group variety J^, unique up to isomorphism, 
that satisfies the four conditions required for 
Jacobian varieties (- Section E) with respect 
to the class group Go(r)/G;(F). The variety Jq 
is called the generalized Jacobian variety. The 
generalized Jacobian variety is not complete, 
in general. If J is the Jacobian variety of C, 
then dp is an extension of J by a connected 
tlinear algebraic group Any Abelian exten- 
sion of the function tield of F can be obtained 
by the tisogenies of the generalized Jacobian 
variety of F [22], This fact plays an important 
role in claSS field theory over algebraic func- 
tion fields (-^ 59 Class Field Theory G). The 
theory for nonsingular curves is considered as 
the special case in which C = K(F). 

Suppose that T is situated in a projective 
space of dimension n. Let p be the prime ideal 
in k[2fQ, Xj, , XJdeiining F and j((p, m) be 
the number of linearly independent homoge- 
neous polynomials of degree m modulo p. Then 
;f(p, m) is a polynomial in m for large m. This 
polynomial is called the Hilbert polynomial of 
p (or F). Let c be the constant term of the 
Hilbert polynomial. The number Pa(F) = 1 — c 
is called the arithmetic genus of F and is equal 
to the C-genus of F. Let F be a nonsingular 
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irreducible curve in of degree d. If F is 
contained in a plane, then p„(F) = (d 1) • 

(d-2)/2. Otherwise, p„(F)^d^/4 — d+ 1 when 
dis even, and l)/4-d+ 1 when d 

is odd [11], 

G. Sheaf Theory 

Let r be an irreducible curve and (9p be the 
local ring of p. Then [jCJp is an talgebraic 
coherent sheaf, which is called the structure 
sheaf of F, and dim, //'(F, (9^) i® equal to the 
arithmetic genus 7t of F. Let a be a F -divisor, 
and let Fr(o) be the tsheaf of germs of rational 
functions / such that (/)+ a>0 and /eOg for 
every singular point Q of F (- 383 Sheaves 
D). Then dim, 0,(a)) is equal to the 

speciality index i(a), and dim, H*^(F, C-rfa)) 
is equal to 1(a). When F has no singular 
point, the Riemann-Roch theorem is deduced 
naturally from tSerrc’s duality theorem: 

H. Algebraic Correspondence 

Let F be a nonsingular curve. A divisor of 
the product variety F x F is called an alge- 
braic correspondence of F [26, 27]. Let Dq 
be the subgroup consisting of divisors that 
are linearly equivalent to degenerate divisors 
a X F -f- F X b. Then the class group '^^(^) = 

G(F X F)/f>o is called the group of classes of 
algebraic correspondences. We write X =0 if 
X is an element of Dq . Let X be an algebraic 
correspondence, k a field of delinition for F 
over which X is rational, and P a generic 
point of F over k. Then X(P) = pr2[X(P x F)] 
is rational over k(P). The composite Aj o Xj 
of two correspondences X, and X 2 is defined 
by (A, o X 2 ) {P)~X^ {X 2 {P}) whenever they 
have meaning. The composite X, o A2 deter- 
mines an element of '^(F) that depends only on 
the classes of X, and X,. This multiplication 
supplies the group '^(F) with the structure of 
an associative ring. This ring is called the 
correspondeuce ring of F. The correspondence 
ring '^(F) and the ring of endomorphisms of 
the Jacobian variety 3 are isomorphic, and the 
isomorphism is given by the following rule: Let 
^ be an element of and let X be a divisor 
in Let f* be a generic point of F with re- 
ference to k over which X is rational. Let be 
a k-rational point of F. Then the class of 
X(P) — A(Pq) modulo G,(F) is independent of 
the choice of a divisor X in the given class. We 
set T(P) = 0(X(P) — X(Pq)) andlet A be the 
linear extension of Y. The correspondence 
an anti-isomorphism of 'if(r) and s/. 
Now we set .s/q = .q/ ( g> Q_ Then contains 
an automorphism / of order 2 called an in- 



volution. Let 1 be a rational prime different 
from the characteristic p. Then A has a faithful 
representation by 2g x 2g matrices with co- 
efficients in Fadic integers. The ttrace o of 
this representation has the property that 
o{ jiO ]K') > 0 if / 0 (Castelnuovo’s lemma), 
is an algebra of finite tank over Q, and is a 
finitely generated Abelian group. Based on 
these results A. Weil proved the Riemann 
hypothesis for congruent (-functions on a non- 
singular curve (- 450 Zeta Functions P), 

I. Coverings 

Let F and C be nonsingular curves such that 
there exists a regular mapping k: F C. Then 
there is an injection of the function field K(C) 
into K(T). If K(f) is separably algebraic over 
K(C), then F is called a covering (curve) of C. 
The integer [K(I): K(C)] = n is called the de- 
gree of covering. Let P be a point of F and let 
Q=n(P). Let t, s be local parameters at P on F 
and at Q on C, respectively. The nonnegative 
integer Vpidsjdt) is called the differential index 
at P and is denoted by trip. The index trip is 
zero except for a finite number of points. The 
divisor ^ WpP is called the branch divisor. The 
covering F is called an unramified covering if 
the branch divisor is zero. If we denote the 
branch divisor by a, we have the formula 
2gf(F) — 2- n{2g(C) - 2) + deg a, where <^(F) 
and g(C) are genera of F and C, respectively. 
This is called the Riemann-Hurwitz formula. 
This formula yields at once that a rational 
curve has no nontrivial unramified covering 
and that F can be an unramified covering of 
itself if and only if F is an elliptic curve. 

J. Theory of Moduli 

Let ,y^g{k)he the set of isomorphy classes of 
complete nonsingular irreducible curves (here 
simply called “curves”) of genus g defined over 
a field k. We can endow J^g{k) with a structure 
of an algebraic variety over k with the prop- 
erty that for any smooth family (over k) n:^-t 
S of curves of genus y the map T:S^J^g{k) 
sending s to the isomorphy class of the fiber 
7t'''(s) is a tmorphism. This algebraic variety is 
called the (coarse) moduli space of curves of 
genus g. Furthermore, the moduli space over 
Spec(Z) exists (D. Mumford [16]). -#g(k) is 
tnormal, irreducible and tquasiprojective (but 
not complete for 3 > 0) of dimension 0 (g = 0), 1 
(^1 = 1), 3^-3(t/^2)( = 30-3-hdimAut(C)) 

with only quotient singularity [5]. 

Since when g = 1 any elliptic curve C over 
an algebraically closed lield k is isomorphic to 
a plane curve with the Weierstrass canonical 
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form — Ax^ —g^x — g^^iif char(/c) 2; the case 
of char(k) = 2 needs a slight modification), the 
correspondence C^j{g 2 , g^) Section D) 

defines an isomorphism (affine line). 

In the case of g ^ 2 and k = C we have another 
function-theoretic construction of Mg due to 
Teichmuller (-11 Algebraic Functions F). 

Let Mg be the coarse moduli space (over Z) 
of principally polarized Abelian varieties of 
dimension g (-^ 3 Abelian Varieties 1). For a 
curve C we denote the Jacobian variety of C 
by J(C) (- Section E). The correspondence 
C->J(C) defines a morphism v..Mg-^Mg which 
is injective (Theorem of Torelli). it is even an 
timmersion (F. Oort, J. Steenbrinck). If g= I, 

2, 3, then i is an open immersion whose image 
we can describe. 

For k =C we can express the above map 
i by using periods of curves. Namely, let 
«!, , oc^, , (ig he a (canonical) basis of 
H, (C, Z) detined by tnormal sections of C 
(considered as a real surface). Let co,, . . , w^be 
a basis of idifferentia! forms of the first kind 
on C with o)j = 3jj. Then the matrix Q = 

is symmetric and has a positive definite 
imaginary part, i.e., is an element of the ’’'Sie- 
gel upper half-space of degree y. With the 
identification Mg ~ S^/Spig, Z) (- 3 Abelian 
Varieties 1) the map i:Mg-^Mg is nothing but 
the one sending the isomorphy class of C to 
Rmod Sp(^, Z). For g =4 the closure of i(.y#J 
is a principal divisor detined explicitly with 
theta constants (Schottky, J.-I. Igusa). 



K. Stable Curves 

A reduced connected complete curve C over k 
is called a stable curve of genus g ( > (0) if(i) C 
has only ordinary double points as possible 
singularities; (ii) when F is a smooth rational 
irreducible component of C, then F intersects 
the other components in more than 2 points; 
(hi) dim, H'(C, &c) = Q- 
There exists a tcoarse moduli space 5^ of 
stable curves of genus g which contains Mg 
as a Zariski open subset. The space ,9^ detined 
over Spec(Z) is tcomplete and even tpro- 
jective; hence M'g gives a compactification of 
Mg. The completeness of M'g follows from the 
stable reduction theorem as follows: let R be a 
discrete valuation ring with quotient field K 
and C be a smooth connected curve over K. 
Then there exists a finite separable algebraic 
extension L of K such that the curve C x 
extends to a flat family of stable curves over 
SpecfR^), where denotes the integral clo- 
sure of R in L. In this case we say that the 
curve Cx f^L has a stable reduction in R,. 
With the above notation a curve C over K 
has a stable reduction in R if and only if its 
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Jacobian variety J(C) has a tstable reduction 
in R (- 3 Abelian Varieties N). 

Over C, Mg has the tSatake compactification 
Mg, which is a disjoint union of Mg>, 0 ^g' 
as a set. The injection i\Mg-^Mg (- Section J) 
extends to a morphism y.^g-^^g that sends 
the isomorphy class of a stable curve C = 

C] U U Q (Q irreducible) to that of J{Ci) 

X X J{QeMg-, ^' = Sigenus(Q, where Q is 
the tnormalization of Q. In particular the 
closure of the image of i in Mg is the set of 
products of Jacobian varieties. 
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Algebraic Equations 

A. General Remarks 

LetG(Xp...,X,), , F,{X„ . ,AJbe r 
tpolynomials in m variables X^, ■ , X^ over a 
tfield k. Then the equations 

F, - 0 , . ..) F=Q 

are called algebraic equations in m unknowns. 
When we consider these equations simulta- 
neously, where r ^ 2, we call them a system of r 
equations or simultaneous equations. (For r = 1, 
a system of one equation means the single 
equation fj = 0.) Coefficients of E, , , F^ are 
called coefficients of the system, and the great- 



est of the degrees of E, , . . , f,. is called the 
degree of the system. 

To solve a system of equations (henceforth 
in this article we shall omit the word “alge- 
braic”) means to fmd the common tzero points 
(in an talgebraically closed field containing k) 
of elements f j , . . , E^ of the +polynomial ring 
/c[2fj, , X,]. If there exist no common zero 
points, the system is said to be inconsistent; if 
there exists a finite number of such points, it 
is said to be regular; and if there are an infinite 
number of such points, it is called indetermin- 
ate. The telimination method allows us to 
reduce the problem of solving a system of r 
equations to the case r = 1 . In particular, any 
regular system of equations can be reduced to 
the case m=r= 1. 



B. Equations in One Unknown 

For the above reason, it is important to con- 
sider an equation of the form/(X) = 0, where 

f{X)^aoX'' + a,X"-^ + ...+a„, ( 1 ) 

This gives the general form of an algebraic 
equation in one unknown. 

According asf{X) is reducible or not in the 
tpolynomial ring k[J], the equationf(X)=0 
is called reducible or irreducible (- 337 Poly- 
nomials). In some talgebraic extension field K 
of k, f(X) can be factored: 

f{X) = a,{X-^,){X-a,)...{X^a„). ( 2 ) 

* 1 , . ,a„ are called the roots of the equation 
/ (X) =0. Hence, any algebraic equation of 
degree n has exactly n roots (Kronecker’s 
theorem). Now, ( — l)‘u,/flo is equal to the tele- 
mentary symmetric function of degree i of 
a J, .-'jan- Some of the roots may be 

identical. If a appears p times in (Xj , . , we 
say that a is a p-tuple root, and p is called the 
multiplicity of the root oc. When p= 1, a is 
called a simple root, and when p ^ 2, a is called 
a multiple root. Let /(j, ... , jS^be all the distinct 
roots among aj, . , a„, and let p- be the multi- 
plicity of j?;(i= 1, , v). Then 

( 2 ') 

Pi + . -I- Pv = n. 

If Pi, • , Pv divisible by the '•'character- 

istic of k, the greatest common divisor g off 
and 

= + (”“ + + ^«-i 

is (X — ...(X — Thus we can 
reduce the multiplicity of every root to 1 by 
dividing / by g. Any irreducible equation over 
a field of characteristic 0 has no multiple roots. 
Equation (1) has multiple roots if and only if 
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its discriminant D is equal to 0 (- 149 Fields; 
172 Galois Theory). 

C. Equations of Special Type 

In Sections C and D we assume that the char- 
acteristic of k is zero. 

Binomial Equations. An equation of the type 
A''" — a =0 is called a binomial equation. It is 
solved by root extraction. Let (mtb root of 
u) be one of the roots (if a is a positive real 
number, usually denotes a positive real 
root). Then multiplied by 1, C, 
are the roots of X"' — a — Q, where ( is a ■''prim- 
itive mth root of unity. 

Reciprocal Equations. An equation a, X" + 

-I- = 0 is called a reciprocal 
equation if = ai =u„_i, U 2 = u„_ 2 , ■ A 
reciprocal equation of an odd degree n = 

2m + I has a root X = — 1 , and dividing the left 
side by X + 1 we get a reciprocal equation of 
degree 2m. A reciprocal equation of degree 
n= 2 m is reduced to an equation of degree m 
in Y= X -I- X and the quadratic equation 
X^-XY+l=0. 

D. Equations of Lower Degree 
(- Appendix A, Table 1) 

(1) A linear equation «o X + a, = 0 has a single 
root —aja,y (2) the roots of a quadratic equa- 
tion dfjX^ + a^X + a 2 = 0 are given by (— 
±\/uf — 4aoa2)/2flo. (3) To solve a cubic 
equation a^X^ + a^X~ + U 2 X + a 2 — 0,we set 
Ay =9aoa,U2 — 2u]-27c/ofl3, A2 = a] — 3aQa2, 
and solve the quadratic equation A, T 
+ A\ — Q. Let ty and t 2 be the roots of this 
quadratic equation, and let u) be any cube root 
of I. Then (-a, Xoi^/Ty A-<j)^^fi2)l2>a(y is a 
root of the original cubic equation. (Cardano’s 
formula). If we apply this method to a cubic 
equation -h hX^ -t- cX 4- d = 0 with real 
coefficients, we need to use complex cube 
roots even if the roots of the equation are real. 
In fact, it has been proved that it is not pos- 
sible to solve this equation within the real 
numbers in this case; i.e., if the cubic equation 
is irreducible over the extension Q(a, b, c, d) of 
the rational number field Q, and if all of its 
roots are real, it is impossible to find the roots 
only by rational operations and with real 
radicals. This is called the casuS irreducibilis. 

(4) A quartic equation UqX‘^ + a, X ^ + a 2 l ^ 

-I- ^3 X -h «4 = 0 can be solved by means of 
reduction to a cubic equation (L. Ferrari) (-^ 
Appendix A, Table 1). Generally, the proce- 
dure of solving an algebraic equation, i.e., 



finding the roots of a given equation from its 
coefficients by means of a finite number of 
rational operations and extractions of radicals, 
is called a solution by radicals (or algebraic 
solution). The ''"general algebraic equation 
whose degree is ^ 5 cannot be solved by 
radicals (N. H. Abel) 172 Galois Theory). 

E. Analytic Theory 

In this section, k denotes the field R of real 
numbers or the field C of complex numbers. 
These cases have been studied for a long time, 
for practical reasons. 

Concerning the case k = C , the field C is 
talgebraically closed; i.e., every equation with 
coefficients from C has a root in C (Gauss’s 
theorem, called the fundamental theorem of 
algebra). Accordingly, in the field C, we always 
have equations (2) and (2’). 

Let «!, , be the roots of equation (1). 
Then each a,, is a continuous function of coeffi- 
cients a„ a„ , a,. Concerning the location of 
roots of/(X) = 0 and/'(X) = 0 on the complex 
plane, we have the following theorems: 

(1) Any convex polygon on the complex 
plane containing the roots of/(X) = 0 also 
contains the roots off (X) = 0 (Gauss). 

(2) Let C be a rectifiable i"Jordan curve not 
passing through a root off(X) = 0. Then 
the number (C,/) of the roots off(X) = 0 
lying in the region enclosed by C is equal to 

( 1/2ti/) jt^-(/'(z)//(x))dz, where the multiplicity 
of the roots is taken into account. 

(3) Let C be a Jordan curve on the complex 
plane. If |/(z)l> \ g{z) \ at every point z on C, 
then the equations /= 0 and /-I- g = 0 have the 
same number of roots (counting multiplicity) 
within the region enclosed by C (Roucbe’s 
theorem). 

(4) The absolute value of a root of equation 
(1) is less than 

M = max(|fli/ao|,...,|fl„/uol)+ I- 

(5) Let D be the discriminant of/, and as- 
sume that |a,-| (/^ 1, ...,n). Then |a, — 

^D/(2M)"'"“‘'“^ — E. Since the value of 

|D| is known fromf’and one value of M is 
given by theorem (4), we have one value of E. 
If we draw a circle on the complex plane with 
center at the origin and with radius M, and if 
we cover it with a net whose meshes have 
diameters less than y/Ejl, then the interior of 
each mesh contains at most one root of/ = 0. 

'When A: = R, i.e., /e R [X ], let , . . . . 
denote the distinct roots off= 0, and recall 
equation (2’). Suppose that , //e R and 
the others Then v-A is an even integer 2 k, 
and we can renumber ,/v so that 

= /^;+^+,,...,^A+K= (/^ denotes the con- 
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jugate of P) and = P;+,+i , . . . , = p,. 

In this case, are the real roots of 

equation (2), and the other j5’s are called the 
imaginary roots. 

(6) If / gR[J 6] and aQ>Ui> . . >u„>0, then 
the absolute value of any root of equation (1) 
is less than ] (Kakeya-Enestrom theorem). 

Concerning the real roots of an equationf 
= 0, where/eR[2f ], we have the following 
theorems: Let N{a, b){a,beR) denote the 
number of real roots in the interval (a, b). 
Furthermore, let K(Ci , C2, • , Cp) denote the 
number of changes of sign in the sequence 
c,,c'2, , Cp of real numbers, which is defined 
as follows: Suppose that we have the sequence 
, . . . , c^,^ after deleting the terms c,- = 0 from 
the sequence c^, C2, , Cp. Then 

1 <?-! 

V{ci,C 2 ,...,Cp) = ~ Y. (1 -sgnc ,c ). 

Zj=i J J 

(7) N(0, Go)= K(cfo, a„ , a„) (mod 2) and 
N ^ LfDescartes’s theorem). 

(8) Let V(c)= V{f{c),f'{c), ■ ■ ■ , f“‘(c)). Then 
N(a, b) = V{a) V(b) (mod 2) and N ^ V{a) — 

V(b) (Fourier’s theorem). 

(9) We may assume that/ =0 has no multi- 
ple roots. Construct a finite series /o =//, = 
f\ , / of polynomials over R such that 

fori = l,2, . , . , 1 and/;eR, 

by successive application of the tdivision al- 
gorithm. Let V(c)= K(/o(c),/,(c), ,/(c)). 

Then N(a, b) = V{a) — V(b) (Sturm’s theorem). 

By means of this theorem we can determine the 
location of real roots as precisely as we wish. 

(10) In order that every root a,- of an equa- 
tion /= 0 with a, > 0 lies on the left side of 
the imaginary axis, i.e., Re a; < 0, it is necessary 
and sufficient that in the following matrix the 
^principal minors composed of the first r rows 
and first r columns be positive for all r ~ 

1, 2, , II (Hurwitz’s theorem): 

a, U3 CI 5 a, 

3 , (I 2 U4 Cl(, 

0 U3 Uj 

0 a, U 2 .. . 

\o 0 a „_2 

Also, for/ e C [X ], various results have been 

obtained about under what conditions all the 
roots off= 0 lie on one side of a given straight 
line or inside a given circle (o-g-> the unit circle) 
(- 301 Numerical Solution of Algebraic 
Equations). 
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Algebraic Functions 

A. Definition 

An algebraic function is a multiple-valued 
^analytic function w = w(z) defined by an tirre- 
ducible algebraic equation P[z, w) =0 with 
complex coefficients. 

B. History and Methods 

The theory of algebraic functions evolved from 
the works of C. E. Gauss, N. H. Abel, and C. 

G. J. Jacobi on telliptic functions in the early 
19th Century. Stimulated by their works, B. 
Riemann and K. Weierstrass established the 
foundations of the theory of complex functions 
and developed the important theory of alge- 
braic functions. 

The equation P{z, w) = 0 defines a curve in 
the 2-dimensional complex tprojective space 
with inhomogeneous coordinates z, w. Inves- 
tigations from this point of view were initiated 
by Riemann, A. Clebsch, and P. Gordan. 

This approach was followed by A. Brill, M . 
Noether, and the Italian school (E. Severi, C. 
Segre, etc.) and has developed into contempo- 
rary algebraic geometry (-^ 9 Algebraic 
Curves, 12 Algebraic Geometry). 

The set of tfunction elements w(z) satisfying 
P{z, w) = 0 is a tcomplex manifold 91, a closed 
(= compact) tRiemann surface, on which z and 
w are tmeromorphic functions. The field Ktjj 
consisting of the meromorphic functions on 91 
is an talgebraic function field C(z, w). Con- 
versely, for any closed Riemann surface 9?, the 
field Xg, is an talgebraic function field in one 
variable over C, and any pair of functions z 
and w with Kgj = C(z, w) has the property that 
91 is tconformally equivalent to the Riemann 
surface determined in the above fashion by the 
irreducible algebraic equation P(z, w) = 0 satis- 
fied by z and w. Two Riemann surfaces 91i, 9?2 
determined by the equations P^ = 0, P 2 = 0 are 
conformally equivalent if and only if the fields 
and are C-isomorphic. This condition 
is equivalent to the existence of a tbirational 
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transformation between the algebraic curves 

= 0, = 0- The “analytic method” (the 

method of studying algebraic functions as 
functions on Riemann surfaces) is the creation 
of Riemann. It was extended by F. Klein and 
D. Hilbert, and later by H. Weyl, who es- 
tablished in his monograph [6] a rigorous 
foundation of the analytic method for the 
theory of algebraic functions. 

Given an arbitrary algebraic function field 
K in one variable over C, the set 91 of its 
tprime divisors with a suitable topology and 
analytic structure is a closed Riemann surface 
whose function field K95 coincides with K. The 
“algebraic method” (the method of studying 
algebraic functions as elements of an algebraic 
function field) was founded by J. W. Dedekind 
and H. Weber at the end of the 19th Century. 

In the 20th Century, the algebraic method has 
made remarkable progress, owing to the devel- 
opment of abstract algebra. It covers the case 
of an arbitrary ground lield as well as that 
of more than one variable. The theory of 
algebraic functions has had considerable in- 
fluence on the development of number theory 
because of a basic analogy between the two 
subjects. 

The tuniversal covering spaces (surface) ^ of 
a closed Riemann surface 9? can be regarded, 
by conformal mapping, as the Riemann 
sphere, the plane, or the unit disk (or, equiva- 
lently, to the upper half-plane) if the tgenus g 
of 91 is 0, 1, or ^ 2, respectively. Then the 
tcovering transformation group G, consisting 
of tlinear fractional transformations without 
fixed points in is tproperly discontinuous 
and has a compact tfundamental domain. 
Conversely, if D is one of the three domains 
just mentioned and if G is the group just de- 
scribed, then 9i = D/G is a closed Riemann 
surface such that D and G are its universal 
covering space and covering transformation 
group. A meromorphic function on 91 is repre- 
sented as an tautomorphic function on ^ with 
respect to G. If y = 0, then G = {1 }, ^ = 91, and 
is the field of rational functions. If g= 1, 
then is the field of telliptic functions. The 
study of algebraic functions as automorphic 
functions was initiated by H. Poincare and 
Klein. Recently, C. L. Siegel made a remark- 
able contribution to the investigation of the 
case of several variables. The theory of auto- 
morphic functions is also related to number 
theory. Works of E. Hecke, M. Eichler, and 
G. Shimura on this domain are noteworthy 
(- 32 Automorphic Functions, 73 Complex 
Multiplication). 

Another important topic concerning alge- 
braic functions (closed Riemann surfaces) is 
the problem of moduli. Riemann stated, with- 
out rigorous proof, that the set of conformal 
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equivalence classes of closed Riemann surfaces 
of genus g(^2) depends on 3g — 3 complex 
parameters, called moduli. This has led to the 
modern theory of Teichmiiller spaces, which is 
developing into an extensive new field (- 234 
Kleinian Groups, 416 Teichmiiller Spaces). 

In the rest of this article, we deal mainly 
with the analytic method (for the case of two 
variables - [I]}. 

C. Abelian Differentials 

An Abelian differential on a closed Riemann 
surface 91 is, by definition, a complex +dif- 
ferential form co = a(z)dz, where a(z) is a mero- 
morphic function of a local parameter z. Such 
a differential is said to be of the first kind if 
a(z) is holomorphic, of the second kind if the 
residue vanishes everywhere, and of the third 
kind otherwise. 

The indelinite integral W(p) = oj of an 
Abelian differential co, where is assumed not 
to be a pole of co, is called an Abelian integral. 
It is said to be of the first, second, or third kind 
if the same holds for co. If y is a 1-tcycle on 91, 
the quantity to is referred to as the period of 
to along y. An elliptic integral is defined to be 
an Abelian integral on a closed Riemann sur- 
face of genus 1. Eor example, this is the case if 
the equation P(z, w) = 0 defining the surface is 
of degree 2 with respect to w and of degree 3 
or 4 with respect to z. More generally, a closed 
Riemann surface is called byperelliptic if P(z, w) 
is of degree 2 with respect to w, or, equiva- 
lently, if 91 carries a meromorphic function 
with exactly two poles. An Abelian integral on 
such a surface is called a byperelliptic integral. 

On a closed surface 91, let be the linear 
space over c of the Abelian differentials of the 
first kind. Given a 1-cycle a of 91, there exists a 
unique co^e such that Re is equal to the 
tintersection number (a, y) for every 1 -cycle y. 
This differential is also characterized by the 
property ((m, m Jjj = ) f m a * m, = - 2^^ 
for every coeV^. If {aj, . . ..a.,} form a 
basis of the I -dimensional thomology group 
with integral coefficients, then Re 0),^ (i = 1, . , 
2g) form a basis of the linear space over R 
of the tharmonic differentials on 91 as well as 
that of the space { Re w cu 6 Fa} - Accordingly, 

dime Fa = g, dim, F^ = 2g. 

These identities show a close relationship 
between the topological structure of 91 and the 
space of the Abelian differentials on 91 (-^ 194 
Harmonic Integrals). 

One can choose a 1 -dimensional homology 
basis so that {y.i,(Xj) = {ag+i,(Xg+j}=^ 

0 , 1, and (a;,o(g+^.) = 0 = 

1,2,, g). Such a basis is called a canonical 
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homology basis, if a> and a are Abelian dif- 
ferentials of the first kind, then we have 




Together, these are called the (Riemann) period 
r el ati 0 n . The second formula implies that oj 
vanishes identically if the periods of co along 
all «/ (j = 1, . . , y) are zero. 

Let , a 2 be a 1 -dimensional homology 
basis, and let o), , . ,o)g form a basis of over 

C. The g X 2^ matrix Q with (Uj as its 
component is called a period matrix. Corre- 
sponding to the change of bases (a) and (w), it 
is subject to transformation into the form 
AQM, where A is a g x g invertible complex 
matrix and M is a 2g x 2g integral square 
matrix with determinant + 1. Conversely, two 
Riemann surfaces are conformally equivalent if 
they possess period matrices transformable to 
each other in this manner (Torelli'S theorem). 
We can choose w 1 , . . . , so that the cor- 
responding period matrix with respect to a 
canonical homology basis becomes (7^, T) with 
the g X g unit matrix Ig. Then from the period 
relation, T is symmetric and Im T is positive 
definite. 

On the complex linear space C®, consider 
the subgroup generated by the 2g column 
vectors of a period matrix Q (the subgroup 
is also denoted by Q). Since it is of rank 2g 
and properly discontinuous, a group mani- 
fold C^/Q is obtained. It is determined by 5? 
uniquely up to analytic isomorphism and is 
called the Jacobian variety of 91. The general- 
i zed Jacobian va r i ety is introduced in a simi- 
lar fashion by means of Abelian integrals of 
the second and third kinds (- 9 Algebraic 
Curves). 

D. The Riemann-Roch Theorem 

In the present context, a 0-tchain with inte- 
gral coefficients on a Riemann surface 91 is 
referred to as a d i vi SO r . A divisor d = 2 
(«,• e Z, p; e 91) is an integral divisor (or positive 
divisor) if ^ 0 in the reduced expression; d is 
a prime divisor if it consists of a single point 
and Ui= 1. A divisor of a meromorphic func- 
tion for an Abelian differential O) is defined 
by taking the p,- as the zeros (poles) off or oj 
and M;{ — rii) as the multiplicity of the zero 



(pole) at Pi- The divisors on 91 constitute an 
Abelian group in which principal divisors, 
i.e., divisors of meromorphic functions, consti- 
tute a subgroup )p. The factor group D/'ip is 
called the divisor class g r O U p ; an element of it is 
called a divisor class. The divisors of Abelian 
differential constitute a single divisor class, 
which is referred to as the canonical divisor 
class (or differential divisor class). The degree 
and the d i men si 0 n of a divisor class D are 
detrned as follows, independent of the choice of 
the representative d = S D\ degD = S Wj, 
dim D = dim(-{ f\f is meromorphic, (divisor of 
/) -I- d is a positive divisor}. For example, the 
degree of the principal divisor class is zero. 

In terms of these concepts, the Riemann- 
Roch theorem is stated as follows: For a divi- 
sor class D on a closed Riemann surface 91 of 
genus g and for an integer n, we have 

dim(D-l-«lF)-dim(-D-(l —n) W) 

= degD-f( 2 n-l)( 0 -l), 

where W is the canonical divisor class (- 9 
Algebraic Curves). 

This theorem implies the following prop- 
erties of 91: (i) deg \N = 2g 2. (ii) The holo- 
morphic invariant forms (pdz^ (i.e., analytic 
tensors of order 2), referred to as quadratic 
differentials, constitute a linear space over C of 
dimension 0 (if p = 0), 1 (if g = 1), or of dimen- 
sion 3p — 3 (if p ^ 2). The quadratic differentials 
have close connection with textremal quasi- 
conformal mappings and play an important 
role in the theory of Teichmuller spaces (-^ 

352 Quasiconformal Mappings, 416 Teich- 
muller Spaces), (iii) For a point pe 91, a posi- 
tive integer m is called agap valueif91 carries 
no meromorphic function having a pole only 
at p with multiplicity m. Then if p = 0, no point 
has gap values; and if g 1, every point p has 
exactly g gap values; in this case, p also has 
a nongap value m^g+ 1. A point p is called 
an ordinary point if the gap values at pare 
1,2, , p; otherwise p is called a W eierstrass 
point. If y ^ 2, then the total number N of 
Weierstrass points is not less than 2g -i- 2 and 
not greater than (g ~ l)p(p + 1) (A. Hurwitz). 
Moreover, the case N = 2p -i- 2 occurs if and 
only if 91 is hyperelliptic, and then the gap 
values at Weierstrass points are 1, 3, . . , 2p 1. 
This implies that every closed Riemann surface 
of genus 2 is hyperelliptic. (iv) Suppose the 
genus g of 9? is ^ 2. A conformal mapping / of 
91 onto itself with the property that every 1 - 
cycle y is always homologous to f(y) is neces- 
sarily the identity transformation. Also, 91 is 
known to admit only a finite number of con- 
formal mappings onto itself (H. Schwarz); 
the total number does not exceed 84(p— 1) 
(Hurwitz). 
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E. Abel’s Tbeorem 



Abel’s tbeorem is stated as follows: A divisor d 
of degree zero is a prineipal divisor if and only 
if it is expressed as d = by means of a 1- 
ehain y that has the property that a> = 0 for 
every coeV„. 

Given a divisor elass D of degree zero, con- 
sider a 1-ehain y with dyeD. For every 1-eyele 
IX there eorresponds the quantity 



X^D) - exp Re j 



independent of the choice of y. Thus D deter- 
mines a tcharacter on the 1 -dimensional ho- 
mology group, ealled the integral character. 
Conversely, every eharacter on the homology 
group is shown to be the integral eharaeter 
of some D. In terms of this notion, Abel’s 
theorem can be stated as follows: D is the 
prineipal divisor elass if and only if = 

1 for every x. This result shows that the 1- 
dimensional homology group with integral 
eoeffieients and the group of the divisor elasses 
of degree zero (with eompaet topology) are, 
with respeet to integral eharaeters, mutually 
dual (in the sense of Pontryagin) topologieal 
Abelian groups (-^ 422 Topologieal Abelian 
Groups). For the relationship between Abelian 
integrals and Jacobian varieties, in partieular 
the ■'■Jacobi inverse problem, ■'■Abelian func- 
tions, and tRiemann theta functions, 3 
Abelian 'Varieties L. Also references to 234 
Kleinian Groups, 352 Quasieonformal Map- 
pings, 367 Riemann Surfaces, 416 Teichmuller 
Spaces. 
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Algebraic Geometry 

A. Introduction 

Algebraic geometry is the branch of mathe- 
matics that deals with talgebraic varieties, that 
is, point sets defined by several algebraic equa- 
tions in a space of any dimension or those 
derived from these sets by means of certain 
constructions (- 16 Algebraic 'Varieties). It 
may also be considered to be a theory of the 
■'■field of algebraic functions in several variables 
in geometric language, and it is closely related 
to the theories of complex analytic manifolds, 
commutative algebra, and homological alge- 
bra. It also has an important connection 
with number theory through the theories of 
automorphic functions, Diophantine equa- 
tions, and zeta functions. 

To investigate local properties of algebraic 
varieties we consider varieties embedded in an 
taffine space; to study global properties we 
usually consider varieties contained in tprojec- 
tive spaces. A quantity (or property) that is 
invariant under tprojective transformations, 
isomorphisms, i.e., +biregular and +birational 
transformations, or birational transformations 
is called a tprojective invariant, a relative 
invariant, or an absolute invariant (birational 
Invariant), respectively. The study of projective 
invariants is a part of projective geometry, 
whose methods are important in algebraic 
geometry. The notions of relative invariant 
and absolute invariant are used, for example, 
in the classification of algebraic varieties. 
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We usually assume that the coordinates of 
each point of the variety belong to a certain 
fixed tfield K. In the classical case, namely, 
when the field K is the field C of complex 
numbers, the algebraic varieties are considered 
as complex spaces and are studied by apply- 
ing the theories of partial differential equa- 
tions, ■^differential geometry, etc. Topological 
methods may also be applied. Algebraic geom- 
etry originated from such studies, but, for the 
study of properties such as trational mappings 
or talgebraic systems, it became necessary 
to consider as well the case where the ground 
field K is not talgebraically closed. Further- 
more, to apply this to number theory, it is 
necessary to establish the theory over the field 
of any tcharacteristic p. For this purpose it is 
necessary to establish a theory for varieties 
having ground domains as general as possible. 

B. H istory 

Analytic geometry began with the study of 
lines and quadratic curves (surfaces) and later 
came to include the study of cubic and quartic 
curves (surfaces), and so on. These subjects 
originally belonged exclusively to analytic (or 
projective) geometry. At that time, the study 
could not have been described by so specific a 
title as algebraic geometry. 

The study of such theories as the construc- 
tion of an algebraic plane curve by families 
of curves of lower degree or the talgebraic 
m-n correspondence on a straight line prob- 
ably began with research such as that by M. 
Chasles. The most outstanding event in the 
history of algebraic geometry was the intro- 
duction and development of the theory of 
algebraic functions (-11 Algebraic Func- 
tions) by B. Riemann (1857). Before that time 
the degree of an algebraic curve (surface) was 
the only quantity known to be a projective 
invariant of the curve (surface). 

With the theory of algebraic functions, 
Riemann gathered into one family all the 
curves that can be transformed onto each 
other by birational transformations. As the 
basis for his study, Riemann examined bira- 
tional transformations in place of projective 
transformations. This idea led to the notion of 
the so-called tRiemann surface. The tgenus of 
the surface was obtained as the characteristic 
number of the family of curves. The concept 
of genus was the first absolute invariant to 
appear in the history of algebraic geometry. 
Riemann based his theory on tAbelian inte- 
grals using tDirichlet’s principle, under the 
assumption that any algebraic curve reduces 
to one without tsingularities. 

After Riemann many mathematicians tried 



to reconstruct the theory more precisely with- 
out using transcendental methods. M. Noether 
attempted this reconstruction by using geo- 
metric methods. Using the tCremona trans- 
formation, he confirmed Riemann’ s assump- 
tion for curves: that any algebraic curve on a 
plane can be transformed by a birational 
transformation to a plane curve without sin- 
gularities except for simple tnodes. He also 
contributed in making more precise the basis 
conditions for the tRiemann- Roch theorem, 
which is considered to be one of the most 
important theorems in the field. His results 
on space curves and surfaces are also note- 
worthy. J. Pliicker detined the concept of 
genus in geometric terms and introduced the 
tPliicker coordinates. A. Cayley and A. Brill 
worked along similar lines. Cayley’s idea was 
developed later by B. L. van der Waerden and 
W. L. Chow, who introduced the tassociated 
form of an algebraic variety and its tChow 
coordinates. 

Around 1890 the Italian school of algebraic 
geometry appeared. Following the tradition 
established by Noether, they employed alge- 
brogeometric methods and uncovered many 
new facts concerning algebraic surfaces. 

Among those who belonged to this school 
were G. Castelnuovo, F. Enriques, and F. 
Sever!. 

In France, H. Poincare and E. Picard ini- 
tiated their study of algebraic functions of two 
complex variables. After them S. Eefschetz 
investigated the theory of complex algebraic 
surfaces [11, 123. The results attained by the 
Italian and French schools were very sugges- 
tive but lacked rigorous foundations. 

On the other hand, rigorous number- 
theoretic theories of algebraic curves appeared 
in Germany. R. Dedekind and H. Weber de- 
veloped the theory of algebraic function fields 
parallel to that of talgebraic number fields. K. 
Hensel introduced the concept of tp-adic num- 
bers in analogy to '•■power series expansions of 
analytic functions. E. Noether constructed an 
abstract theory of tpolynomial ideals from a 
formal theory by E. Easker and E. S. Macau- 
ley. Under her influence there appeared the 
arithmetic algebraic geometry (of curves) 
over an abstract tield as developed by E. K. 
Schmidt and others. 

In the higer-dimensional case, van der 
Waerden attempted to create a more rigorous 
foundation for algebraic geometry under the 
influence of Noether’s abstract ideal theory 
(c. 1930) [14]. He introduced the concept of 
■'■generic points and tspecialization, and specifi- 
cally defined the tmultiplicity of intersections 
of two varieties in a projective space. He 
succeeded in getting a rigorous proof of Be- 
zout’s theorem: In n-dimensional projective 
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space, the number of intersections of an r- 
dimensional algebraic subvariety of degree / 
with an (n — r)'dimensional subvariety of 
degree m is always Im if they intersect in only 
a finite number of points. 

The problem of intersections was taken up 
by C. Chevalley and A. Weil in the 1940s. 
Chevalley developed the ideal theory of tlocal 
rings (studied initially by W. Krull); he intro- 
duced topological concepts and applied them 
to the problem of intersections. The theory in 
this direction was later extended further by P. 
Samuel, M. Nagata, and J.-P. Serre. 

Weil gave foundations of algebraic geometry 
over an abstract field and reconstructed the 
theory by introducing geometric language to 
designate objects of abstract algebra [15]. He 
thus gave quite a new aspect to the theory and 
extended H. Hasse’s arithmetization of the 
theory of algebraic functions in one variable to 
the case of several variables. Reconstructing 
Seven’s theory of algebraic correspondence 
over abstract fields, he succeeded in proving 
an analogy of the tRiemann hypothesis on 
tcongruent zeta functions (- 450 Zeta Func- 
tions N). He also constructed, purely algebrai- 
cally, the entire theory of tAbelian varieties 
independent of characteristic. 

Around 1930, 0. Zariski gave another 
foundation to algebraic geometry by applying 
the generalized -i- Valuation theory that had 
been introduced by Krull. Zariski clarified 
especially the properties of birational trans- 
formations by using valuation theory. Zariski’ s 
main theorem states that if a birational map- 
ping is not tregular at a tnormal point P (- 
16 Algebraic Varieties 1), each component of 
the image of P by the mapping is of dimension 
^ 1 . 

Zariski also solved the problem of treso- 
lution of singularities in the affirmative in the 
case of characteristic 0 for varieties of dimen- 
sion 3. The affirmative resolution of this 
problem (which Riemann assumed) says that 
any algebraic variety in a projective space can 
be transformed birationally to a projective 
algebraic variety without singularities. In 
1964, H. Hironaka gave an affirmative 
answer for any dimension in the case of char- 
acteristic 0. 

Along with the achievements in algebraic 
methods, great development took place in 
analytic methods. Unification of the concepts 
of Riemann surfaces and tRiemannian mani- 
folds led to the concept of tcomplex analytic 
manifolds. Furthermore, G. de Rham’s theo- 
rem on the duality of topologically defined 
homology and cohomology based on differen- 
tial forms was proved; also, W. V. D. Hodge’s 
theory of tharmonic integrals was developed. 
In the case of the complex dimension 1, any 
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compact Riemann surface is derived from a 
certain projective algebraic curve. However, 
the situation is not so simple in the case of 
higher dimensions. Weil’s concept of tabstract 
complete algebraic varieties can be consid- 
ered as an analog of compact complex mani- 
folds. If a compact complex analytic variety is 
projective, then it must be an algebraic variety 
(tChow’s theorem). K. Kodaira proved that a 
necessary and sufficient condition for a com- 
pact complex analytic manifold to be biholo- 
morphically equivalent to a projective com- 
plex analytic manifold is that the manifold is 
a tHodge variety. 

Using results on harmonic integrals, J. Igusa 
and Weil established the theory of -i-Picard and 
+Albanese varieties associated with algebraic 
manifolds of arbitrary dimensions as a gen- 
eralization of the theory of tJacobian varieties 
associated with algebraic curves (- 9 Alge- 
braic Curves E). Thus many ambiguities in 
the theory as developed by the Italian school 
were clarified. Later the theory was gen- 
eralized to the case of characteristic p by T. 
Matsusaka, Chow, and S. Lang. The duality 
theorem in this case was later proved by M. 
Nishi and P. Cartier (- 3 Abelian Varieties 
D). 

The concept of '’'sheaves (- 383 Sheaves) 
had already been used in Kodaira’ s theory. 
Serre defined an abstract algebraic variety as a 
'•'ringed space by using an analogy to the con- 
cept of complex analytic spaces ; he considered 
it as a topological space with respect to the 
tZariski topology. By introducing tcoherent 
algebraic sheaves, Serre claritred the idea that 
classical invariants (such as tarithmetic genus) 
may be considered cohomological quantities 
16 Algebraic Varieties E). 

A. Grothendieck invented the concept of a 
tscheme, which is far more general than that of 
an algebraic variety, by admitting the existence 
of '•'nilpotent elements in structure sheaves and 
taking as a coordinate ring a general commu- 
tative ring with unity element. By the device 
of taking into account nilpotent elements, an 
analog of the method of successive approxima- 
tion in analysis is now applicable. By master- 
ful use of cohomological techniques, Grothen- 
dieck derived many results, including Zariski’ s 
important theorems. 

In the classical case, E. Hirzebruch gen- 
eralized the Riemann-Roch theorem to higher- 
dimensional manifolds. He made use of the 
language of sheaves and some topological 
results of A. Borel and R. Thom [S]. Later 
Grothendieck generalized the theorem for the 
abstract case as well. His idea in this work is 
recognized as the origin of tX-theory. 

Every nonsingular complete curve of genus 
0 is isomorphic to the projective line, and any 
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nonsingular complete curve of genus 1 is iso- 
morphic to a projective curve delrned by the 
equation XjXQ = Xy{Xy -X,)(X, -iXo) for 
some /I/O, 1. On the other hand, the set of all 
isomorphism classes of nonsingular complete 
curves of genus > 1 is parametrized by a 
normal quasiprojective variety of dimension 
3g — 3. Such facts were first discussed by Rie- 
mann as the problem of moduli. 

Concerning the moduli of manifolds of 
higher dimensions in the classical case, 

Kodaira and D. C. Spencer developed their 
theory of tdeformations of complex structures 
(-^72 Complex Manifolds G). The meaning of 
number of tipoduli is clarified by deformation 
theory. Deformation theory has been extended 
in various ways, and deformation is considered 
as one of the fundamental concepts in alge- 
braic geometry. 

To investigate the global structure of the 
moduli varieties, D. Mumford introduced 
geometric invariant theory (- 16 Algebraic 
Varieties W). 

Etale and crystalline cohomologies initiated 
by Grothendieck and others are useful for the 
study of algebraic varieties of positive charac- 
teristic. In particular, the conjecture made by 
Weil concerning tcongruent zeta functions has 
been solved affirmatively by P. Deligne with 
the help of etale cohomology [2]. 

Many important questions have been 
answered by means of the geometric theory. 

For example, (1) every tvector bundle on AJJ is 
trivial (- 16 Algebraic Varieties Z); (2) there 
exist tunirational but nonrational fields over C 
(-16 Algebraic Varieties J); (3) the funda- 
mental groups of the complements of node- 
curves on are commutative (-16 Alge- 
braic Varieties I); (4) the cancellation theorem 
holds for A^ (-^ 15 Algebraic Surfaces H); (5) 
whenever the tangent vector bundles are am- 
ple, the varieties are PJJ (-^ 16 Algebraic Var- 
ieties R). 
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Algebraic Groups 

A. Definitions and General Remarks [1,2,8] 

Let k be a field and Q a tuniversal domain 
containing it. An affine algebraic group G 
defined over k is, by definition, a group G 
which has the structure of a (not necessarily 
irreducible) talgebraic variety defined over k in 
an affine space such that the group oper- 
ation (x, y) h- > X y on G is an everywhere 
■•■regular trational mapping defined over k. For 
such a group G, the set of all k-trational 
points on G is an abstract group. The tjrreduc- 
ible component Gq of G (viewed as an alge- 
braic set) containing the identity element ^ is 
unique and is a normal subgroup defined over 
k with finite index in G; the decomposition of 
G into (absolute) irreducible components 
coincides with that into the cosets of G by G,. 
When G = Gq, the group G is called connected. 
It should be noted that for an algebraic group 
G defined over k, a coset qGq is not necessarily 
defined over k; and if it is, a representative g 
cannot necessarily be taken to be k-rational. 
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However, if k is an infinite field and G is con- 
nected, and if moreover k is '''perfect or G is 
■^reductive, then is ^Zariski dense in G ([9], 
A. Borel and T. A. Springer [30]); hence G is 
uniquely determined by G^. (In this case, is 
sometimes referred to as a k-group.) When 
k is a ttopological tield, G,^ is a -i-topological 
group with respect to the natural topology de- 
fined by that of k, which is generally stronger 
than the Zariski topology on G^. For instance, 
when k = R, (Gq)r is a tLie group with finitely 
many connected components. 

A subgroup H of an affine algebraic group 
G, which is Zariski closed, is an affine alge- 
braic group with respect to its natural induced 
structure and is called an algebraic subgroup 
of G. If H is defined over k, then H is k-closed; 
the converse is also true when k is ^'perfect. 

(An affine algebraic set A is called k-closed 
if A is a common zero of a set of polynomial 
equations with coefficients in k. The set A is 
k-closed if and only if all irreducible com- 
ponents of A are detined over the algebraic 
closure k of k, and for every IGalois auto- 
morphism (T of k/k, A^ — A.) The notions of 
homomorphism and isomorphism for alge- 
braic groups can be detined in a natural 
manner. For instance, for affine algebraic 
groups G and G’ defined over k, a rational 
homomorphism (p : G G’ defined over k or a 
k-morpbism for short, is a homomorphism of 
G into G’ that is at the same time an (every- 
where regular) rational mapping defined over 
k. For a k-morphism cp of G into G’, the (set- 
theoretic) image (p(G) is a closed subgroup 
of G’ defined over k, the kernel ^ (e’) is a fc- 

closed subgroup of G, and dim (p(G) = dim G — 
dim (e’). In particular, when dim G = 
dim (p(G) = dim G’ (or equivalently, when 
<p(Gq)= G'o and (p~'(e') is finite), cp is called an 
isogeny. (Two groups G and G’ are called 
isogenous if there exist a third group G” and 
isogenies G"->G, G"->G'.) When a k-morphism 
(p is bijective and ^ is also an (everywhere 
regular) rational mapping defined over k , (p is 
called a hirational isomorphism defined over k 
or a k- isomorphism for short. It should be 
noted that a rational homomorphism which 
is an isomorphism of abstract groups is 
not necessarily an isomorphism of algebraic 
groups (e.g., a Frobenius homomorphism); a 
similar statement holds for an injective con- 
tinuous homomorphism of topological groups. 

Given a connected affine algebraic group G 
and a closed subgroup H , both defined over k, 
the quotient space G/H has the uniquely deter- 
mined structure of an algebraic variety defined 
over k such that the canonical mapping G~* 
G/H is separable. The tfunction field of G/H 
is then identified with the subfield of the func- 
tion field of G formed by all H-invariant ele- 
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ments. In particular, if H is a closed normal 
subgroup, then G/H has a natural structure of 
an affine algebraic group (defined over the 
same ground field k) [8, 10, 11]. 

As an example, we have the group GL(n) of 
all n X n nonsingular matrices (x,j). GL{n) may 
be viewed as an algebraic set in defined 

by a single equation det(Xi^) y = 1, and as such 
is a connected algebraic group defined over 
the 'Iprime field. In general, an algebraic group 
reahzed as a closed subgroup of GL(n) is called 
a linear algebraic group. Since an affine alge- 
braic group is always isomorphic to a linear 
algebraic group, these two terminologies are 
essentially synonymous [1,2]. 

B. Generalization of the Definition 

Replacing the term affine algebraic set (or 
affine cariety) in the definition of an affine 
algebraic group by a more general term ^alge- 
braic variety, we obtain the notion of an 
algebraic group (algebraic group variety, or 
simply group variety). On this subject, the 
following facts are fundamental. A ^complete 
connected algebraic group is an Abelian 
variety (-^ 3 Abelian Varieties). More gener- 
ally, given a connected algebraic group G 
defined over k, there always exists a (k-closed) 
largest linear connected closed normal sub- 
group L, and the factor group G/L is an 
Abelian variety. Furthermore, for a closed 
normal subgroup H of G, the factor group 
G/H is complete if and only if H d L (Che- 
valley’s theorem [8]). In particular, if a con- 
nected algebraic group G is complete and 
linear at the same time, then G reduces to the 
identity group. In view of these theorems, the 
study of algebraic groups can be reduced, in a 
sense, to the study of Abelian varieties and 
linear algebraic groups. For this reason, we 
henceforth restrict ourselves to linear algebraic 
groups, which are simply called algebraic 
groups. (The notion of ^generalized Jacobian 
variety, introduced by M. Rosenlicht [12], is 
an example of an algebraic group in a general 
sense; 9 Algebraic Curves.) The notion of 
algebraic groups has been generahzed further 
to that of 'tgroup schemes by A. Grothendieck 
[3,4]. 

C. Lie Algebras 

Since an algebraic group G defined over k has 
no singularities, the ttangent space g to G at 
the identity element e is detined and has the 
same dimension as G : dim g = dim G. The space 
g can be identified in a natural manner with 
the space of all left-invariant '^derivations of 
the function field of Gq and thus has the struc- 
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ture of a Lie algebra defined over k (-^ 248 Lie 
Algebras). We call g (the Lie algebra g^ over k 
of all k-rational points in g) the Lie algebra of 
G (of the k-group Gj^]. If G is a hnear algebraic 
group contained in GL{n), then g^ is a Lie 
subalgebra of gl(n, k) with the Lie product 
defined by [x, y] = xy — yx; a hnear Lie algebra 
corresponding to a linear algebraic group is 
caUed an algebraic Lie algebra. When the 
characteristic of k is zero, conditions for a 
linear Lie algebra to be algebraic can be given 
in terms of the replica [5]. Also, in the case of 
characteristic zero, for x 6 gl(n, k), xe if and 
only if exp(fx)e G, where t is a variable over k 
and exp (lx) is understood as a tformal power 
series in t (contained in Q), From this, we can 
prove, exactly as in the theory of Lie groups 
249 Lie Croups), a one-to-one correspon- 
dence between k-closed subgroups H of G and 
algebraic Lie subalgebras l)^ of g^, establish- 
ing a complete parallelism of the theories of 
algebraic groups and Lie algebras [5]. This 
parallelism breaks down when k has positive 
characteristic [1, 2]. On the other hand, ovcr a 
field of characteristic p > 0, we have formal 
groups, an analog of local Lie groups intro- 
duced by J. Dieudonne [ 13], and also '•'hyper- 
algebras, which play the role of Lie algebras in 
characteristic 0. 



D. Tori [1,2] 

The group = GL(1), the multiplicative 
group of nonzero elements in fi, is a 1 - 
dimensional connected algebraic group defined 
over the prime field, hi general, an algebraic 
group G that is isomorphic to the direct prod- 
uct (GJ " is called an (algebraic) torus. When a 
torus G defined over k is isomorphic to (G J 
over an extension Kofk,G is called K-trivial 
(or K-split), and the field K is called a splitting 
field for G. A torus G defined over k always has 
a splitting field K which is a finite separable 
extension of k. 

In general, a rational homomorphism y of 
an algebraic group G into G^ is called a char- 
acter of G. If we define the sum of two charac- 
ters ziandz 2 ofGby(/, +X 2 ){g) = Xi{g)- yjg) 
(g eG), the totality of characters of G is an 
additive group, called the character module of 
G and denoted by X(G). Let G be a torus 
defined over k and X =X(G) its character 
module, and let K be a splitting field for G 
that is a finite tGalois extension of k. If a K- 
isomorphism G = (GJ" is given by the corre- 
spondence G^g^(xj(g), ,xJg)), then the A- 
are characters of G, and X is a tfree module of 
rank n generated by Xi, ,y„. Furthermore, if 
r denotes the Galois group of K/k, then for 
(7 6 r and xeX, the conjugate y'’ is also a char- 



acter of G; under this action of F, X becomes a 
right f-module. We have complete duahty 
between a torus G and its character module X 
in the following sense. There exists a one-tO- 
one correspondence between the closed sub- 
groups Gj (defined over k) of G and a (F- 
invariant) submodule Xj of X for which X/X, 
has no p-+torsion (where p is the character- 
istic of k). This correspondence is determined 
by the relation of the annihilators Xj = G|, 

Gj =X|, and under the correspondence the 
character modules of G^ and of G/Gj are 
canonically identiiied with X/X, and Xj, 
respectively. Furthermore, let G’ be another 
torus (defined over k and split over K) with the 
character module X’, and suppose that we 
have a (k-) homomorphism (p:G-*G\ Then we 
can define a (F-) homomorphism ^(p:X'-*X, 
called the dual homomorphism of (p^ by the 
relation '(p(y') = y' O (p for £ X’ ; conversely, 
any (F-) homomorphism of X’ into X is ob- 
tained uniquely in this manner. In particular, 

^ is a (k-) isomorphism if and only if its dual 
'(/) is a (F-) isomorphism. Since for any free (F-) 
module X of finite rank there always exists 
a torus G (defined over k and split over K) 
such that X(G) = X (as a special case of the 
existence theorem of k-forms), the tcategories 
of all tori (defined over k and split over K) and 
that of all free (F-) modules of finite rank are 
mutually dual. 



E. Semisimple Elements and Unipotent 
Elements 

A matrix a is called tsemisimple if it is diago- 
nalizable, i.e., if the tminimal polynomial of a 
has only simple roots. A matrix q is called 
■''unipotent if a 1 is nilpotent, i.e., if all charac- 
teristic roots of a are equal to 1 . (When the 
characteristic of the ground field is zero, the 
unipotent elements u in GL{n, k) and the nilpo- 
tent elements x in gl(n, k) are in one-to-one 
correspondence by the relation u = expx.) Any 
nonsingular matrix a can be written uniquely 
as a product of a nonsingular semisimple ma- 
trix a’ and a unipotent matrix a” of the same 
size which are mutually commutative: a = 
u’a” = a" a' ('^multiplicative Jordan decompo- 
sition); a' (a”) is called the semisimple (unipo- 
tent) part of a and is denoted by (a,); a, can 
be expressed as a polynomial of the matrix a 
with scalar coefficients. For an element a of a 
(linear) algebraic group G, the semisimplicity 
(unipotency) of a does not depend on the 
matrix representation of G. Moreover, these 
properties are preserved by homomorphisms 
of algebraic groups. Also, if G, then a„ UyE 
G. 

For an algebraic group G, we denote the 
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totality of semisimple (unipotent) elements 
contained in G by (G„) and call it the semi- 
simple (unipotent) part of G. (Note that Gj and 
G„ are not necessarily subgroups.) A torus G is 
then characterized by the property that G = 

Gq = Gj. On the other hand, an algebraic group 
G such that G = G„ is called unipotent. For 
instance, the additive group of the universal 
domain, 




is a 1 -dimensional connected unipotent alge- 
braic group. 

F. Solvable Groups and Nilpotent Groups 
[1,2, 7,9] 

For two closed normal subgroups //,, H 2 
(deiined over k) of an algebraic group G, the 
■'^commutator group [//j , H 2 ] (in the sense of 
abstract group theory) is also a closed normal 
subgroup (defined over k) of G. In view of this 
fact, an algebraic group G is called solvable 
(nilpotent), when it is tsolvable (^nilpotent) as 
an abstract group. For example, the totality 
T(n) of ux n nonsingular upper unipotent ma- 
trices, i.e., matrices of the form 

/ I M ^ \ 

, is a connected solvable algebraic 

1 0 * I 

group. A unipotent algebraic group is always 
nilpotent. 

For any connected solvable algebraic group 
G e GL(n), there always exists an element a in 
GL(n) such that a~^ Ga c T(n) (Lie-Kolcbin 
theorem [1,2]). A connected solvable algebraic 
group G has a tcomposition series G = GqoGj 
=) ■ ■ ■ =3 G,. = {e} such that each G,- is a connected 
closed normal subgroup of G and G,- i/G,- 
is isomorphic to either or G,. If G is de- 
fined over k, the subgroup G„ is a connected k- 
closed normal subgroup of G, and for any 
maximal torus T in G, we have a decompo- 
sition into a tsemidirect product G = F-G„ (in 
the sense of algebraic groups, i.e., the natural 
map r X G„^G is birational). It is known 
that for any algebraic group G defined over k 
there exists a maximal torus defined over k 
([4], A. Borel and T. A. Springer [30]). G is 
nilpotent if and only if G has a unique maxi- 
mal torus T ; when that is so, T= G, and T is 
contained in the ''‘center of G. For a connected 
solvable algebraic group G defined over k, we 
can take a e GL{n, k) such that Gn c T(n) 

(see the Lie-Kolchin theorem) if and only if all 
characters /eA'(G) are defined over k; when 
this condition is satisfied, G is called k-solvable. 
G„ is then defined over k, and G/G„ is a k- 
trivial torus. 
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■When the characteristic of k is zero, any 
commutative unipotent algebraic group (de- 
fined over k) is (k-) isomorphic to the direct 
product (G J". When k is an algebraically 
closed field of characteristic p > 0, any con- 
nected commutative unipotent algebraic group 
defined over k is k-isogenous to a direct prod- 
uct of a certain number of the groups of 
■''Witt vectors (of length m) (Chevalley-Chow 
theorem [ 12]). A 1 -dimensional connected 
unipotent algebraic group detined over a per- 
fect field k is k-isomorphic to [ 1,2]. 

G. Borel’s Theory 

Let G be an algebraic group and V an alge- 
braic variety (both defined over k). We say 
that Visa transformation space of G (defined 
over k), or simply G acts on F, if there is given 
an everywhere regular rational mapping G x 
VB{g,v)^gvey (defined over k) such that 
!7i(.d2^) = (3i.‘72)G ev=v (gi,g 2 eG,vG V). When 
the action of G on V is ‘'"transitive, V is called a 
^‘homogeneous space of G. For a closed sub- 
group H of a connected algebraic group G 
(both defined over k), the quotient space G/H 
has the natural structure of a homogeneous 
space of G (deiined over k). A. Borel [1,2] 
proved the following theorems: 

(1) If G is a connected solvable algebraic 
group and V a complete transformation space 
of G, then G has at least one fixed point in V . 
More precisely, in order that a connected 
algebraic group G defined over k be k-solvable, 
it is necessary and sufficient that for any com- 
plete transformation space V of G defined over 
k for which # 0, G have at least one k- 
rational fixed point in V [9], 

(2) Let G be a connected algebraic group. A 
maximal connected solvable closed subgroup 
of G is called a Borel subgroup of G. Then (i) all 
pairs (T, B) formed by a maximal torus T in G 
and a Borel subgroup B containing it are con- 
jugate to each other with respect to inner auto- 
morphisms of G. (ii) For a closed subgroup 

H of G, the quotient space G/H is complete 
if and only if H contains a Borel subgroup 
of G; and, when that is so, G/H is actually a 
^projective algebraic variety. (For instance, 
if G = GL{n),B = T(n), then G/B is a so-called 
tflag manifold.) (iii) The conjugates of B(T) 
cover the whole group G (GJ. A closed sub- 
group of G is called parabolic if it contains 
a Borel subgroup of G. A parabolic subgroup 
H coincides with its own +normalizer N(H); in 
particular, H is always connected. Parabolic 
subgroups are significant in the theory of 
automorphic functions. (For the parabolic 
subgroups associated with BN-pairs Sec- 
tions Q, R.) 
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When G is a connected algebraic group 
detined over a perfect field k, proposition (i) 
can be sharpened: The pairs (A, H) formed by 
a maximal k-trivial torus A in G and a maxi- 
mal connected k-solvable subgroup H con- 
taining it are conjugate to each other with 
respect to the inner automorphisms detined by 
elements in The normalizer N (H ) of such a 
k-solvable subgroup H is a minimal k-closed 
parabolic subgroup of G. When the maximal 
connected k-solvable subgroups of G are re- 
duced to the identity group, G is called k- 
compact or k-anisotropic. (Otherwise, G is 
called k-isotropic.) For instance, the tortho- 
gonal group G = SO{n, f) of a tquadratic form 
/ of n variables is k-compact if and only if the 
form / is anisotropic, i.e., the homogeneous 
equation f=0 has no solution other than 
zero in k. Similar facts hold for other classi- 
cal groups. When k is a +local field, G is k~ 
compact if and only if Gjj is compact as a 
topological group. In general, a k-compact 
group is +reductive. 

H. The Weyl Group 

Let G be a connected algebraic group and Q 
an arbitrary torus in G. The ^’centralizer Z(Q) 
of Q is then connected and coincides with the 
connected component of the normalizer N(Q). 
Hence the factor group W= N(Q)/Z{Q) is finite 
and can be identified vvith a subgroup of the 
automorphism group of Q (or of its character 
module X(Q)) in a natural manner. The group 
W is called the Weyl group of G relative to Q. 

In particular, when Q = T (a maximal torus), 
the order of W is equal to the number of Borel 
subgroups containing T. In this case, the cen- 
tralizer C = Z(T) is called a Cartan subgroup of 
G; it is characterized by the property that C is 
a (maximal) connected nilpotent closed sub- 
group of G which coincides vvith the connected 
component of its own normalizer N(C). The 
notions of Borel subgroups, Cartan subgroups, 
and maximal tori are preserved under rational 
homomorphisms of algebraic groups. 

I. Semisimple Groups and Reductive Groups 

In an algebraic group G defined Over k, there 
exists a largest connected solvable closed 
normal subgroup R , called the radical of G. 

The unipotent part of R is called the unipo- 
tent radical of G. When R = {e}, G is called 
semisimple. When R is a torus, namely, = 
{e}, G is called reductive. Semisimplicity and 
reductiveness are preserved under forming a 
direct product and taking the image (or in- 
verse image) of an isogeny. For a reductive 
group G, the -i-commutator subgroup D(G) is 



semisimple, and G = D(G) • R , D (G ) fl R = finite; 
in other words, G is isogenous to the direct 
product of a connected semisimple algebraic 
group and a torus. In general, if R is the rad- 
ical of a connected algebraic group G and is 
the unipotent radical of R, then the factor 
groups G/R, G/R^ are semisimple and reduc- 
tive, respectively. Furthermore, if the charac- 
teristic of the field k is zero, there exists a 
reductive closed subgroup H of G such that G 
decomposes into a semidirect product G = 

H- Ru (Chevalley decomposition [5]). (In this 
case, R and R^ are k-closed, and H can be 
taken to be k-rational; such an H is unique up 
to inner automorphisms defined by elements in 
Gj.) Also in the case of characteristic zero, 
reductive algebraic groups are characterized 
by the property that all rational representa- 
tions are completely reducible. But when k 
has the characteristic p >0, this property char- 
acterizes tori (M. Nagata). 

J. Root Systems [1,2,14] 

Let G be a connected semisimple algebraic 
group, T a maximal torus, and X =X(T) its 
character module. A character ae X is called a 
root of G relative to T if there exists an iso- 
morphism of Ga onto its image in G such 
that 

r‘x,(0t = x,(a(t)c) for all teT. 

For a root a, such an isomorphism is 
uniquely determined up to a scalar multipli- 
cation in G^; hence we put = x^(GJ. 

If we denote by r the totality of roots (rela- 
tive to T), r satisfies the following axioms, 
where E = X ® Q and E* is the ’'’dual space of 
E with respect to the inner product ( ): (i) For 
each a e r, there corresponds a* 6 E* such that 
<3c*,a> = 2 and <a*,j5>£Z for all [ier. (ii) If we 
define a reflection of E by 

vv^x = x-<a*,x>a for xe£, 

then for all jSer. (In particular, w^cl = 

— aer.) (in) If a, are linearly dependent, 
then fi= ±ot. (iv) If dim E = r, r contains r 
linearly independent elements. 

In general, a finite subset r in a finite- 
dimensional vector space E over Q satisfying 
the axioms (i)-(iv) is called a root system in E. 
(This root system is sometimes said to be 
reduced, to distinguish it from the root system 
defined in Section Q, which does not satisfy 
axiom (hi).) For a root system r, the elements 
X* of E* corresponding to XGI are uniquely 
determined by these conditions, and the set 
r* = {a*} is a root system in E* (a* er* is 
called a coroot). Also, the group W of linear 
transformations of E (£*) generated by 
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(w,,*) with aer is finite and is called the Weyl 
group of the root system r, If we identify E* 
with E by means of any M/-invariant (positive 
definite) metric on £, then a* =(2/<a, a))a. 
When r is a root system of a semisimple alge- 
braic group, 

(a*,x)eZ for all aer, yeX, (I) 

so that X is M^-invariant, and the Weyl group 
of the root system r can be identified with the 
group N(T)/Z(T) of Section H. (In general, a 
maximal torus in a connected reductive alge- 
braic group coincides with its own centralizer, 
so that the Weyl group W can be identified 
with N( T)/T.) 

When a tfinear ordering (compatible with 
the addition) is given in E, we denote by the 
set of all positive roots in r. An element aer+ 
is called a simple root if it cannot be written as 
a = a' + a" with a', a"er+. If A = {ai,...,a,} is 
the totality of (distinct) simple roots in r+,the 
elements are linearly independent, 

and any root a 6 f can be written uniquely in 
the form a = ± Xj=i with m, 6 Z, m,- ^ 0. In 

general, a subset A of r having this property is 
called a fundamental system; a fundamental 
system is always obtained in the manner ex- 
plained from a linear ordering on £. For a 
fundamental system A, the cone A, in 
defined as the set of x in E* satisfying the 
inequalities x) > 0 (1 ^ i ^ r), is called a 
Weyl chamber. If we denote by L, the hyper- 
plane defined by the linear equation (a, x) = 0 
for a root a, then £* — = IJa A^, and W 

acts ■•'simply transitively on the set of all Weyl 
chambers {A^}. The Weyl group W is gen- 
erated by r reflections (1 ^ < r). 

In a semisimple algebraic group G, Borel 
subgroups B containing a (fixed) maximal 
torus Tare in one-to-one correspondence with 
the fundamental systems A (or r-i-) relative to T 
by the relation B^,= riaer+ where (GJ. 

(More precisely, every element in can be 
written uniquely as a product of the elements 
in Pj,, where the ordering of the is taken 
arbitrarily.) 

K. Bruhat Decomposition 

If we take a representative of VVG W in N(T), 
there is a decomposition G= IJwefr f^S^B 
(disjoint union). Furthermore, if for w £ IT we 
put = riaer+ nwr+ iu particular N = 

denote by Wq the unique element 
in W such that WqA= -A, then the element 
in Bs„B can be written uniquely as a product 
of elements in s^T,N. Hence we have 

G= Uiv..,-.s„r-)v, 

which is called a Bruhat decomposition of G. In 
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particular, if we put N’ = (which is 

the unipotent part of the Borel subgroup cor- 
responding to -A), then N’TN is a Zariski 
open set in G, and the natural map N’ x T x 
N-^G is birational. This implies that the 
function field of G is rational (i.e., tpurely 
transcendental O'Ver H). 

L. Structure of Semisimple Groups 

A subset of a root system r is called a closed 
subsystem if tj ^ fl r = r, , where r, ^ denotes the 
submodule of X generated by A closed 
subsystem satisfies conditions (i), (ii), and (hi) 
of a root system. For a closed subsystem of 
a root system r of a semisimple algebraic 
group G, the subgroup G(tj) of G generated by 
the Pj, (a £ r j) is a semisimple closed subgroup 
with a maximal torus T, =(G(ri)n T)q, of 
which the root system relative to T, coincides 
with the restriction of on T^ and the corOOt 
system can be identified with rf == (a* |a£ri}. 
The subgroup GjrJ is normal if and only if 
r — is also a closed subsystem; when this is 
so, G = G(ri)- G(r-r]),G(r,)nG(r-r,) = finite. 
All connected closed normal subgroups of G 
are obtained in this manner. In order that G 
be simple (sometimes called absolutely simple 
or almost simple) as an algebraic group (i.e., 
without proper connected normal subgroups), 
it is necessary and sufficient that r be irreduc- 
ible (i.e., r cannot be decomposed into a dis- 
joint union of two proper closed subsystems). 
In general, a root system r can be decomposed 
uniquely into the disjoint union r = tj UU r, 
of irreducible closed subsystems t; such that 
r, UU r,- (1 ^ ^ s) are also closed subsystems; 
correspondingly, G is isogenous to the direct 
product Gi X X Gj of (absolutely) simple 
algebraic groups G; = G(t;). (G is actually a 
direct product if it is simply connected or an 
adjoint group.) The subgroups G,- are deter- 
mined uniquely and only by G. 

M. k-Forms [15] 

Let K be an extension of k and Gj an alge- 
braic group defined over K. An algebraic 
group G defmed over k is called a k-form 
of Gj if there is a K-isomorphism f of G ontO 
G[. Suppose further that K/k is finite sepa- 
rable, and for every Galois automorphism 
(T of k/k, put f~K Then is an iso- 

morphism of Gj onto G[, and the satisfy 
the relation (p/^o (p^= (p„^. Conversely, given 
a collection of isomorphisms { (p„} satisfying 
these conditions, there always exists a k-form 
G (with a K-isomorphism f onto Gj SUCh 
that (p^— f’o /’“'), which is unique up to k- 
isomorphism (Weil). In particular, if K/k is a 
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finite Galois extension with Galois group f, 
then { is a (continuous) +1 -cocycle of F in 
AutK(Gj) (the group of all K- automorphisms 
of G,), and by the above correspondence the k- 
isomorphism classes of k-forms G are in one- 
to-one correspondence with the (continuous) 

1 -■^cohomology classes of the cocycle {cp^} (in 
the cohomology set //'(r, Aut^(G,))) (- 172 
Galois Theory J) 

To a given finite separable extension K/k of 
degree d and an algebraic group Gj defined 
over K of dimension n, we can associate a 
certain algebraic group defined over k 

of dimension dn, which is obtained from Gj by 
restricting the ground field [ 19]. A more pre- 
cise definition is as follows. Let {cr, , (72? • > 

((Ti = 1) be a set of automorphisms of k/ksuch 
that (T; K ( 1 ^ I d) are all distinct. Then one 
can find a k-form G of G, = nf=i whh an 
isomorphism /: G->-Gi such that = /" o/~^ 
is given by ^„((x,)) = (x,.r), where r is defined 
by the relation (a, a) K = (t,<7 K. If we denote 
by Pi the canonical projection of G, onto its 
first component and put p= P\Of then the 
pair (G, p) is uniquely characterized (up to k- 
isomorphism) by the following universality 
property: If G' is any algebraic group defined 
over k and q>is a K-morphism of G' into Gj, 
then there exists a (uniquely determined) k- 
morphism ip of G' into G such that (p= po ip. 
The group Q (together with p) is denoted by 
'(Ri^,,j,(G,). For the group of rational points, 

G^ = G, jf. 'When the algebraic group Gj has 
some additional structure (such as that of 
■’'vector space, talgebra, etc.), then iR*;/jt(Gi) 
automatically has the same kind of additional 
structure. 



N. Chevalley’s Fundamental Theorems 

Let G, G’ be connected semisimple algebraic 
groups, and let T( T’) be a maximal torus in 
G(G'), X(X’) its character module, r (r’) a root 
system of G (G’) relative to T (T’), etc. If we 
have an +isogeny cpof G onto G’ such that 
<p(T) = T’, then there is a bijection of r 
onto r' such that i^(a') = q^<x, where ijj is the 
dual homomorphism of cp T and is a posi- 
tive integer, which equals 1 if the characteristic 
is zero and is a power of p if the characteristic 
is p > 0. Conversely, any injective homomor- 
phism if/ : X'-yX satisfying this condition (with 
respect to a certain bijection r->r' and qj 
comes from an isogeny (p: G-^G' in the mannei 
already stated. In particular, is an isomor- 
phism if and only if ij/ is an isomorphism such 
that i//(r') = r (i.e., q^ = 1 for all aer) [7]. The 
isomorphism class of G is thus completely 
determined by the pair (X, r), so that we some- 
times write G = G{X, r). A connected semi- 



simple algebraic group G defined OVer k is 
called of Chevalley type over k(or k-split) if 
there exists a k- trivial maximal toms T in G. If, 
in the above theorem, G and G’ are of Cheval- 
ley type over k and T and T' are k-trivial, then 
the theorem remains tme if we replace isogeny 
by k-isogeny. In particular, the k-isomorphism 
class of a connected semisimple algebraic 
group of Chevalley type over k is completely 
determined by (X, r). Chevalley also showed 
that, for any pair (X, r) satisfying condition (1) 
above, there exists a connected semisimple 
algebraic group G{X, r) of Chevalley type 
defined over the prime field. Therefore, since 
the classification of semisimple algebraic 
groups of Chevalley type is reduced essentially 
to that of root systems (X, r), it turns out that, 
over any ground field k, there exist as many 
connected simple algebraic groups of Cheval- 
ley type as connected simple complex Lie 
groups (-^ 249 Lie Groups; Appendix A, Table 
5.1). 

For a given semisimple algebraic group 
G = G(X, t) defined over k, put Xq = (= 

the submodule of X generated by r), X” = 
{xeF| <a*,x>eZ for all aer}. Then we have 
natural isogenies G(X®,r)->^G(X,r)^G(Xo, r) 
(with q^= 1 ), all of which can be taken to be 
defined over k. The group G(X°, r) (G(2(o, r)) is 
called the simply connected group (the adjoint 
group) isogenous to G. 'When the characteristic 
of k is zero, these isogenies (which are already 
known in the classical theory of complex Lie 
groups) are essentially the only possible iso- 
genies among the semisimple algebraic groups. 
But when the characteristic is p > 0, there are, 
in addition to these, the Frobenius homo- 
morphism (with = p) and the following 
“singular” isogenies (for which = 1 or p de- 
pending on F 4 ^F 4 , (p = 2), G 2 -»G 2 

(p = 3). In particular, when k is a finite field, 
taking the set of fixed points of the singular k- 
isogenies, we obtain the simple finite groups of 
M. Suzuki and R. Ree (- 151 Finite Groups). 

0. Classification Theory 

A connected semisimple algebraic group G 
defined over k is called k- (almost) simple if 
there is no proper connected closed normal 
subgroup of G defined over k. CWhen G is k- 
simple and /c-split, the factor group D{G^)/ 
center is an abstract simple group except for 
a few special cases [17]. For more general 
results Section Q.) For a k-simple algebraic 
group G, let G, be any one of its absolutely 
simple components, and let k, be the smallest 
field of definition for Gj containing k. Then 
k^/k is a finite separable extension, and G is k- 
isogenous to ,t(Gi). Hence the problem of 
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classifying all k-simple groups (up to isogeny) 
is equivalent to that of finding all k, -forms of 
simple groups of Chevalley type. This latter 
problem can be reduced, in principle, to the 
classification of compact k,-forms and that of 
certain diagrams (i.e., tDynkin diagrams along 
with an action of the Galois group) [ 16], J. 

Tits [30] (- Appendix A, Table 5.1). For 
instance, when k is a finite field (or, more gen- 
erally, a field of dimension 1 [15]), there is 
no compact k-simple group; hence, using a 
simple classification theory of the diagrams, 
we can show that the only absolutely simple 
algebraic groups G defined over k ^re either of 
Chevalley type or of the types introduced by 
R. Steinberg (denoted by M„, ^D^,, 

Connected semisimple algebraic groups 
composed of the groups of these types are 
characterized by the property that they have a 
Borel subgroup defined over k. Such groups 
are said to be of Steinberg type over k (or fc- 
quasi-split). Absolutely simple algebraic groups 
over a ''■p-adic held have been classified by M. 
Kneser and J. Tits [30,31]. When the charac- 
teristic of k is not equal to 2, the classification 
of simple groups of classical type (except for the 
type D4) is known to be equivalent to that of 
semisimple tassociative algebras with '•'invo- 
lution [ 18]. A similar relation also holds be- 
tween some of the exceptional simple groups 
and ■''Cayley algebras or t Jordan algebras (H. 
Hijikata, T. A. Springer, J. Tits). 

The following is a fist of absolutely simple 
algebraic groups of classical type. 

I. k-forms of SL{n) (n ^ 2). 

1.1. C,i = SL(ffi,«) = {<,eM.(5<)|N(9) = l(, 
where 51 is a '•'central division algebra 
over k with (5^ ; k) = r^, n = mr, and N 
denotes the deduced norm in M^(5^). 

1.2. G,fc = SC(m,5C/)-{<7eSL(m,5f)| 
f{gx,c]y)^f{x,y) for ye5l'"}, where 
5^ is a central division algebra over a 
quadratic extension k’ of k with an in- 
volution I of the second kind (which 
means that [^Ek'\^' = (] = k), {K:k') = 
r^, n = mr, and / is a (nondegenerate) 
^Hermitian form of m variables over 

with respect to the involution 

II. k-forms of S0(n) (n ^3, n ^4), Spin) {n 

even, n ^ 2), 

Gik — SV{n,S\,f), where 5^ is a central 
division algebra over k with an involu- 
tion I of the first kind (i.e., such that 
{(56k|^' = <^} = k), (5Tk)-r^n = mr, 
and f is a nondegenerate c-Hermitian 
form of tn variables over with re- 
spect to the involution i. In this case, 
dim{(^eK I ^‘^Q=r{r + Eo)/2 with = 
+ 1, and Gj is a k-form of SO or Sp 
according as ceq = 1 or - 1. (50(8) may 
have other k-forms coming from the so- 
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called triality.) The case where the 
characteristic of k is 2 can also be dis- 
cussed by a method given by J. Tits 
{Inventiones M ath. 5 , 1968). 

When k is a local field or an algebraic num- 
ber field, the only central division algebra with 
an involution of the first kind is a •'quaternion 
algebra (and, if / is the “canonical involution,” 

thenfio= 1). 

P. Algebraic Groups over an Algebraic 
Number Field 

Let G be a connected algebraic group detined 
over an algebraic number field k of finite de- 
gree. Let {r} be the totality of tprime divisors 
(i.e., equivalence classes of valuations) of k 
Taking the •'restricted direct product of a 
family of locally compact topological groups 
{ GkJ, we obtain a locally compact topological 
group G_4, called the adele group of G[19] (- 6 
Adeles and Ideles). In particular, when G = 

G„i, the adele group / = (G„,)^ is exactly the 
■•idele group introduced by Chevalley in class 
field theory. If we identify xeG^ with an 
adele whose components are all equal to x, 

Gk becomes a discrete subgroup of G^. 

Concerning the tiniteness property of G^/G^, 
the following results have been obtained 
[21,22]: A character /eXk(G)( = the module 
of all k-rational characters of G) gives rise 
to a (continuous) homomorphism : G^-> 

^ = (Gm)A- PutG^ = {c/eG^||;^^(<;/)|=l forall 

X,(G)1, where || is the standard norm in /. 
Then G“ is ^unimodular, and the quotient 
space G^/Gk is of finite volume with respect to 
the (unique) invariant measure on it. G^/G^ 
(G^/GJ is compact if and only if the semi- 
simple part G/R (the reductive part C/J?„) of G 
is k-compact. From the arithmetic point of 
view, it is important to determine explicitly the 
volume of G^/G^ with respect to the invariant 
measure normali z ed in a certain manner; such 
a volume is called the Tamagawa number of G 
and is usually denoted by 7(G) [19,23]. For 
instance, Siegel’s formulas on the volume of 
the fundamental domain of the unit group of a 
quadratic form f over k are essentially equiva- 
lent to a theorem on the Tamagawa number 
stating that t(SO(/))= 2. 

Let 0 be the •'ring of integers in k and L an 
o-lattice in the vector space on which G is 
acting. We can define in a natural manner an 
action of G^ on the set of all o-lattices; then the 
orbit G^L (G^L) of L with respect to G^ (GJ is 
called the genus (class) of L. The tstability 
subgroup G^ ^ of L in G^ is open, and the 
double coset space G^ i^\G^/Gk is finite (fi- 
niteness of the class number). Moreover, let 
{r'l, , r,.} be the totality of •'infinite prime 
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divisors of k, and put = 11;= i Gkc - Then 
is a Lie group, and the canonical projection 

on G^ of G^ I o G^ is a discrete subgroup of 
finite type. (In general, (discrete) subgroups of 
G^ which are ■'"commensurable with G^ are 
called arithmetic subgroups.) As in the adele 
case, xe^JG) gives rise to a (continuous) 
homomorphism G^-^(R ^ and if G° = 
{geG^ I Zco (.'?) I = ^ then the quotient space 
is of finite volume. Moreover, G^^jG^ 

(G, /GJ is compact if and only if G^^^/G^ {G^jG^) 
is compact. 

In addition to these, the tapproximation 
theorem and the tHasse principle are also 
extended to (classical, or general) algebraic 
groups (M. Eichler, M. Kneser, G, Shimura, 
Hijikata, Springer; [30]). 

Q. Structure of Reductive Groups [24] 

Let G be a connected ^reductive group defined 
over a field k. Then G has a ^maximal torus 
defined over k and G is ^split over a finite 
'^separable extension K of k. The structure of 
the group Gf^ can be discussed as in Sections I, 
J, K, and L. Here, we discuss the structure of 
G^, i.e., a t/c-form of G,. 

Let S' be a maximal k-split torus of G, i.e., a 
k-subtorus of G that is /c-split and maximal 
with respect to these properties. Any two such 
tori are conjugate over k, i.e., by an element of 
Gj^. Their dimension is called the k-rank of G. 
To say that G is of k-rank zero is equivalent to 
saying that G is t/c-anisotropic. The centralizer 
Z = Z,(S) of S in G is a reductive k-group, 
and its derived group is k-anisotropic. Let 
N = N,(S) be the normalizer of S in G. The 
^Weyl group iV/Z relative to S is called 
the k-Weyl group of G. A k-root of G with 
respect to S is a nontrivial character of S that 
appears when one diagonahzes the represen- 
tation of S in the -i-Lie algebra g of G, S operat- 
ing via adjoint representation. Denote by 
the set of all k-roots of G with respect to S. 

There is a decomposition of the Lie algebra cj 

of G : q = po + fla where q, = { JV e q | Ad(s) A 
= a(.s)A for all seS). Then po is the Lie alge- 
bra of Z, and there is a unique unipotent k- 
subgroup of G normalized by S such that its 
Lie algebra is q^. The set is a 'Iroot system in 
a suitable Euclidean space whose Weyl group 
is isomorphic to if G is k-split, is the 
ordinary root system, and the are as in 
Section J. In general, need not be reduced 
(i.e., axiom (iii) in Section J need not be satis- 
fied), nor should dim q^, = dim P^ always be 1 . 

A closed subgroup of G defined over k 
which is minimal among the parabolic sub- 
groups of G is called a minimal parabolic k- 
subgroup of G. (If G is fc-split, a minimal para- 



bolic k-subgroup is a +BoreI subgroup of G.) 
Any two minimal parabohc k-subgroups are 
conjugate to each other over k. If P is one such 
subgroup, then there exists a maximal /c-split 
torus S such that P is the semidirect product 
of the reductive k-subgroup Z = Z,(S) and the 
lunipotent radical U = R,(P) of P. The ex- 
pression P = ZG is called a Levi-decomposition 
of P, and Z is called a Levi-suhgroup of P. Any 
two Eevi-subgroups of P are conjugate by an 
element in G^. There is an ordering of such 
that P is generated by Z and with a > 0. The 
minimal parabolic k-subgroups containing a 
given maximal /(-split torus S correspond to 
the ■^Weyl chambers of ^r. They are permuted 
simply transitively by the Weyl group ^W. 

Eix an ordering of ;^r, and let ,^A be the Tun- 
damental system of with respect to the given 
order. Eor any subset () of denote by P^ the 
subgroup generated by Z = Z,(S) and P^, 
where a is a linear combination of the roots of 

in which all roots not in 0 occur with a 
coefficient ^ 0. Then P^ contains P ^d, in 
particular, P^ = P . P^ is called a standard para- 
bolic k-subgroup of G containing P. Any para- 
bolic k-subgroup of G is conjugate over k to a 
unique P^. If Sj, is the identity component of 
P|jgg(kera), then S 3 is a k-split torus of G and 
P 3 = Z(S 3 )P„(Pg). This shows that any parabohc 
k-subgroup of G has a Eevi-decomposition 
and its Eevi-subgroups are conjugate to each 
other over k. Eet P be a minimal parabolic k- 
subgroup of G containing a maximal k-split 
torus S. Put G= R,(P), Z = Z,(S), and N = 
N,(S). Then, N = N^Z, so Gj^ is the disjoint 
union over of double COSetS G^n^.P^ (wG 
i^W), where is a representative in A/^ of 
wej^W. More precisely, if we^kk, there exist 
two k-subgroups G(^ and G" such that V = 

G'„, X G" (product of k-varieties), the map 
G(,, X P^ Gn^ P sending (x, y) onto xn^,y is an 
isomorphism, and 

{G/P), = GJP,= U 7i{(U'M, 

where n is the projection G-^GjP. This is 
called a relative Bruhat decomposition. If G is 

/(-split, this gives an ordinary Bruhat decom- 
position (-^ Section K). If (/ is a subset of ^A, 
let kkg be the subgroup of ^kk generated by 
reflections defined by the a’s in 0. If (), O' are 
subsets of ^A, then there is a bijection of dou- 
ble cosets 

{Pe\\GM\=W,\,W/We.. 

(Note that all these properties follow from the 
fact that (p^, iVJ is a BN-pair in G^; Section 
R.) 

If G is k-isotropic, let G be the subgroup of 
G(j generated by all Gjj, where G runs over the 
unipotent radicals of the minimal parabolic k- 
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subgroups of G. Kneser and Tits conjectured 
that when G is semisimple and simply con- 
nected and k arbitrary, G^ = G ^ . Platonov 
showed that this conjecture fails for some Re- 
forms of SL„ (n ^ 2), but it is true in many 
cases, e.g., when G is ^/c-split or ^'/c-quasi-split 
(J. Tits, Sem. Bourbaki29Q,no. 505, 1976677). 

In this case, if G is simple as an algebraic 
group and the cardinality of A: is ^ 4, then 
GJZ(G^) is simple as an abstract group. The 
connected semisimple algebraic k-groups over 
a finite field k are k-split or k-quasi-split 
Section 0). J. Tits has given a reduction pro- 
cess for classifying the groups over an arbi- 
trary field k. He defined the index of the k- 
group (to a certain extent a generalization of 
Witt’s theorem characterizing a quadratic 
form by means of its index and anisotropic 
kernel) and gave all possible indices of the k- 
groups and also a complete list in the case of 
local tield (J. Tits [30,31]). 

R. Buildings and BN-pairs [25] 

The origin of the notations of buildings and 
BN-pairs lies in an attempt to give a systema- 
tic procedure for the geometric interpretation 
of the semisimple Lie groups and, in partic- 
ular, the exceptional groups. The theory has 
various applications to the groups of Lie types. 
To describe this precisely, we must introduce a 
number of detinitions. 

A complex A is a set with an order relation 
c , read “is a face of’ or “is contained in” such 
that for a given element A, the ordered subset 
S(A) of all faces of A is isomorphic to the set of 
all subsets of a set. The subset S(A) of A is 
called a simplex in A. A complex has a smallest 
element, which we denote by 0. For an element 
A, the number rk A of minimal nonzero faces 
of A is called the rank of A. Define rk A = 
sup(rk A I A e A). A morpbism a : A-> A' (where 
A and A’ are complexes) is a mapping of the 
underlying sets such that for every AeA, 

S(A) ^S(a(A)). A subcomplex of A is a com- 
plex whose underlying set is a subset of A such 
that the inclusion is a morphism. If A e A, the 
star St(A) of A is the set of all elements of A 
containing A. With the order relation induced 
from A, St(A) is a complex. If BeSt(A), the 
rank of B in St(A) is called the codimension of 
A in B and is denoted by codim^A. 

A complex A is called a chamber complex if 
every element is contained in a maximal ele- 
ment, which is called a chamber, and if given 
two chambers C, C’, there exists a sequence of 
chambers C = Q, C, , , = C’ such that 

codirp._^ (Q_i nCj) = codimc. (C^.jllCj)^ 1 
for all i= 1, 2, , m. A chamber complex is 
called thick (resp. tbin) if every element of 
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codimension 1 is contained in at least three 
(resp. exactly two) chambers. An endomor- 
phism (p of a thin chamber complex A is called 
a folding if cp^= (p and if any chamber on <p(A) 
is the image of exactly two chambers in A by 
(p. For any folding (p, (/)(A) is called a root. A 
thin chamber complex is called a Coxeter 
complex if for any pair of adjacent chambers 
C, C’, i.e., chambers such that codim(-(C fl C’) = 
C there is a root containing C and not C’. Let 
27 be a Coxeter complex. For any root (p{12) 
of 27, there is only one root (p'{Z), called the 
opposite of (p{I), such that (p(I)r\ (p'(27)does 
not contain any chamber and <p(27) U (p'{2f) = 27; 
there is also an involutive automorphism 
called the reflection associated with cp that 
transforms cp(Z) onto (p'{L). The group W(C) 
generated by all reflections of 27 is called the 
Weyl group of 2, which turns out to be a 
Coxeter group, i.e., there exist sets / and 
(i,j) E 1 I, where are integers or oo and 
mji = 1, such that the group is presented by 
the generators and the fundamental 

relations = 1, j, jel, A Coxeter 

complex is called irreducible if it is not the 
join of two nonempty Coxeter subcomplexes. 

A building is a thick chamber complex A 
with a system (JI of Coxeter subcomplexes, 
called the apartments of A such that (i) every 
two simplexes of A belong to an apartment, (ii) 
If 2", 2'e9l, there exists an isomorphism of 2 
onto 2' that fixes 27 fl 2' (elementwise). Since 
the apartments of a building are isomorphic to 
each other, we can define the Weyl group, 
rank, and irreducibility of the building to be 
those of its apartment. A building with finitely 
many chambers is said to be of spherical type. 
If a building can be realized as a simplicial 
decomposition of a Euclidean space, then one 
says it is of Euclidean type. 

Examples of buildings are provided by BN- 
pairs. A BN-pair or Tits system in a group G is 
a system (B, N) consisting of two subgroups of 
G such that 

(MO) B and N generate G; 

(J3N1) B n N = Ho N; and 
(BN2) The group W= N/H has a generating 
set R such that for any r e R and any VV6 kk, 
(BN2’) rBw c BwB n BrwB, 

(BN2”) rBri^B. 

The group W is called the Weyl group of the 
BN-pair. Eor any subset S of R, let denote 
the subgroup of W generated by S. Set = 

B Wg B. Then is a subgroup of G and the 
mapping Si— >P<,- is a bijection of the lattice of 
all subsets of R onto the lattice of all sub- 
groups of G containing B. A subgroup of G is 
called parabolic if it is conjugate to some P^. 
Eet A be the set of all left cosets of all sub- 
groups Pj, S c ordered by the opposite of 
the inclusion relation. Eet G operate on A on 
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the left. Let 2” be the subset [nP^ M 6 N , S c R} 
of A, and ^21 be the set of G-translates of Z. 

Then (A, 31) is a building whose Weyl group is 

W, 

Let G be a reductive algebraic group defined 
over a field k and P be a minimal parabolic k- 
subgroup of G containing a maximal /c-split 
torus S and set N = N,(S). Then N^) \s a 
BN-pair in Gj^. Therefore there is a building of 
spherical type associated with the group G. 
Conversely, the buildings of rank ^ 3 and 
irreducible spherical type (roughly speaking) 
all turn out to be associated with simple alge- 
braic or classical groups (J. Tits [25]). This 
result gives a complete and unified description 
of structures that were discovered previously 
in certain cases. For example, the building of 
type A, gives a tprojective space (E. Abe, T. 
Tsuzuku), that of type C„ gives a polar space 
(Veldkamp), and that of a k-form of type 
gives a Cayley space (J. Tits). As an applica- 
tion, one can show that a finite building of 
rank ^ 3 and irreducible spherical type is 
isomorphic to the building of an tabsolutely 
simple algebraic group over a finite field. 

When k is local (i.e., endowed with a com- 
plete discrete valuation whose residue field is 
Perfect), the reductive group defined over k 
has another BN-pair such that the associated 
building is Euclidean. This theory was ini- 
tiated by N. Iwahori and H. Matsumoto [26], 
who considered split semisimple groups. Later, 
quasi-split and classical groups were studied 
by H. Hijikata, and the theory for the general 
case was given by E. Bruhat and J. Tits 
[27,28]. To distinguish from the usual BN- 
pair structure, the subgroups conjugate to B in 
this case are called Iwabori subgroups, and 
parabolic subgroups are called parabolic sub- 
groups. The Euclidean buildings are the “ultra- 
metric analogs” of the +Riemannian symmetric 
spaces. In other words, in the study of p-adic 
simple groups, they play a role similar to that 
of the symmetric spaces in the theory of simple 
groups. 
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Algebraic Number Fields 

A. Introduction 

A complex number that satisfies an algebraic 
equation with rational integral coefficients is 
said to be an algebraic number, if the coeffi- 
cient of the term of highest degree of the equa- 
tion is 1, this algebraic number is said to be 
an algebraic integer. The set A of all algebraic 
numbers is a field which is the talgebraic clo- 
sure of the rational number tield Q in the 
complex number tield C. The set 1 of all alge- 
braic integers is an ^integral domain which 
contains the integral domain Z of all the ra- 
tional integers. The tfield of quotients of / is A. 



B. Principal Order 

An extension field k of Q of finite degree 
(which we shall always suppose to be con- 
tained in C) is said to be an algebraic number 
field of finite degree, and k is a subfield of A . 

The intersection o = k H / is an integral domain 
whose field of quotients is k; o is called the 
principal order of k. (More generally, a subring 
R of o containing 1 is said to be an order of k if 
the field of quotients of R is k. The set f of all 
elements y of o such that 70 c R is an ideal of o; 
in addition f is called the conductor of R.) Eet n 
be the degree of k over Q- Then the additive 
group of the principal order o of k is a tfree 
Abelian group of +rank n. A tbasis (cO] , , m„) 



14 D 

Algebraic Number Fields 

of o as a free Abelian group (or Z-module) is 
said to be a minimal basis of o (or of k). Eet 
(i = 1, , n) be tconjugate elements of (Oj over 
Q, and let A = |co9] be the determinant whose 
[ij) entry is ft)9\ Then D,^ = is a rational 
integer that is independent of the choice of a 
minimal basis of o. is called the discrimi- 
nant of k. If k ^Q, then |D^|> 1 (Minkowski’s 
theorem, 1891). Eor any given rational integer 
m there are only a finite number of algebraic 
number fields whose discriminants are equal 
to m (C. Hermite and H. Minkowski, 1896). 
The proof of these theorems depends on the 
methods of geometry of numbers (- 182 
Geometry of Numbers). 



C. Ideals of tbe Principal Order 

An tideal a of the principal order o is said to 
be an integral ideal of k, In particular, a prime 
ideal ( 7^: 0) of o is called simply a prime ideal of 
k. The domain o is not necessarily a tprincipal 
ideal ring but is always a tDedekind domain. 
That is, every ideal a of o is uniquely ex- 
pressed (up to the order of the factors) as a 
finite product of powers of prime ideals of o. 
This theorem is called the fundamental theo- 
rem of tbe principal order 0. 

The quotient ring o/o of o by an ideal a ( ,^0) 
of o is a finite ring. The number of elements 
of o/q is called the absolute norm of a and is 
denoted by N(a). We have N{db) = N{a)N{h). 
Every prime ideal p ( 7^ 0) of o is a tmaximal 
ideal of o, and o/p is a Imite field. Eet the 
^characteristic of o/p be p, where p is a prime 
number. Then o/p is a Imite extension of the 
Iprime field Z/pZ. Eet the degree of o/p over 
Z/pZ be f. Then N(p) = p^, and f is said to be 
the degree of the prime ideal p. 

Eet s be a complex variable. The (complex- 
valued) function 

C.W=II/(N(a))- = n 1/0 -(«(?))■’) 

0 P 

of s e C is called the Dedekind zeta function of 
k (R. Dedekind, 1871). Here the summation 
extends over all ideals a of o, and the product 
extends over all prime ideals p of o. This series 
converges absolutely for Re s > 1, and the 
function ^^(s) has a single-valued tanalytic 
continuation to a -i-meromorphic function 
on the whole complex plane (- 450 Zeta 
Functions). 



D. Units 

An algebraic integer g of k is said to be a unit 
of k if g"Ms also an algebraic integer. Hence fi 
is a unit of k if and only if the -i-Principal ideal 
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(E) is o. The set of all units of k forms an 

Abelian group under multiplication, which is 
called the unit group of k. The set of all ele- 
ments of of finite order coincides with the 
set of all the roots of unity contained in k and 
forms a cyclic group of a finite order w. Let n 
be the degree of k over Q. Then for each ele- 
ment ae k there are n conjugate elements 
over Q- Let {i = 1, , rj) be real for any 
ae k, and let and (j = t, • » r 2 ) 

be pairs of complex conjugates for any ae k. 
Then we have n = r^ + lr 2 - The unit group 
of k is the direct product of a cyclic group of 
order w and the free Abelian multiplicative 
group of trank r = + ^2 — 1. This theorem 

is called Dirichlet’s unit theorem (1846). A basis 
(fij, , 8^) of this froo group is called a system 
of fundamental units of k. 

Let /*'^a = log|a*'*| (i = l,...,r,), = 

21og|a‘-'’| {j = r^+ 1, . . .,r, + r 2 )for aek.Lor r 
elements rj^, , f/r of E^, 

P^rj2 ... 

nr 1 

is called the regulator of (t/j, , rj^) (Dedekind). 
In order for ... ,rj^ to be multiplicatively 
independent, it is necessary and sufficient 
that R [;|j, , r},] 0. The absolute value of 

R [fji, . , takes the minimum positive 
value R for fundamental units (ej, . . , e^). R = 

R [fij, , E,] is independent of the choice of 
fundamental units (Ej, . , £^) of k. R is called 
the regulator of k. In general, R[r}i,, |/£ 

is equal to the index : Hj of the group 
H generated by the roots of unity in k and 
?/,, ,t],. H. W. Leopoldt conjectured that 
units in k, which are multiplicatively indepen- 
dent over Z, remain multiplicatively indepen- 
dent over Zp (the ring of p-adic integers) when 
they are considered as elements of the tensor 
product k 0 Qp over Q. This conjecture was 
affirmatively proved in some special cases by 
J. Ax (Illinois J. Math., 9 (1965)) and others. 

If k/Q is a Galois extension, there exists a 
unit £ of k such that the conjugates of £ over Q 
contain r multiplicatively independent units 
(Minkowski’s theorem). 



E. Ideal Classes 

An o-module contained in k (i.e., oo c a) such 
that ao c o holds for some element a ( ?^ 0) of k 
is said to be a fractional ideal of k. For two 
fractional ideals a, b of k the “product” ab 
defined by (Za;/?; (finite sum)|a,£a, fteb} is 
also a fractional ideal. Thus the set of the 



fractional ideals of k forms a multiplicative 
commutative semigroup. For a fractional ideal 
a the set a “^ = (a e k aa <= o} is also a fractional 
ideal of k, and we have aa“^ = o. Thus the set 
of all nonzero fractional ideals of k forms an 
Abelian group 3k under multiplication with 
o as identity. Each fractional ideal a (^0) 
is uniquely expressed as a finite product of 
powers of prime ideals, if we admit negative 
powers. Namely, 3k I® ^ f^e Abelian multi- 
plicative group with the set of all prime ideals 
as basis. Given fractional ideals a and b, we 
say that a is divisible by b if a c b; in this case, 
we call b a divisor of a and a a multiple of b. 
Also, a c b if and only if there exists an integral 
ideal c such that a = be. Given fractional ideals 
d = n pf ' and b == IT q / i (^, 0), we say 

that a and b are relatively prime if {pj and 
are disjoint. Usually a fractional ideal of k is 
simply called an ideal of k. 

For an element a ( ^ 0) of k, (a) = ao is a 
(fractional) ideal of k, and (a) is said to be a 
principal ideal of k. The set of all principal 
ideals (a) (a e /c, a 7 ^ 0 ) is a subgroup of 3 /^. 

Since (a) = o is equivalent to a 6 £fc,we have 
P^ = k*/E^, where k* is the multiplicative group 
of all nonzero elements of k. 

Each coset of 3k modulo P^ is called an 
ideal class of k, and the group = 3 k/^k 
called the ideal class group of k. Each ideal 
cl ass c ontains an integral ideal a with N(a) ^ 
s/\^\ (more precisely, with A(a)^( 4 / 7 r)'' 2 (n!/«") 
I ). From this it follows that is a finite 
Abelian group. The order h of is called the 
class number of k. For the calculation of the 
class number the tresidue at the pole s = 1 of 
the Dedekind zeta function is used. Namely, 

lim(s- l)C^{s) = gh, 

s->l +0 

where is the regulator of k and is the 
number of roots of unity in k (Dedekind, 

1877). This formula is used, in particular, for 
the computation of the class numbers of 
■'■quadratic fields and tcyclotomic fields (- 347 
Quadratic Fields). The class numbers of cubic 
and quartic (real) tcyclic fields over Q were 
computed by H. Hasse in the case where 
the ■'■conductor of k/Q is less than 100 (Abh. 
Deutsch. Akad. Wiss. Berlin, 2 (1948)). Hasse 
has also given a detailed computation of the 
class number of cyclotomic fields [ 15). In 
general, let the degree n = [k: Q] be fixed and 
let ID^I^oo. Then 

Um(log(hM/logyi^) = 1 . 

(This formula was proved for n = 2 by C. L. 
Siegel, 1935, and for general n by R. Brauer, 

Amer. J. Math., 69 (1947).) 
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F. Valuations 

All the t Archimedean and tnon- Archimedean 
valuations of an algebraic number field k can 
be obtained as follows (- 439 Valuations): 

Archimedean Valuations. Let n= [k; Q], and 
let n conjugates of aG k be a*'*, , a'"' such that 
{i = 1, , fj) is real, and and 
{j=\, , ^2) are pairs of complex conjugates. 

W e writ^ |a|; = |a*-^^| (j = I/1 + r,); these 
are Archimedean valuations of k that are 
not mutually equivalent. The equivalence 
classes of these valuations are denoted by 

respectively, and are called the 
infinite prime divisors of k. The first ^’i infinite 
prime divisors are called +real and the remain- 
ing ^2 are called timaginary (or complex). The 
valuations of k defined by 

|o^lp'p = |ab^ j=l, 

= 7 = ^1 + 1,..., r, +f 2, 

are called tnormal valuations of k. Here Ipw 
( j = r, + 1 , . . . , r, + ^2) are valuations in the 
wider sense. If ^ we call k a totally real 
field, and if r, = 0 we call k a totally imaginary 
field. 

Non-Archimedean Valuations. Let p be a prime 
ideal of k and a an element of k. Let (a) = p'^b, 
where p and b are relatively prime. Put v,(a) = 
a. Then for any constant p (0 < p <I), 

is a non- Archimedean valuation of k. This 
valuation of k is called the tp-adic valuation of 
k; p-adic valuations for different prime ideals 
are mutually inequivalent. The valuation |a|p 
with p — {N{p))~^ is called a '•'normal valuation 
of k. The equivalence class of valuations con- 
taining Ip is denoted by the same letter p 
and is called a '•'finite prime divisor of k. 

A formal finite product of powers of finite 01 
infinite prime divisors m* = IT pf' is called a 
divisor of k. If all C; > 0, then m* is called an 
integral divisor of k. Given divisors m* = 
n and n* = n p{\ we write m* n* if e,- ^ 

/, (i = 1>2, ). 

Any valuation of k is equivalent to one of 
the valuations defined previously (A. Ost- 
rowski, 1918; E. Artin, 1932). For any element 
a ( 7^ 0) of k the '•■product formula Ilplalp = 1 
holds, where p runs over all finite and infinite 
prime divisors of k and |p are the normal 
valuations of k. Conversely, let k be a field, 
and let K= { ]p} be a set of inequivalent 

valuations of k such that (i) for any a e k (a 0) 

I a Ip 1 holds only for a finite number of p in 

V; (ii) the product formula rip|a|p= 1 (ae/c, 
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3£ ^0) holds; and (iii) there is at least one 
Archimedean valuation in V. Then k is an 
algebraic number field and Vis the set of all 
the prime divisors of k (Artin and G. Whaples, 

Bull. Amer. Math. Soc., 5 1 (1945)). 

G. Ideal Classes in the Narrow Sense 

For a, /5g k, the expression a = (i (mod p^i?) 
means > 0 for a real infinite prime divi- 
sor p)||) and 0 for an imaginary infi- 

nite prime divisor We call an element 
a e k totally positive if all real conjugates 
(/= 1,. , r,) are positive. In the notation just 
given, this means a = 1 (mod p*<(*) (i = 1, , rj. 
The set of all principal ideals (a) generated by 
totally positive elements a g k is a multipli- 
cative subgroup of Each coset of 3^ 
modulo is called an ideal class of k in the 
narrow sense. Fet be the group of all totally 
positive units of k. Then we have (3/^ ’ Pk) = 

hT^I{E,:En 

H. Multiplicative Congruence 

Fet m be an integral ideal of k, and let k*(m) 
be the multiplicative group of all elements a in 
k such that (a) is relatively prime to m. Any 
element <XG k*(m) can be expressed in the form 
a = (i/y such that /5, y G o and (P), (y) are rela- 
tively prime to m. 

Consider an integral divisor m* = m H P* 
which is a formal product of m and infinite 
prime divisors p*4^ of k. We call trt the finite 
part of m*. Given an element oiE k*(m) and 
elements /3, y 6 k*(m) fl o such that a = 
we set a = 1 (mod’' nt*) if y (mod nr), and 
a = 1 (mod p'^l). The set of all a in k*(m) such 
that a = 1 (mod” m*) forms a multiplicative 
group. We write a = (mod” m*) for a, /ie k if 
a//i G k*(m) and a//i = 1 (mod’' m*). This con- 
gruence is called the multiplicative congruence. 
In the following discussion we shall write 
modm* for mod” m*. 

We denote by 3fc(m) the group of all ideals 
of k that are relatively prime to an integral 
ideal m, and by S(m*) the group of all prin- 
cipal ideals (a) such that a e k*(m), a = 1 
(modm*); S(m*) is known as the ray modulo 
m*. Any subgroup H of 3k(tti) which contains 
S(m*) is called an ideal group modulo m*, and 
the factor group 3k (tn)/// is called a group of 
congruence classes of ideals modulo m*. 

If n* m* for integral divisors m* and n* of 
k, then 3k(i^) ^ 3k(tt) and S(m*) c S(n*). If H 
is an ideal group modulo ri*, then Q(H) = 

H n 3k(i^) group modulo m*, and 

we have 3k(tt)/I/ = s,(m)/@(H). For any given 
ideal group Hq modulo m* there is a smallest 
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integral divisor f* such that f* m*, and there 
exists an ideal group H modulo with ^{H) 

= Hq (i.e., if there is an ideal group H ' modulo 
n* with = Hq, then f* n*). We call f* the 
conductor of the ideal group H . The notion of 
multiplicative congruence is used in tclass field 
theory and in the theory of tnorm-residue 
symbols. 

I- Ideal Theory for Relative Extensions 

If an algebraic number held K has a subfield k, 
we say that K/k is a relative algebraic number 
field. Let O be the principal order of K. For a 
(fractional) ideal a of k, Oa is an ideal of K. 

We write Oa = £(a) and call E(a) the exten- 
sion of a to K . For ideals a, b of k, we have 
£(ab) = £(a)£(b) and E(a)r\k = a. 

Let 'I'j : K ^ C be k-isomorphisms (j = 

I, where n = [X :k]. We write = 

‘F,(X) andd<')=Y,:(A)for/leX. For an 
ideal ^ of K, = j/4*'* A e *il} is an ideal 
of K^‘\ and is called the conjugate ideal 
of in Let L be the composite field of 
X*’*, , X‘"( Then the ideal generated by 
'■iF’L.. in L is the extension of an ideal a 
of k. We write a = and call a the rela- 
tive norm of iR over k. We have = 

and N^i^{E{a)) = a” (for an 
ideal a of k). In particular, for k = 

MN(m 

Let p be a prime ideal of k. Then E(p) = 

'’jSgS in C, where j, , iPg are 
prime ideals of K . Let f- be the degree of the 
finite field over o/p. Then = p-^'; 

[■ is called the relative degree of . over k, and 
e, is called the (relative) ramification index of 
'jP; over k. We have the relation n = 
between these numbers. If e, = =Cg= 1, the 
prime ideal p is said to be unramified for K/k. 
Otherwise, p is said to be ramified for K/k. If 
every prime ideal of k is unramified for K/k, we 
call K/k an unramified extension. (For an in- 
finite prime divisor p of k we write = 
nf=] ' if the Archimedean valuation 
|„ of k can be extended to g Archimedean 

'•■cc 

valuations (i= 1, . . . , ^) of X, where c, = 

2 if is imaginary and p^^ is real, c,- = 1 
otherwise.) 

J. Relative DilTerents and Relative 

Discriminants 

Let K/k he a relative algebraic number field 
and 0 , 8 be the principal orders of k, X, re- 
spectively. Put 9Ji= {A 6 X Tr^ii^{kD) cz o}, 
where is the ttracc (- 149 Fields J). Then 
'JJi is a (fractional) ideal of X and ftJJ ' = is 
an integral ideal of K; is called the rela- 



tive different of X over k. When k = Q, is 
simply called the different of X. For L zj X => k , 
we have the chain theorem: 

Let the conjugates of A g X over kbe 
A'*’, , A"", and assume that A*‘*= A. Put 
for A G X. If A 6 0, 

thena/i;/(((A)G is generated by 

{<5|s:/k(A)| AgO}. The integral ideal gen- 
erated by {A — A‘‘* A gO} in the field L = 
X(i>x<2) was called an element by D. 
Hilbert. We also have 
The integral ideal b^-y^ = of k is 

called the relative discriminant of K/k. If k = Q, 
h/c/Q = (Dk). 

For the relative different to be divisible 
by a prime ideal “ilj of X, it is necessary and 
sufficient that E(p) = with e > 1, 

where p = ''jj fl k (Dedekind’s discriminant 
theorem, 1882). Hence a prime ideal p of k is 
ra mi lied for K/k if and only if p divides the 
relative discriminant b^y^; there are thus only a 
finite number of prime ideals of k which ramify 
for K/k. In particular, K/k is unramified if and 
only if b^yt = 0. 

K. Arithmetic of Galois Extensions 

Eet K/k he a relative algebraic number field 
such that X is a ^Galois extension of k of 
degree n, and let G be the ^Galois group of 
K/k. Eet o, T* be the principal order of k, X, 
respectively. The conjugate ideals of an ideal 
''A of X are given by = {A"| Ag9I} {(jeG). 

If = a, then E(a) = ^ 

prime ideal p of X, £(p) = (‘!Pi A ^2 ••• 
where X^;yt('!j8y) = P"'^(i = U ■ ■ ■ ■ g),n = efg, and 

are mutually conjugate prime ideals 

of X over k. 

Hilbert (1894) developed the decomposition 
theory of a prime ideal p of k for a Galois 
extension K/k in terms of the Galois group G 
as follows: Eet be a prime ideal of 0. Then 

z={(7GGi'‘r=^j8} 

is a subgroup of the Galois group G of K/k. Z 
is called the decomposition group of 'jP over k. 
Eet G = (Jy Zt; be the left coset decomposition 
of G. Then ''jP,- = (i = 1, . , g) are all the 
conjugate ideals of ''jP over k. 

The subgroup 

£={ggZ| A"-A(modA^),AGO} 

of the decomposition group Z is normal, and 
T is called the inertia group of 'jP over k. The 
quotient group Z/T is a cyclic group of order / 
(the relative degree of ip). There exists an 
element g of Z such that 

A^ = A''"’>(rnodm AeC, 

and 0 is uniquely determined mod T; crT gen- 
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crates the cyclic group ZjT. This a is called the 
Frobenius substitution (or Frobenius automor- 
phism) of 'ip over k. For = 1, 2, , 

F'™* = e Z I = d (mod '), /4 e C } 

are normal subgroups of Z; the group K*™’ is 
called the mth ramification group of '*p over k. 
Let 

J/(0)_ _ |/(t’i+l)_ 

_ j/(tv+l)_ j 

Let Vp - (p = 0, 1, where Vq= —1. 
In particular, Kq ^ = T. The integers y, , 

V 2 , ..are called the ramification numbers of ''P. 
The group T/Fj is isomorphic to a subgroup of 
the multiplicative group of the finite field O/'P. 
Hence T|V^ is a cyclic group whose order Cq 
is a divisor of At (^p)- 1. The group F„/F„+i 
(m ^ 1) is isomorphic to a subgroup of the 
additive group of the finite field O/pI. Hence 
yjVm+\ is an Abelian group of type (p, p, , p) 
whose order divides NCp). From e ^ T| = 

(T: F,)|F, it follows that e^eoP*‘,(e(f,p)= 1. 
Here G| denotes the order of a finite group G. 
Hence the decomposition group of 'P is a 
'^solvable group. The relation between the 
ramification numbers for X//c and those for an 
intermediate Galois extension f/k was com- 
pletely determined by J. Herbrand (J. Math. 
Pares AppL, 10 (1931)) [11]. 

Let be the 'p-component of the relative 
different DK/k of a Galois extension K/k. Then 

(I r'l-i). 

p=0 

In particular, d = 0 if T = 1 , and d = e 1 if 
F">=1. 

Let /c^, kj, and ky{m) be the intermediate 
fields which correspond to the subgroups Z , T, 
and respectively, in the sense of ^Galois 
theory; the fields ky,kj, and kyim) are called 
the decomposition field, the inertia field, and 
the mth ramification field of p, respectively. 

Let pi be a prime ideal of K containing p, and 
let P2 pj, be prime ideals in k^ and kj such 
that P2 = P n and p-/- = pi H /c^-. Then we 
have £(p) = p^p^^’ p^^* for kjk; £(p^) = p^. 
and tV,^/,^(p.,.) = p| for kj/k/, and £(pj.) = p" 
for K/k y. 

If a prime ideal p of k is unramified for a 
Galois extension K/k then we have E(p) = 

1 '^^2 (i = lo 9), %) = 

pf, and n = fg. The Frobenius automor- 
phism a,- : /4 "<■ = (mod P,) (A e O) for the 
prime ideal p. is uniquely determined, and its 
order is /.' Since p.= pT( we have a- = 7-. 

Hence (7, , , belong to the same fcon- 
jugate class of G. In particular, if G is an 
Abelian group, then 0 ^= . . = and 
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We then write 




and call this symbol the Artin symbol for p 
for the Abelian extension K/k. For an ideal 
a = n p of A' that is relatively prime to the 
relative discriminant of K/k, we define 




Evidently, we have 

©-(?)(?) 



The arithmetic of quadratic fields (-^ 347 
Quadratic Fields) and the arithmetic of cyclo- 
tomic fields (-^ Section L) have been developed 
since the 19th Century. 

L. Arithmetic of Cyclotomic Fields 

A complex number [, whose mth power is 1 but 
whose m’th power is not 1 for m’ <m is called 
an mth primitive root of unity. There are q)(m) 
primitive roots of unity; exp(2;ri>/t«)((r, m) = 1), 
where (pis tEuier’s function. These q(m) primi- 
tive roots of unity are the zeros of an irreduc- 
ible polynomial over Q of degree <p(m): 

where p is the '^Mobius function. The coeffi- 
cient of the highest term of E,(X) is 1, and the 
other coefficients are all rational integers. 

F,(X) is called a cyclotomic polynomial. An 
example is 

F,2(X) = {X^^^\){X^-mX^~l){X^-i) 

The algebraic number field K^ = Q(Cm) 

(C„, = exp(27ri/m)) obtained by adjoining an 
mth primitive root of unity to Q is a Galois 
extension over Q of degree (p{m) whose Galois 
group G is isomorphic to the multiplicative 
Abelian group of ^reduced residue classes of Z 
modulo m:G = {GACm' = Crn^ {r,m)^ I}. K^ is 
called the mth cyclotomic field. Cyclotomic 
fields are tAbelian extensions of Q. Conversely, 
every Abelian extension of Q is a subficld of a 
cyclotomic field (Kronecker’s theorem, 18.53, 
1877). 

We can choose (1, Ci , . , Cm ^ 

minimal basis of K,. Eet m = be 

the decompositions of m in powers of prime 
numbers (j, Put K'‘*= Then K^ 
is the composite field K^ — K^^\ 

The different of is given by ,q = 
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and the discriminant 

of is given by (n, = 

(p{m)/(p{lfi)). If w= the discriminant of 
is {a = l’'~^hl — h— 1)), where g = 

— 1 if /'' = 4 or / = 3 (mod 4), and g = 1 other- 
wise. Hence the discriminant of K is D^- = 
(.,y — 1 Suppose that 

(mod 4). Then a prime number p ramifies 
for KJQ if and only if p divides m. In par- 
ticular, if m= (m> 2), then only 1 ramifies for 
KJQ and (/) = !''’•'">, N(I) = / (I is explicitly 
given by 1= (1-i,,,)). For 2 the ramification 
numbers for I are Vj = I' 1 (i = 1,2, and 
the ramitication fields are K,, Kp, For 1= 2 
the ramification numbers are 1,3, 7, and the 
ramification fields are Q, Kg, 

In KJQ a prime number p {p-{m} is decom- 
posed as (/?) = p , (p.) = p^i = 1 , y) 

and fg = qn(m). Here the degree f of p; is de- 
termined as the minimal positive integer f 
such that = 1 (mod m). Hence the decom- 
position law of a prime number p in KJQ is 
derermined by its residue class modulo m. This 
is a prototype form of class tield theory (-59 
Class Field Theory). 

The class number of the cyclotomic tield K 
can be calculated by Dedekind’s formula (- 
Section E; see also Hilbert [4]). Here we shall 
give the result for m = / (a prime number). 

Let r be a -i-Primitive root modulo 1. For ^ = 
exp(27a//), we put 



g = g(0 = 



Vi-c 1-c-v 



Then g is a unit in K,. Define an element (j of 
the Galois group of K,/Q by J= and put 
g. = £"' (i = 0,I,...). Theneo,ei,---,£p-i (P = 

(I — 3)/2) are multiplicatively independent units. 
That is, the regulator R [£q, a^, # = 

E^O. The units . gp_i are called 

circular units. The class number of K; is the 
product of two factors, h = /ij h,. Here /j^ is the 
class number of the real subfield KJ = Q(C + 

( ’ ). E. E. Kummer called h , the first factor 
and /I 2 the second factor of the class number h. 
Let Xi, > '/j-\ be the multiplicative char- 
acters of the treduced residue classes of Z 
modulo 1, and let y j = 1, , p -|- 1) be the 
characters among them such that — 1) = — 1. 
Then 



(-ir'opYv ,1) t 



Rr 



(Kummer, 1850). Here Rq is the regulator of 
K'). Since circular units belong to K\, the class 
number /i 2 of KJ is equal to the index of the 
subgroup generated by +1, g^, , £^_, in the 
group of units of KJ. The class number h of 
K, is equal to 1 for 19 and it has been con- 
jectured that there exist no more fields K; 



with h= 1. H. L. Montgomery and K. Uchida 
solved this conjecture by proving that the first 
factor /i, > 1 for / > 19 {Tdhoku M ath. j., 23 
(1971)). J. M. Masley and Montgomery proved 
that there are precisely 29 distinct K^ with 
class number 1 (J. Heine Angew. Math., 

286/287 (1976)). 

According to Kummer, / divides h if and 
only if I divides h,. Since can be computed 
explicitly, we can readily determine whether 
l\ h or not (- 145 Eermat’s Problem; Ap- 
pendix B, Table 4. 1 1 1). A prime I is called regu- 
lar if Ifh, otherwise it is called irregular. Thus 
an odd prime 1^5 is irregular if and only if 
there exists an even integer j with 2^j ^[—3 
such that I divides the numerator of the jth 
Bernoulli number Bj. This criterion of Kum- 
mer can be strengthened as follows. Let $ 
denote the /-Sylow subgroup of the ideal 
class group of the prime cyclotomic tield 
K, = Q(Ci). Because the Galois group G of 
Q(C;)/Q operates on S, the Abelian group S 
becomes naturally a Z,[G]-module. Choosing 
the canonical character 0:G~*Zf defined 
by J = Cf for any ne G, we have a direct 
decomposition S = IllTio S^'\ where = 
{s6S|s'^ = s*' ae G}. Then it is necessary and 
sufficient for /| (the numerator of Bj) that 

(K. A. Ribet, Inventiones Math., 34 (1976)). 
Moreover, because G is naturally isomorphic 
to the multiplicative group of the reduced 
residue classes of Z//Z, each becomes a 
tDirichlet character. Let 

^ 1 , 0 -'=7 Z 0~\a)a- 

then this number, regarded as an element in 
Qhis equal to a factor of the product (up to 
+ 1 and 2) appearing in the class number 
formula for h,. Let m; be the /-exponent of 

y-[- for each odd / (2 < / ^ / — 3). Then it is 
conjectured that the order of is precisely 
equal to l"'i for each odd / with 2</^/ — 3. In 
particular, if S’ = IT;. even trivial, then 
this is known to be true. In general, it is also 
conjectured that l\h 2 holds for any prime / 
(Vandiver’s conjecture). If the group b 
cychc, then 5*'^ is of order /'"i (A. Wiles, Inven- 
tiones Math., 58 (1980)). 

When a Galois extension K over a finite 
algebraic number field k has the Galois group 
isomorphic to the additive group of the /-adic 
integer ring Z,, the extension K/k is called a 
Z, -extension or r-extension. Then for each 
integer n^O there exists a unique subfield k„ of 
K with degree /" over k. Let k = K; = Q(C,) (/ > 2), 
and let K be the union of all th cyclotomic 
fields K,n+ 1 (n ^ 0). Then K/k is a typical ex- 
ample of a Z, -extension with subfields k„ = 

K,n+ 1 . Let K/k be a Z, -extension with sub- 
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fields k„, and let denote the /-component 
of the class number of k„. Then there exist 
integers A ^ 0, jU ^ 0, v independently of n 
such that e„ = Xn + lil” + v for all sufftciently 
large numbers n (K. Iwasawa, Bull. Amer. 

Math. Soc., 65 (1959)). These numbers X,ji, 

V are called the Iwasawa invariants for the 
Z, -extension K/k. There exists a unique 
Z, -extension K° over the rationals Q. An ex- 
tension K°k over k obtained by taking the 
composite of K° and k is called the basic Z,- 
extension of k or the cyclotomic Z;-extension 
of k. We denote its Iwasawa invariants by 
Alik), piiik), v,(k). It is known that there are Z,- 
extensions with arbitrarily large But it is 
conjectured that /t;(/c) = 0 holds for the basic 
Z, -extension of any k. In particular, in the case 
k = ^^, = Q(Qit has been computed that /T((/c) = 
0 for / < 125,000 (for / < 4001 by K. Iwasawa 
and C. C. Sims, 7. Math. Soc. Japan, 18 (1966); 
for /< 30,000 by W. Johnson, M ath. Comp., 29 
(1975); for /< 125,000 by S. Wagstaff, Math. 
Comp., 32 (1978)). By applying the theory of 
uniform distributions to /-adic situations, B. 
Ferrero and L. C. Washington (Ann. M ath., 
109 (1978)) proved that p,{k) = 0 when k/Q is 
Abelian. In particular, when k = K , and k„ = 
K,""''' , > 0 if and only if the class number 

ofX, is divisible by / (P. Furtwangler, 1911). 

Since any quadratic field is a subficld of a 
cyclotomic field (by a ^Gaussian sum formula 
we have Q(y^) c Q(C|,;|)? where d is the dis- 
criminant of Q(>yw)), the computation of the 
class number of quadratic fields and the proof 
of the law of reciprocity for the t Legendre 
symbol follow from the arithmetic of cyclo- 
tomic fields. 



M. Arithmetic of Rummer Extensions 

Assume that an algebraic number field k con- 
tains an nth primitive root of unity. Then a 
■''Kummer extension K = kiyii) (psk) is a 
■•■cyclic extension of k. Assume that [K: k] = n. 
In order that a prime ideal p of k ramify for 
K/k, it is necessary that p be a divisor appear- 
ing in (n) or (p). If pKn) and v^(p)^0 (modn), 
then p ramifies for K/k. A prime ideal p which 
is relatively prime to (p) has the decomposition 
E(p) = ^ 1 distinct primes '(p- in K if 

and only if the equation p=^" (mod p”) is 
satisfied by some e o for any positive integer 
m. In particular, if p-l-(n) and v^[p) = 0, we have 
E(p) = 'ip j 'ip„ if and only if p = ^" (mod p) is 
solvable in o. 

If for an element of 0 

^ = ^"(modp) 

is solvable by some ^eo, pis said to be a 



residue of the ntb power modulo p. Assume 
that p)(n) and Vp(/.i) = 0. Let / be the minimal 
positive integer such that p^ is a residue of the 
nth power modulo p. Then p is decomposed 
in K as £(p) = )Pi '^P,, and NK/k(^i) = P^ 

(1 = 1 , 

N. Power-Residue Symbol 

Let = exp{2ni/n) e k, and let p be relatively 
prime to (n) and (a) (ae k). Then for some r we 
have 

and we write 




This symbol is called the nth power-residue 
symbol (Rummer). Generalizing this definition, 
we can define the nth power-residue symbol 
(a/b)„ for an ideal b of k which is relatively 
prime to the relative discriminant of k{^//oi)/k 
by using the +Artin symbol (( K/k)/h): 




This symbol satisfies 




f *^ 2 ^ 

. b /„ \h J„\h ), 



if all the symbols are well delined. In partic- 
ular, (X is a residue of the nth power modulo p 
if and only if (a/p)„ = L This symbol coincides 
with the tquadratic residue symbol for n = 2, 
k = Q, and p ^ 2. 

0. Law of Reciprocity for tbe Power-Residue 
Symbol 

Several formulas concerning the power-residue 
symbol are known which are similar to that 
for the quadratic residue symbol (L. G. M. 
Lisenstein, Rummer, Lurtwangler, Takagi, 
Artin, Hasse). These can be proved by means 
of Artin’ s '•'general law of reciprocity in class 
field theory (-^ 59 Class Lield Theory). 

There are many formulas concerning the 
reciprocity of the power-residue symbol. One 
of them is as follows: Let n = / be a prime 
number. Let a, k and assume that (i) a is 
totally positive; (ii) v,(a) = 0 (mod /) if v^([i) ^ 0 
(mod/), and Vp(//) = 0 (mod/) if Vp(a)^0 (mod/), 
for any prime ideal p; and (iii) a = 1 (mod /) and 
^ = 1 (mod( 1 — ^()). Then 
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=cr, 



fl-Tr,,Q 



/a-l/l-l\ 

V / 1-c,/ 



(general law of reciprocity, Hasse, 1924). This 
result is a generalization of the formula of 
Eisenstein (1850). If a(ek) is totally positive 
and a = 1 (mod 1), then 




b=Tr 



klQ 




If a = 1 (mod /( 1 — (,)), then 



a 



sc— 1 



1-Ci 

sc 



= Cf, 



a— 1 

/(I'C,) 



(complementary law of reciprocity, Hasse, 
1924). 



P. Norm Residue 

Let m be an integral divisor of k such that 
TTt = n,- pf' rij Pto (e,- > 0) with finite prime divi- 
sors {p,} and infinite prime divisors {p^^?}, 
and let be an element of k that is relatively 
prime to m. For a relative algebraic num- 
ber iield K/k and an element 5 of K, we set 
i? = I^Kiki B) (mod m) if the following two con- 
ditions are satisfied: (i) P = blK/k(B) (mod pf>) for 
every finite P, and (ii) P^^^ > 0 for every infinite 
prime p*i’ such that is real and its extension 
to K is imaginary, pe k is then said to be a 
norm residue modulo m for K/k if there exists a 
number 5 of X SUCh that P = (modm). 

Let p be a finite prime divisor of k. If pis a 
norm residue modulo p‘" for a sufficiently large 
c, then fi is a norm residue modulo p^^ for any 
e > c. Let c be the smallest such integer (c ^ 0). 
Then the ideal = p^ is said to be the p- 
conductor of norm residue for K/k. If p is un- 
ramified for K/k, then c = 0; i.e., every Psk 
which is relatively prime to p is a norm residue 
modulo p® for any e > 0. For a ramified p put 
fp = p^ For a Galois extension K/k, c is not 
greater than 



Q. Norm Residue Symbol 



For an Abelian extension K/k, the positive 
divisor 



f=nf. 

p 

(where p runs over all finite and infinite prime 
divisors of k) is called the conductor of K/k 
(— >■ 59 Class Field Theory). For ae k (a ^0) 
take ocq such that oc/aQ = 1 (mod f^) and oCq = 1 
(mod and put (a^) = p^b with b relatively 
prime to p. Then b is relatively prime to the 
relative discriminant We define a new 
symbol by 




where ((X//c)/b) is the Artin symbol. This 
value is independent of the choice of the 
auxiliary element aQ. The new symbol is called 
the norm-residue symbol (Hasse, J. Reine 
Angew. Math., 162 (1930)). In particular, for 
an infinite real prime divisor p*^' of k whose 
extension for K is imaginary, we have 




according to whether the conjugate is 
positive or negative, where cj is the automor- 
phism of K/k induced from the complex con- 
jugation of the completion C of X with respect 

to 

The norm-residue symbol has the following 
properties: 



0) 



olol', K/k 





(2) if p is unramified for K/k, then 

(3) in order that a be a norm residue modulo fp 
for K/k, it is necessary and sufficient that 




(4) the product formula for the norm-residue 
symbol (Hasse) is 



P-o\Vo\ 



(vp+\ 






- 1 , 



In particular, for an Abelian extension K/k this 
value is an integer and is equal to c (Hasse, J. 
Fac. Sci. Univ. Tokyo, 1934). For example, the 
/-conductor of the cyclotomic field X,/i/Q is /''. 
We define the p ^-conductor for K/k for an 
infinite prime divisor p^o of k by = Poo if Pag 
is real and its extension to K is imaginary, and 
f =1 otherwise (- 257 Local Fields F). 

'Px ^ 



where p runs over all finite and infinite prime 
divisors of k; and (5) if the domain of the 
variable a is the whole k ( 7 ^ 0 ), or the set of 
all a such that (a) is relatively prime to p, or 
the set of all 0 £ such that a = 1 (mod p”) [Vp -\- 
I ^m^Vp+i), then the range of ((a, K/k)/p) is 
the decomposition group Z of p, the inertia 
group T of p, or the ramification group Vp 
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of p, respectively (Hasse, S. lyanaga, 1933) 
(- 257 Local Fields F). 



R. Hilbert Norm-Residue Symbol 



The symbol first introduced by Hilbert for 
quadratic fields can be defined in a general 
algebraic number field k containing an fith 
primitive root of unity. Let cn, (ie k (a?^0, 

P ^ 0), and let p be a prime divisor. Then the 
following nth root of unity ((a, j?)/p)„ is defined 
by using the norm-residue symbol: 







This symbol ((a, /i)/p)„ is called the Hilbert 
norm-residue symbol. It is also called the 
Hilbert-Hasse norm-residue symbol. For a, sc', 
k, we have 




(2) the law of symmetry: 



/a, /1\ //1, 0f\ ' 

V P /n \ P A ’ 



and (3) the product formula for the Hilbert 
norm-residue symbol: 




(Hilbert, Furtwangler, Takagi, Artin, Hasse). 
For detailed properties concerning the norm- 
residue symbol, power-residue symbol, and 
Hilbert norm-residue symbol and for refer- 
ences for them see Hasse [6]. 

In general, the problem of obtaining various 
laws of reciprocity for the power-residue sym- 
bol is reduced to the one of computing the 
Hilbert norm-residue symbol explicitly. De- 
tailed formulas for these symbols are called 
explicit reciprocity laws; they are treated as 
a topic in the number theory of local fields 
(- 257 Local Fields H). 



S. Density Theorem 



et M be a set of prime ideals of k. If 



1 



L m y 

+ 0peM(A(p)) 



log 



«;-l 



exists, its value is said to be the density of M. 
The density of the set of all prime ideals of k is 
1 . Let H be an ideal group modulo an integral 
divisor m. Then the density of the set of all 
prime ideals contained in each coset of 3(m) 



modulo H is l/(3(nt): H ). In particular, let H 
be the ray S(m). Then this result implies that 
each coset of 3(m) modulo S(m) contains 
infinitely many prime ideals (a generalization 
to algebraic number fields of the tprime num- 
ber theorem for arithmetic progression). 

Let K/k be a Galois extension, C be a conju- 
gate class of the Galois group G of K/k, and 
M(C) be the set of all prime idcals p of k such 
that the tprobcnius automorphism of each 
prime factor of p in K belongs to C. Then 
the density of M(C) is 1C|/|G| (Cbebotarev’s 
density theorem, Math. Ann., 95 (1926)). 

Each element (j of the Galois group G of K/k 
can be expressed by the permutation z of the 
conjugate fields K^^\, of K over k. Let z 
be expressed as the product of y tcycles of 
length /j, ,/^. Hence n =/i + +ff. Let 
C(/i, . ,/,.) be the set of all such a in G, and 
let M(/i, ,/^) be the set of all prime ideals 
p of k such that p is decomposed in K/k as 
the product of r prime ideals of K with rela- 
tive degree /, , . ,/,. Then the density of 
mfuJr)is C(f„. . .,mG\{Anrn,Math. 
Ann., 89 (1923)). 

T. Relation tO the Arithmetic of Local Fields 

It is quite useful to investigate the relation 
between the arithmetic of algebraic number 
fields and that of local fields. For example, let 
a prime ideal p of an algebraic number field k 
be decomposed as E(p)= 

= p (t = 1 , ■ • • , d) in an extension K of k. Let 
and be the completion of K and k with 
respect to 'j3-adic and p-adic valuations, re- 
spectively. Then we have k,] = ^./jand 
K ®K + K,^i (direct sum). The 

relative different is expressed as (the 

p-component of = rif=i For a 

Galois extension K/k the p-conductor = p‘' 
for the norm-residue and the conductor p‘‘ of 
local extension K,^^/k^ have the same exponent 
c. For a local field K,^^/k^„ each norm-residue 
modulo h, is a norm of an element of K,. 

Hence precise results concerning the norm- 
residue in local fields can be applied imme- 
diately to a global field K/k (- 257 Local 
Fields). 

We can also apply the method of the idele 
group of an algebraic number field k, and 
therefrom we can prove results concerning the 
ideal class group, unit group, and zeta function 
of k (-^ 6 Adeles and and Ideles). 



U. History of the Arithmetic of Algebraic 
Number Fields 

C. F. Gauss (1832) was the first to generalize 
the notion of integers to algebraic number 
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fields in considering the elements of Z[^y — 1], 
now called Gaussian integers (Z[^— 1] is the 
principal order of Q(^ — l)). After investiga- 
tions by G. L. Dirichlet and Kummer, the 
notion of ideals was introduced by Dedekind 
(1871) [2]. L. Kronecker gave another founda- 
tion for the arithmetic of algebraic number fields 
(1882) [3], Dirichlet proved the unit theorem 
and, introducing the analytic method to num- 
ber theory, gave the class number formula of 
quadratic fields (-^ 347 Quadratic Fields). H. 
Minkowski first applied the theory of lattice 
points to number theory (- 182 Geometry of 
Numbers), and K. Hensel introduced the p- 
adic method (- 257 Local Fields). Hilbert 
(1897) [4] and Hasse (1926, 1927, 1930) [6] 
summarized the main results on the arithmetic 
of algebraic number fields known at that time. 
In particular, Hilbert’s report centered around 
the arithmetic of Galois extensions, and 
Hasse’ s around the class field theory obtained 
by T. Takagi, E. Artin, and H. Hasse (- 59 
Class Field Theory). Since c. 1950, when the 
notions of ideles and adeles were introduced, 
cohomology-theoretic methods have been 
successfully applied to number theory 6 
Adeles and Ideles). Recently various local 
methods, for example, the Iwasawa theory of 
Z, -extensions, tformal groups, and tp-adic E- 
functions (T. Kubota and H. W. Leopoldt, J. 
Reine Angew. Math., 214/215 (1964)) have been 
frequently applied to research in algebraic 
number fields. 
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Algebraic Surfaces 

A. Definition 

An algebraic variety of dimension 2 is called 
an algebraic surface. In this article, by a sur- 
face we mean a complete irreducible algebraic 
surface defined over an algebraically closed 
field K. 

B. History 

The history of algebraic surfaces originated 
with the study of algebraic functions of two 
variables. In the case of algebraic functions of 
one variable, the introduction of IRiemann 
surfaces attached to such functions played an 
essential role in the development of the theory. 
The study of algebraic functions of two vari- 
ables led naturally to the consideration of the 
surfaces defined by a suitable polynomial 
equation. H. Poincare and E. Picard are 
among those who studied the homological 
structure of the surface defined by the equation 
P(x, y, z) = 0. The theory of Abelian integrals 
(Picard integrals) is one of the consequences of 
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such topological investigations. S. Lefschetz 
obtained further results in this direction. 

M. Noether and geometers of the Italian 
school, such as F. Enriques, G. Castelnuovo, 
and F. Seven, studied algebrogeometric prop- 
erties of algebraic surfaces. In particular, the 
Italian school geometers recognized the im- 
portance of irregularity and thoroughly inves- 
tigated its geometric meaning. In the early 
20th Century they succeeded in constructing 
the great edifice of the theory of algebraic 
surfaces. Though some of their results lack 
rigorous proof, efforts to build a foundation 
for those results have led to the recent devel- 
opments in algebraic geometry. A signifi- 
cant contribution to the modernization of 
the theory was made by 0. Zariski and K. 
Kodaira. 

The resolution of singularities of an alge- 
braic surface is one of the most fundamental 
problems in the field. When the tuniversal 
domain is the complex number field, function- 
theoretic methods were used by Italian-school 
geometers and R. J. Walker (Ann. Math., 36 
(1935)). Zariski introduced the tyaluation- 
theoretic method to deal with the problem 
when the characteristic of the universal do- 
main is zero. S. Abhyankar (1966) succeeded 
in resolving the case of positive characteristics. 



C. Divisors and Linear Systems 

In what follows, let S denote a nonsingular 
surface. S can be embedded into some projec- 
tive space. Let 27 be a linear system of divisors 
on S and /q, be a basis of the detining 
module for 27 over K. Associated with Z we 
have a rational mapping ^ 5^P" defined by 
Ov(P) = (/o(F), . . . ,/„(F)) for general points P 
on S. Pullbacks of hyperplanes by ^re 
called variable components of 27. Any member 
of Z" is a sum of a variable component and a 
fixed component of 27. Let S' denote the clo- 
sure of the image of S by . If dim S’ = 2, a 
general variable component is irreducible. If 
dim S’ = 1, then a general variable component 
is composed of an algebraic system of dimen- 
sion 1, which is called an algebraic pencil. 

These result from Bertini’s theorems. For any 
divisors D and D' on S, the intersection number 
(or the tKronecker index) I[D D') is defmed; 
this number is a symmetric bilinear form such 
that I{D- D) — I(D^ - D\) for any divisors Di and 
D\ linearly equivalent to D and D', respec- 
tively. If C is a nonsingular curve on S, /(C'D) 
coincides with the degree of the restriction 
C D of D to C. I(D - D) = {D^) issaidtobethe 
self-intersection number of D. If D is an ample 
divisor on S, then (D^) >0and/(D- C) > 0 for 
any irreducible curve C on S. These properties 
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characterize an ample divisor (Y. Nakai; ->.16 
Algebraic Varieties E). 

Let 27 be an irreducible linear system of 
dimension r( ^ 1), and let C be a generic com- 
ponent of 27. Let C' be a member of 27 different 
from C. Then the set of C-divisors C’ C’ forms 
a linear system of dimension Y- 1 on C. This 
is called the trace of Z on C and is denoted 
by Trt;27. The trace is, in general, not com- 
plete. The integer dim |Tr(.27| -dimTrc27 = 

(5(X) is called the deficiency of 27. The defi- 
ciency of the complete linear system | D | is 
denoted by S(D) . 

Let X be a point of S, and let (('^ be the local 
ring of X. Then is an talgebraic 

coherent sheaf, called the tstructure sheaf of S. 
Let D be a divisor on S. The sheaf of germs of 
rational functions f such that (y) + /) is 
denoted by G', (D) . Then 6^(0)) is a defin- 
ing module for the complete linear system D |. 
D>0 if and only if — D) is a sheaf of (Ps~ 
ideals. The quotient sheaf — will be 

denoted by 0,. If D is a tprime divisor, then (p^ 
is the structure sheaf of the algebraic curve D 
9 Algebraic Curves). Let be a sheaf on S. 
We set 

X(S, will be denoted simply by ;;f{S). We 
call p,(S) = 1 the arithmetic genus of 

the algebraic surface S. Sometimes ;((S) is re- 
ferred to as the arithmetic genus of S. We set 
Xs(^) = X(S) )((S, (5's( - £))). The integer p, (D) 

= 1 -XsiD) is, by definition, the arithmetic 
genus of the divisor D. If O is a prime divi- 
sor, then p, (D) coincides with the arithmetic 
genus of the algebraic curve, i.e., P, (D) = 

dim 



D. Riemann-Roch Theorem 

Let S be a nonsingular surface and let K de- 
note a canonical divisor on S, i.e., K = (o)) for 
some nonzero rational 2-form O) on S. If C is a 
nonsingular irreducible curve on S, (K -I- C)- C 
becomes a canonical divisor on C; hence, 
deg((/c -f C) • C) = 2pa{C) - 2 byacorollaryto 
the Riemann-Roch theorem on C (-*-9 Alge- 
braic Curves C). Since I[{K -t C) • C) = deg((/C -|- 
C) • C), it follows that Pa{C) = I{{K + C) ■ C)/2 -|- 
1. Moreover, the formula 

p,iD) = I{{K + DyD)/2+\ 

holds for an arbitrary curve D on S; this is 
called the adjunction formula. For any divisor, 
wedefinep, (D) to be I(D {D +K))I2+1 Then 

= l{D{D-K))/2 + xiS). 
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This formula is called the Riemann-Rocb 
theorem on S 366 Riemann-Roch Theo- 
rems C). Applying Serre’s duality theorem to 
D (->■ 16 Algebraic Varieties E) we have 

dim H‘{S, (9{D)) = dim &{K - D)) 

for 1 = 0 , 1 , 2 . 

In particular, dim H^{S,&{D))^l{K — D), 
which is called the index of speciality of D; 
dim H^{S, &{D)) — dim H^{S, (9{K —D)) is called 
the superabundance of D, denoted by h^{D). 

The inequality 

l{D) + l{K-D) 

>XiS,m) =l{D-{D-K))/2+x(S) 

is called the Riemann-Roch inequality, where 
equality holds if and only if h^(D) = 0 . If D is a 
curve with s connected components, then 

l{D + K)^h\-D) + p,{D)-s + i{S). 

In addition, if h^(0) = 0, then h^{ D) = 0 and 
thus 1{D + K) = p^(D)-s + x{S). This is called 
the Riemann-Roch theorem for the adjoint 
system. Note that D + is called the adjoint 
system of D. The Noether formula, (K^) + 
C 2 (S)= 12 z(S),isa special case of Hirzebruch’s 
theorem of Riemann-Roch type (- 366 
Riemann-Roch Theorems B). Here, t’ 2 (S) 
denotes the second Chem number of S, which 
coincides with the Euler number of S if K is 
the field of complex numbers. 

Eet Div(S) denote the group of all divi- 
sors on S; by linearity we can define the bi- 
linear form I{D • D') on Div(5)o = Div(S) 0 ^Q- 
J={DEDiv(S)o I{D- D') =0forallD'; is 
a subgroup and X = Div(S)o/J is a finite- 
dimensional vector space over Q, on which the 
nondegenerate bilinear form 1 is induced. I has 
a unique positive eigenvalue; thus the other 
eigenvalues are all negative. This is called the 
index theorem of Hodge; it is derived from the 
Riemann-Roch theorem on S. Prom this, we 
inferthatif = I{D- D)>0, then I{D-D')^^ 

Z )2 -D’^ for any D'. X is said to be the Neron- 
Severi group of S and dim A is said to be the 
Picard number of S (-" 16 Algebraic Vari- 
eties P). 

To study /(mD) as a fund jon of m » 0,0. 
Zariski writes an arbitrary effective divisor D 
as a sum of and D*~*eDiv(S)o with non- 
negative rational coefficients such that ( 1 ) 
is arithmetically effective (or, numerically 
semipositive), i.e., I • C) ^ 0 for any curve C 
on S; (2) = 0 or the intersection matrix of 

the support of is negative definite; and 
(3) = 0 . Such a decomposition is 

unique and is called the Zariski decomposition 
of D [7].IfmD'+> is a divisor for some m > 0, 
then — 



E. Invariants of Algebraic Surfaces 

There are many invariants besides the arith- 
metic genus discussed earlier. We set = 
dim, Q**). Then is equal to the 

number of linearly independent holomorphic 
2 -forms; it is called the geometric genus and is 
usually denoted by Pg. Since h^' ^ gives the 
maximum among the deficiencies ^(A) of linear 
systems on S, it is called the maximal defi- 
ciency of S. Por a divisor C such that /i* (C) = 

0, we have d(C) = /i°’'. The number was 
formerly called the irregularity of S, because 
h^' ^ was considered to be a correction term in 
the equality pJS) = Pg — h^'\ The study of 
higher-dimensional varieties showed, however, 
that it was unnatural to regard h'^' ^ as a cor- 
rection term. At present, by irregularity we 
mean the dimension of the Picard variety of S 
(^16 Algebraic Varieties P), and we denote 
this number by q. 

When S is defined over the complex number 
field, we have In particular, q = 

/jO’ ■ = This number is equal to the num- 
ber of linearly independent tAbelian simple 
integrals of the first kind; it is also equal to 
one-half the first Betti number of S. In cases 
with positive characteristic, these equalities 
do not hold in general. J.-P. Serre gave an 
example of an algebraic surface S SUCh that 
/|0’ ' and J. Igusa gave an example such 

that q <h^'^ =h^'^. Eet X be a canonical divi- 
sor of S. The number = l{iK) is called the i- 
genus, and P,- (i = 2, 3, . ) are generally called 
plurigenera. If P^ = 0 and d n, then P^ is also 
zero. The numbers Pg(S) = \ Pg(S) = 

^ /j ^ ^ /t ‘ , Pi (/ = 2 , 3, . . . ) (Pi = p,) are 
tabsolute invariants of S'; i.e., they take the 
same values for any nonsingular surface S’ 
that is birationally equivalent to S. However, 
h’*' is not an absolute invariant. Por a pro- 
jective plane, all plurigenera vanish and 
the irregularity q = 0. Thus if S is a rational sur- 
face (- 16 Algebraic Varieties J), i.e., a sur- 
face which is birationally equivalent to P^, 
all P„ = q = 0. Conversely, any surface with 
q = P 2 = 0 is a rational surface. This is called 
Castelnuovo’s criterion. A ruled surface is de- 
fined to be a surface that is birationally equiv- 
alent to a product of the projective line and a 
curve. A11P„ of a ruled surface equal 0 and 
any surface with P 4 = P^ = 0 is a ruled sur- 
face. This is called the criterion of ruled sur- 
faces (Enriques). 

F. Characteristic Linear Systems of Algebraic 
Families 

One of the central problems considered by the 
Italian school was to prove that the irregular- 
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ity q is equal to the maximal deficiency 
For that purpose, Seven introduced the notion 
of characteristic linear systems of algebraic 
families. Let iT be an irreducible algebraic 
family of positive divisors on S such that a 
generic member C of 27 is an irreducible non- 
singular curve, and let r be the dimension of 2". 
Let be a 1 -dimensional subfamily of £ 
containing C as a simple member, and let C 
be a generic member of . Then the speciali- 
zation of C'-C over the specialization C'->C is 
a well-defined C-divisor of degree n = /(C ■ C’). 
The set of C-divisors thus obtained is called 
the characteristic set. The characteristic set 
forms an (y — 1 )-dimensional linear system and 
contains Tr, C| as a subfamily. This linear 
system is called the characteristic linear system 
of Z. For any algebraic family of dimension r, 
we have V ^ dim Cl -i- q. In particular, there 
exists an algebraic family Z that contains oo® 
linear systems and such that for a generic 
curve C we have h^{C) = 0. For such an alge- 
braic family, we have the equality r = dim C\ 

+ q\ hence the inequality follows. 

Moreover, if the characteristic linear system is 
complete, we have q = /i®’ b The proof of the 
completeness of characteristic linear systems 
given by Severi is valid only in some special 
cases (e.g., the case Pg = 0). For a complex 
algebraic manifold, a rigorous proof was given 
later. When the characteristic is positive, the 
completeness does not hold in general (Igusa); 
however, for the surface with Pg = 0 , the com- 
pleteness holds (Y. Nakai). The completeness 
holds if and only if the Picard scheme of S is 
reduced [14]. 

G. Birational Transformations of Algebraic 
Surfaces 

Let S and S' denote nonsingular surfaces. If 
there exists a birational morphism 7: S->S', we 
say that S dominates S', and we write S ^ S'. In 
addition, if T is not an isomorphism, we write 
S > S’. In case there does not exist an S’ with 
S > S’, S is said to be relatively minimal. On the 
other hand, if we have S' > S for any S' which 
is birationally equivalent to S, S is said to be 
minimal. Any minimal surface is, by definition, 
relatively minimal. If a minimal (resp. rela- 
tively minimal) surface S is birationally equiva- 
lent to S', we say that S is a minimal (resp. 
relatively minimal) model of S' or of the field 
K(S). A necessary and sufficient condition for 
S to have a minimal model is that S not be a 
ruled surface (Castelnuovo and Enriques). 

Let S be a nonsingular surface and P be a 
point on S. Replacing P by a projective hne 
P\ we have a nonsingular surface S' and a 
birational morphism T;S'->S SUCh that T-'(P) 
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= £^P^ and S'-E^S-{P} by T. T:S'^S 
(or T~^ in some references) is said to be a 
locally quadratic transformation. Any bira- 
tional morphism between nonsingular sur- 
faces is a composition of locally quadratic 
transformations and an isomorphism. Given 
a birational mapping T: S->S' and P on S’, 
T-’fP} is called an exceptional curve when- 
ever it is not a point. Moreover, it is called an 
exceptional curve of the first kind if T is regu- 
lar along T"' {P}. Otherwise, it is of the second 
kind. Exceptional curves consist of irreducible 
rational curves. An irreducible curve E is an 
exceptional curve of the first kind if and only 
if (£^) = — 1 and £ ^ Pf S is relatively minimal 
if and only if S has no exceptional curves of 
the first kind; S is minimal if and only if S has 
no exceptional curves at all. A relatively mini- 
mal surface that is not minimal is a ruled 
surface. Such a surface is either P^ or a P*- 
bundle over a curve. In particular, a relatively 
minimal rational surface is either P^ or a P^- 
bundle over P’. Any surface of the latter type 
is occasionally called a Hirzebruch surface and 
its (K^) is 8, where K denotes ^ canonical 
divisor. However, (K^) of P^ is 9. Define tho 
linear genus of a rational surface to be 10. If S 
is not a rational surface, taking a relatively 
minimal surface S', we deline the linear genus 
p‘'*ofStobe {K^)+ 1 of S'. 



H. Examples of Algebraic Surfaces 

Let S„ denote a nonsingular surface in P^ 
defined by a homogeneous polynomial of 
degree m. Let H denote a divisor on in- 
duced from a (hyper)plane on P^. Then the 
canonical divisor K is linearly equivalent to 
(m — 4)H, i.e. K-(m-4)H. Hence, Pg = (m— 1) 
(m 2) (m 3)/6. Moreover, q = O', if K is the 
field of complex numbers, S„, is simply con- 
nected. S 2 is isomorphic to the product of two 
copies of P\ i.e, Sj = P’ X P^; hence, it is a ra- 
tional surface. S 3 is also rational. There exist 
27 lines on S 3 . Contracting 6 mutually disjoint 
lines among these 27 lines, we obtain a pro- 
jective plane. Conversely, given 6 points on 
P^ in general position, by performing locally 
quadratic transformations with these points 
as centers we get a cubic surface S 3 and a 
birational morphism T: S 3 -^P^. The inverse 
images by T of the 6 given points, the proper 
transforms of 15 lines connecting every pair of 
points chosen from the 6, and 6 conics passing 
every 5 of the 6 points by 7“*, exhaust 27 fines 
on S,. If m = 4, then K <v 0. In general, a non- 
singular surface with K ~ 0 and q=0 is said to 
be a K3 surface (-72 Complex Manifolds J). 
K3 surfaces have certain properties similar 
to Abelian surfaces that are defined to be 2- 
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dimensional Abelian varieties 3 Abelian 
Varieties). For K = C, the period mapping 
defined by integrating regular 2-forms has 
been studied extensively in connection with 
moduli theory. 

In general, surfaces with the bigenus ?2 = 

Pg= I and q =0 are birationally equivalent to 
K3 surfaces; surfaces with P^ = p^= I and q = 2 
are birationally equivalent to Abelian surfaces. 
Every Abelian surface has the involution i 
defined by i((x) = -a and the quotient surface 
by / is a singular surface with 16 ordinary 
double points. By performing 16 locally qua- 
dratic transformations successively with these 
singular points as centers we obtain a K3 
surface. Such a K3 surface is called a Kummer 
surface. The original singular surface is also 
called a Kummer surface. A quartic surface in 
with 16 double points is an example of a 
Kummer surface. 

The theory of birational classification of 
surfaces was developed by Castelnuovo, En- 
riques, and others of the Ttalian school. This 
theory has been extensively enriched and gen- 
eralized in various ways. Kodaira’s theory of 
analytic surfaces includes classification of 
algebraic surfaces (->■ 72 Complex Manifolds T, 
J), and classification of surfaces in the positive- 
characteristic case has been recently studied in 
detail [4, 5]. When the field K (S) is a subfield 
of a purely transcendental extension K(X,y), S 
is said to be a unirational surface. If the exten- 
sion K{X, y)/K(S) is separable, S is a rational 
surface. However, if it is inseparable, S may 
become nonrational, a K3 surface, an elliptic 
surface, or a surface of general type [5, 12]. 
Even for noncomplete surfaces, we have a 
classification theory [ 10] similar to the previ- 
ous ones. The following result is one of the 
applications: whenever S x A’ ^ A^, S is iso- 
morphic to A^, Here A" means the affine n- 
space (M. Miyanishi, T. Sugie, T. Eujita; 
[7,9,13]). 

Eet I be the ring of integers of a real qua- 
dratic field of discriminant d. The Hilbert 
modular group G = SL{2, /)/{ + 1 } acts on the 
product Jf ^ being the complex upper half- 
plane. The normal complex space Jf^/G can 
be compactified by adding a finite number of 
points and thus a compact norfhal surface is 
obtained. Resolving these singularities in the 
canonical minimal way, we have a nonsingular 
surface Y (d) over C, which is called the Hilbert 
modular surface with discriminant d. Y (d) is 
simply connected; hence, q[ Y(d)) =0. If d = 5, 8, 
12, 13, 17, 21, 24, 28, 33, 60, then Y (d) is a 
rational surface. If d = 29, 37, 40, 41, 44, 56, 57, 
69, 105, then Y (d) is birationally equivalent to 
a X3 surface. If d = 53, 61, 65, 73, 76, 77, 85, 88, 
92, 93, 120, 140, 165, then Y (d) is an elliptic 
surface with k{ Y (d)) = 1 . Otherwise, it is a 



surface of general type [ 19,201 in 72 Com- 
plex Manifolds). 

Eor 0-cycles on a surface 16 Algebraic 
Varieties J. 
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16 (VIII.4) 

Algebraic Varieties 

A. Affine Algebraic Varieties and Projective 
Algebraic Varieties 

Fix a field k. A subset of the n-dimensional 
taffine space k ” over k is called an affine alge- 
braic variety (or simply affine variety) if it 
can be expressed as the set of the common 
zeros of a (finite or intinite) set of polynomials 
i](A"i, , X,) with coefficients in k. Similarly, a 
subset of the n-dimensional '''projective space 
P”(k) over k that can be expressed as the set of 
the common zeros of a set of homogeneous 
polynomials G,( Y„ . , Y„) is called a projective 
algebraic variety (or simply projective variety). 
In this section, variety means either an affine 
or projective variety. A variety which is a 
subset of another variety is called a subvariety. 
These varieties are the forerunners of the 
modem, more general versions of algebraic 
varieties, which we will discuss later. 

When V is an affine variety in k ”, the set of 
the polynomials in k[X] = ,A„] that 

vanish at every point of V form an ideal I(V) 
of /c[X]. The residue class ring A, = k[X]//( V) 
is called the coordinate ring (or affine ring) of 
V. We can regard A, as the ring of k-valued 
functions on V that can be expressed as poly- 
nomials of the coordinates of k”. When Vis a 
projective variety, the tfiomogeneous ideal 
generated by the homogeneous polynomials in 
k[Y] = k[Y„ . , yj that vanish at every point 
of V is denoted by I(V), and the ring A, = 
k [ V]//( V) is called the homogeneous coordi- 
nate ring of V. 

A variety Vis said to be reducible or irreduc- 
ible according as it is the union of two proper 
subvarieties or not. A maximal irreducible 
subvariety of V is called an irreducible compo- 
nent of V. Any variety can be written uniquely 
as the union of a finite number of irreducible 
components. A variety V is irreducible if and 
only if /( V) is a ^'prime ideal. When that is the 
case, the field of quotients of A, (when Vis 
affine) or the subfield of the field of quotients 
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of A, consisting of the homogeneous elements 
of degree 0 (when V is projective) is called the 
function field of V and is denoted by k(V). 
Elements of k( V) are called rational functions 
(or simply functions) on the variety V. The field 
k(V) is tfinitely generated over k. The tran- 
scendence degree of k(V) over k is called the 
dimension of V. When V is reducible, the max- 
imum of the dimensions of its irreducible 
components is called its dimension. If VE is a 
subvariety of an irreducible variety V, then 
dim V — dim W is called the codimension of W 
on V. A subvariety of pure codimension 1 of 
an affine or projective space can be defined by 
a single equation and is called a hypersurface. 
If the ideal I(V) of a variety V of dimension r 
in a projective space P”(k) is generated by n-r 
homogeneous polynomials, then Vis called a 
complete intersection. Compared with general 
varieties, complete intersections have some 
simpler properties. On the other hand, many 
important varieties are not complete intersec- 
tions, e.g., ■''Abelian varieties of dimension ^ 2. 

The intersections and finite unions of sub- 
varieties on a variety V are also subvarieties. 
Thus the subvarieties can be taken as the 
'system of closed sets of a topology on V f- 
426 Topology), which is called the Zariski 
topology of the variety V. When k is the field of 
complex numbers, V can be viewed as an 
'analytic space, and the topology of Vas such 
(the “usual” topology) is much stronger than 
the Zariski topology. For the rest of this arti- 
cle, varieties will be considered as having Zar- 
iski topologies unless stated otherwise. Terms 
such as Zariski open, Zariski closed, and Zar- 
iski dense are used to mean open, closed, or 
dense in a Zariski topology. Suppose a con- 
dition (P) concerning the points of an irreduc- 
ible variety V (concerning the elements of a 
set M parametrized by the points of V) is 
satisfied in a nonempty Zariski open set of V. 
Then we say that the condition (P) holds at 
almost all points of the variety V or at general 
points of V (almost all elements of the set M). 

Eet U and V be affine varieties in k" and k'”, 
respectively. Then the product set U x Vis an 
affine variety in /("+'" and is called the product 
algebraic variety (or simply the product) of U 
and V. Note that the Zariski topology on 
U X Vis stronger than the product of the 
topologies of U and V. When k is 'algebrai- 
cally closed, U X Vis irreducible if U and 
V are irreducible. 

Suppose that k is algebraically closed. Eet ip 
be a tprime ideal of /c[X] = k[X^, ,X„], and 
let V be the affine variety in k" defined as the 
zero points of ip. Then I(V) = ip (t Hilbert zero 
point theorem) (- 369 Rings of Polynomials 
D). Therefore there exists a one-to-one corre- 
spondence between the set of prime ideals of 
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/c[AT] and the set of irreducible varieties in k”. 
In particular, the '•■maximal ideals correspond 
to the points of k”. Similarly, there exists a 
one-to-one correspondence between the set of 
homogeneous prime ideals of /c[ Y] other than 
2”=o Y] and the set of irreducible sub- 
varieties in P”(k). 

When we deal with nonlinear algebraic 
equations, we cannot expect a'simple, clearcut 
theory without assuming that k is algebraically 
closed. Hence we take an algebraically closed 
field K containing k and regard a variety V in 
k” as a subset of the variety Vf^ in K" defined 
by the same equations. From now on, we 
suppose that k is algebraically closed. If the 
ideal I(V) of k[Y] or Y] determined by a 
variety Vis generated by polynomials with 
coefficients in a subfield k' of k, we say that V 
is defined over k' or that k’ is a field of defi- 
nition for V. Any variety has the smallest field 
of definition, which is finitely generated over 
the prime field. In the theory of A. Weil [92], 
we fix an algebraically closed field K that has 
an infinite transcendence degree over the 
prime field. This K is called the universal 
domain. A point of Vis called a k'-rational 
point of V if all of its coordinates belong to a 
subfield k’ of K. Let Kj, K 2 be two extension 
fields of a field L, and let (x)gKJ, (yieX*. 

We say (y) is a specialization of the point (x) 
over L (notation: (x)-^(y)) if all polynomials 

/(Y)eL[Yj, X,] satisfying f(x) = 0 also 

satisfy f(y) = 0; in other words, if there exists a 
homomorphism of L-algebras L [Xj, ,xj 
L [y,, . , y„] mapping X; to y,-. Let K be the 
universal domain, Van irreducible variety in 
K", and k' {cz K) a field of definition for V 
having a finite transcendence degree over the 
prime field. Then there exists a point (x) of V 
such that all points of V are specializations of 
(x) over k’. Such a point (x) (in general not 
uniquely determined) is called a generic point 
of V over k’. The ring k'[x] is isomorphic to 
fc'[X]//( V) n/c'[2f] over k’. (Some authors use 
the term generic point to mean almost all points 
as defined earlier.) 

B. Local Rings 

Let V be an affine variety and let IV be an 
irreducible subvariety of V. Let be the 
subset of A, consisting of the elements that 
vanish identically on W. Then ip is a prime 
ideal of A,. The ring of tquotients of A, with 
respect to iPj^ is denoted by or by 
and is called the local ring of IF on V (or of 
V at W). Suppose for simplicity that Vis irre- 
ducible. Then is the subring [flg\f,g eAy, 
^Pk} of k(V), and the tresidue field of 
modulo the maximal ideal can be 



identilied with k( W). When a function (p 00 

V {(pek( V)) belongs to it is said to be 
regular at W. For a given function cpG k( V), 
the set of the points of V where cp is regular 
is Zariski open. In the case of a projective 
variety, the local ring is defined as the 
subring of the ring of quotients of A, with 
respect to consisting of the homogeneous 
elements of degree 0. 

A mapping from an open set (7 of a variety 

V to k that is regular at every point of U is 
called a regular function on U. The ring of 
regular functions on U is denoted by A,. By 
assigning A, to each open set U, we can define 
a •'sheaf of rings &y on V, of which the tstalk 
Oy at a point xe V is the local ring The 
sheaf (9y is called the sheaf of germs of regular 
functions on V (or the structure sheaf of V) 

(- 383 Sheaves H). 



C. General Definition 

Consider a pair (V, 0) of a topological space 

V and a sheaf (P of germs of mappings from 

V to k. If y has a finite open covering (Hj) 
such that each (C/,-, tP| (7;) is isomorphic to an af- 
fine variety (in the sense that there exists a 
homeomorphism from to that transforms 
0\ to the structure sheaf of the pair (V, d)) 
is called a prealgebraic variety over k, and & 

is called its structure sheaf. Usually (V, (9) is 
denoted simply by V. 

A regular mapping between prealgebraic 
varieties is detined as a continuous mapping 
g:V-*V satisfying (poge&y,c^OT any xeV 
and (p€ (Py',g(x)- Furthermore, if g is a homeo- 
morphism and g~^ is also regular, then g 
is called an isomorphism or a hiregular map- 
ping. The Cartesian product X x Y of pre- 
algebraic varieties X and Y is locally a prod- 
uct of affine varieties. Therefore X x Y has 
the structure of a prealgebraic variety. A pre- 
algebraic variety X is called an algebraic va- 
riety if the image of the diagonal mapping X ->■ 
X X X is closed in the Zariski topology of 
the product variety X x X (“separation con- 
dition”). (This definition is due to J.-P. Serre 
[81].) The separation condition corresponds 
to tHausdorff’s separation axiom. If IT is a 
locally closed subset (i.e., the intersection of 
an open set and a closed set) of an algebraic 
variety V, then it becomes an algebraic variety 
in a natural manner (the germs of regular 
functions at Fg VF are taken to be the germs of 
functions induced on W by the functions in 
Oy^p). Locally closed subvarieties of k" or P”(k) 
are called quasi-affme or quasiprojective alge- 
braic varieties, respectively. Definitions of 
irreducibility and local rings for general alge- 
braic varieties are given in the same manner 
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as before. In this artiele, algebraie varieties will 
often be referred to simply as varieties. 

The notion of an irredueible algebraie 
variety was developed from that of abstract 
algebraic variety (or simply abstract variety) 
defined by Weil. 

D. Schemes 

The set of prime ideals ( 71^ (1)) of a eommuta- 
tive ring A with unity element 1 is denoted by 
Spec(/4) and is ealled the spectrum of A. For 
any subset a of A, we denote by V(a) the set of 
the prime ideals eontaining a. We define a 
topology on Spec(/4) in whieh the elosed sets 
are V(a). This, again, is ealled the Zariski 
topology of Spec(ri). For an element / of A, the 
open set D(/) = Spec(/4)— V{f) is ealled an 
elementary open set. The elementary open sets 
form a base of open sets in the Zariski topol- 
ogy of Spec(y4). The set of elosed points is 
nothing but the set of maximal ideals of A. 
Assigning to each point of Spec(ri) the tring 
of quotients A„ we obtain a sheaf of rings A 
on Spec(v4). We have the equality r{D{f ), A) — 
Aj-, where Af is the iring of quotients by the 
multiplieative system {/" n'^0}.ln partieular, 
we have r(Spec(/4), ri) = A. Regarded as a 
tlocal-ringed spaee with A as the strueture 
sheaf, Spec(/4) is called an affine scheme. 

A loeal-ringed spaee X whieh is loeally 
isomorphie to an affine seheme is ealled a 
scheme. A morphism of schemes is, by defi- 
nition, a tmorphism between them as local- 
ringed spaces. Thus we obtain a '’'category 
whose objects are sehemes. We denote it by 
(Seh). Giving a morphism / : A^-»^Spec(/l) is 
equivalent to giving a ring homomorphism 
r(/): A^Y{X, 0,). Hence the eategory of 
affine sehemes (whieh is a tfull subeategory of 
(Seh)) is eontravariantly equivalent to the 
eategory of eommutative rings. If there is given 
a morphism of sehemes f:X^S,X is said to 
be an S-scheme or a scheme over S, and / is 
called the structure morphism and S the base 
scheme. For two S-schemes f:X~*S,g: Y-^S, 
a morphism of S-schemes is defined to be a 
morphism of schemes h’.X^X with / = g o /l. 
Thus we obtain the category of S-schemes 
denoted by (Sch/S). Spec(Z) is the unique tfina! 
object in (Seh), hence (Seh) is nothing but 
(Sch/Spec(Z)). 

The tfiber product always exists in (Seh). In 
fact in the case of affine S-schemes X = Spec(B) 
and Y= Spec(C) with S = Spec(y4) we have 
X X ,Y = Spec(B ®A C), and in the general case 
we construct X x ^ Y by patching together fiber 
products of affine schemes. 

A morphism /: is called separated if 

the image of the diagonal morphism ^x/s ■ 
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X->^X X 5 Y is closed. We also Say that X is 
separated Over S or X is a separated S-scheme. 
A scheme X is said to be separated if it is 
separated over Spec(Z). All affine schemes are 
separated. 

When K is a field, Spec(K) is a space having 
only one point and equipped with K as the 
stalk of the structure sheaf. For a point x of a 
scheme X, denote by k(x) the residue field of 
Gx^jc- For / ^^x,x we call the residue class off 
in k(x) the value off at x, denoted by f(x). We 
have a natural morphism i,: Spec(/c(x))->^Y 
whose image is {x}. More generally, we call a 
morphism i of a spectrum Spec(X) of a field K 
to X a point of X with values in K. Such a 
point is determined by a point x in X and an 
embedding of k(x) in K. A point of X with 
values in an algebraically closed field is called 
a geometric point. For a morphism /: X 
S and a point s in S, the fiber product X x 5 
Spec(k(s)) is called the fiber off over s and 
denoted by /"^(s). For a geometric point 
SpeclXl-^S, X X sSpec(K) is called a geo- 
metric liber. 

A scheme X is called reduced if the local ring 
at each point of X has no tnilpotent elements. 

A scheme is said to be irreducible if its underly- 
ing topological space is not a union of two 
proper closed subsets. A scheme is called in- 
tegral if it is reduced and irreducible. Every 
local ring of an integral scheme is an tjntegral 
domain. If a scheme X has an affine open 
covering { f/j = Spec(/1,)} such that every A^ 
is a tNoetherian ring, X is said to be locally 
Noetherian. A locally Noetherian scheme is 
called Noetherian if its underlying topological 
space is +(quasi-)Compact. 

A morphism f:X-^Y = Spec(ri) is said to be 
locally of finite type (of finite type) if X has an 
open affine covering (a finite Open affme cover- 
ing) { C/. = Spec(/li)} such that each ri; is a 
finitely generated A- algebra. A general mor- 
phism /: X Y is said to be locally of finite 
type (of finite type) if there is an open affine 
covering { !(} of Y such that every restriction 
of is locally of finite type (of 

finite type). If /: Y is (locally) of finite 

type we say that X is (locally) of finite type 
over Y. 

A scheme of finite type over a field K (i.e., 
over Spec( K)) is called an algebraic scheme 
over K. There is a tnatural equivalence of 
categories between the category of reduced 
separated algebraic K-schemes (as a full sub- 
category of (Sch/X)) and the category of alge- 
braic varieties over K (defined in Section C) 
equipped with regular mappings as mor- 
phisms. Hence we identify these categories 
from now on. Occasionally, algebraic vari- 
ety means irreducible variety. Nonalgebraic 
schemes are also important as tools for the 
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study of algebraic varieties. For example, for a 
point X in a scheme X there is a canonical 
monomorphism j,: Spec(tP_:f by which 

the unique closed point of Spec(d’;t_ J is map- 
ped to X. If for two algebraic K-schemes X, Y 
and for two points xeX, ye Y there is a K- 
isomorphism &x^x= ^^en suitable neigh- 
borhoods of X and y are isomorphic over K. 

Many concepts concerning varieties, e.g., 
dimension, generic points, specialization, can 
be naturally extended to the case of schemes 
by virtue of commutative ring theory. 

A morphism of schemes /: Y is called 

proper if it satisfies the following two con- 
ditions: (1) / is separated and of finite type, (2) 
for every scheme T and for every morphism T 

Y, the morphism X x yT-> T obtained from 
/ by the “change of base” is a closed mapping. 
We also say that X is a proper Y-scheme or X 
is proper over Y. A proper algebraic K-scheme 
is called complete. A projective variety is com- 
plete, while an affine variety over K is com- 
plete only when it is of dimension zero. Every 
algebraic variety can be embedded in a com- 
plete variety (M. Nagata [63]). 

A morphism of schemes /: X Y is called 
affine if every inverse image by f of an open 
alfme subset of Y is again an affine scheme. 

A morphism of schemes /; X ^ Y is called 
finite if it is of finite type and there is an affine 
open covering { = Spec(zl,)} of Y such that 

/ “'( C/,) = Spec(Sj), where is tintegral over 
A;. For a locally Noetherian scheme Y and a 
morphism of schemes J: X ->■ Y the following 
three conditions are equivalent: (i) / is finite; 
(ii) / is affine and proper; (iii) / is proper and 
every fiber off is a finite set. For a finite sur- 
jective morphism of Noetherian schemes / ; 

X -> Y, X is an affine scheme if and only if Y 
is an affine scheme. 

A morphism of schemesf: X-> Y is said to 
be flat if for each point x e X, ^ 

-module. If, moreover, fis surjective, 
then / is called faithfully flat. Assume that 
g: Y’-i- Y is a faithfully flat morphism of finite 
type of locally Noetherian schemes and f:X-* 
Y is a morphism of schemes. Then for many 
important properties of morphisms, / has these 
properties if and only if the pull-back fy : 

X X y Y’-i- Y’ has the same properties (theory of 
descent [29,30]). 

E. Cohomology Theory 

Fet (X, 0) be a ringed space. An 6?-Module 
(i.e., a sheaf of 0-modules) F is said to be 
quasicoherent if for each point x of X there 
exist a neighborhood U of x and an tgxact 
sequence M-^N^ Fiu-O, where M and N are 
free tP|[,-Modules. An ((-Module F is said to 



be of finite type if F is locally generated by a 
finite number of sections over &; F is of finite 
presentation if, locally, there exists an exact se- 
quence where p and q are posi- 

tive integers (they need not be globally con- 
stant); F is coherent if(i) F is of finite type and 
(ii) the kernel of any homomorphism 
(where n is an arbitrary positive integer, and JJ 
is an open set) is of finite type. Obviously, if F 
is coherent, then F is of finite presentation, 
which implies that F is quasicoherenet and of 
finite type. In the category of U-Modules, the 
full subcategory of coherent sheaves is closed 
under almost all operations of sheaves. If G 
itself is coherent as an 0-Module, G is said to 
be a coherent sheaf of rings. In this case every 
6?-Module of finite presentation is coherent. 

The structure sheaf of a locally Noetherian 
scheme is a coherent sheaf of rings. On a lo- 
cally Noetherian scheme X, every quasicoher- 
ent sub- @, -Module or quotient @, -Module of 
a coherent 6?;f-Module is coherent. A coherent 
((^,-Module on an algebraic variety V is called 
a coherent algebraic sheaf. 

Fet X = Spec(A) be an affine scheme. Then 
every quasicoherent (9;^- Module F on X is 
generated by its global sections. The corre- 
spondence F-*r{X, F) defines an equivalence 
beeween the category of quasicoherent sheaves 
on X and the category of A-modules; if A is 
Noetherian, then the coherent sheaves and the 
finite A-modules correspond to each other 
under this equivalence. 

Fet X be a separated scheme, and U = {U^} 
an affine open covering of X. For each quasi- 
coherent @, -Module F, the cohomology group 
FI^{X, F) is canonically isomorphic to the 
tCech cohomology H''(U, F) (- 383 Sheaves 
F). If X is of dimension d, then //^(X, F) =0 
for every sheaf F of Abelian groups on X and 
q>d. 

For a scheme X we define the cohomological 
dimension cd(X) to be the largest integer q 
such that H‘'{X, F) /O for a quasicoherent G^- 
Module F on X [35]. The cohomological 
dimension cd(X) does not exceed the dimen- 
sion of X. If X is an affine scheme, then cd(X) 

= 0. The converse is true under the assump- 
tion that X is Noetherian (Serre’s criterion 
[29, HI])- For an algebraic scheme X of dimen- 
sion n, cd(X) = u if and only if X is complete 
(S. F. Kleiman). 

Fet y": X Y be a proper morphism of 
Noetherian schemes. Then for every coherent 
(P;^-Module F and for every q^O, R%{F) 

200 Homological Algebra) is also coherent. In 
the special case of Y = Spec(k) with a field k 
this means that for an algebraic coherent sheaf 
F on a complete variety X the cohomology 
group //^(X, F) is a finite-dimensional vector 
space over k. 
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Let X be a scheme over k and let f be a 
locally free C/^^-Module of rank n (i.e., an (p^. 
Module which is locally isomorphic to If 
we take an open covering | C/,} of X and iso- 
morphisms q>i'. F\u.^^x\v,^ then cpjO (pf^ defines 
a morphism n L/^->GL(n, k), which is 
called the coordinate transformation of F. If we 
construct a t vector bundle B on X by the same 
coordinate transformations g^j, then F can be 
regarded as the sheaf B(B) of germs of sections 
of B. By means of the canonical homomor- 
phism GL{n,k)^PGL{n — l, k) wecan con- 
struct a projective bundle P(F) on X (which is 
said to be associated with F). (Note that in 
[29], P(F) is defined to be a projective bundle 
with coordinate transformations ‘< 51 . 7 i.e., as- 
sociated with the dual of F in our sense.) This 
procedure of associating P(F) with a locally 
free (P^j-Module F can be generalized for any 
quasicoherent Os-Module of finite type on 
an arbitrary scheme S. 

A closed (locally closed) S-subscheme f:X-> 
S of p: P{F)^S is called a projective scheme 
(quasiprojective scheme) over S, or f is said to 
be a projective morphism (quasiprojective mor- 
phism). A projective morphism is proper. A 
reduced projective scheme over a field k is 
nothing but a projective variety OVCF k. We 
can develop the theory of projective schemes 
by means of tgraded rings in a way similar to 
affine schemes. 

A locally free Gj^-Module of rank 1 is called 
an invertihle sheaf. Invertible sheaves corre- 
spond to line bundles up to isomorphisms. Let 
P = P’^(k) be a projective space, (y„ y„ . . , y^v) 
a system of homogeneous coordinates of P, 
and [/• the open subset of P defined by ^ 0 . 
Denote by G(n) the invertible sheaf on P de- 
fined by the coordinate transformation g^j — 
(yj/yiT- More generally, let p: P = P(F)->’S'be 
the projective bundle associated with a locally 
free Us-Module F of rank N -|- 1 on a scheme 
S. Then there is an invertible sheaf 0(n) = 

Gil)®" on P with the properties: (i) for each 
seS its restriction to the fiber p~^{s) ~ P*''(/c(s)) 
is G(n)detined above; (ii) P*(tC(n))= S"(F) for 
n>0 where S"(F) denotes the nth symmetric 
product of F. The invertible sheaf (P(l) is called 
the tautological line bundle on P. (Note that 
(P( 1 ) in the sense of [29] is (P( - 1 ) in our sense, 
but since the definition of P(E) is also differ- 
ent, the above property (ii) holds without 
modification.) 

For a quasiprojective S-scheme f:X-^P{E) 
-^S, the restriction of 1) to X is denoted by 
1) (or simply G( !))• An invertible sheaf L on 
X is called very ample over S if there exist a 
locally free G^-Module of finite type £ on S 
and an S-immersion i: X ->-P(F) such that 
(Dxi 1) = F; £ is called relatively ample over S or 
S-ample if L®" is very ample over S for a cer- 



16 E 

Algebraic Varieties 

tain q > 0 (- Section N). When S is an affine 
scheme, an ample (very ample) sheaf over S is 
simply called ample (very ample). There is the 
following cohomological criterion of ample- 
ness (generalized Serre’s theorem). 

Let Y be a Noetherian scheme, /; AT Y a 
proper morphism, and L an invertible 
Module. Then the following four conditions 
are equivalent: (i) L is f-ample; (ii) for each 
coherent @, -Module F there is an integer 
N such that R‘’ft^{F (X) L®") = 0 for all n ^ N 
and q>0; (hi) for each coherent sheaf of 
ideals J of there is an integer N such that 
R % ( ® L® ") = 0 for all n ^ N . They imply 
the condition (iv): for each Coherent 
Module F there is an integer N such that the 
canonical homomorphisms /%(F (x) L®")^ 

F 0 L®" are surjective for all d ^ N . 

Let X be a scheme proper over a field k 
(such an X is called a k-complete scheme), 
and let £ be a coherent Q-module. Then all 
F) are finite-dimensional k-vector 
spaces (finiteness theorem), and x{F) - 
Z( ■” 1)^ dim W{X, F) is a finite number, 
called the Euler-Poincare characteristic of F 
over X. For every invertible sheaf L, x{E ® 
L®"*) is a polynomial in m, which is said to 
be the Snapper polynomial [48]. Supp F, de- 
fined to be the set {x e X is a closed 

subset of X and is called the support of F. The 
degree of the polynomial 0 £®“) is at 
most Y = dim supp F, and we have ^{F 0 L®'") 

= e • m7r! + lower terms in m for some e e Z. e 
is said to be the intersection number of U with 
F [48], When W is a closed subscheme of X 
defined by an (C;^-ideal I, {U \N) is defined to 
be the intersection number of U with &xj^ , 
where r = dim W . If L is ample, then (£*' ‘ W ) > 

0 for any W . In particular, when W = X, it 
follows that (L") > 0, where n = dim X. The 
converse of this fact is the Nakai-Moishezon 
criterion, saying that if {U W ) > 0 for any 
closed subvariety W of X where r = dim W, 
then L is ample [56,64]. 

When L is ample, //«(Y, F ® L®"*) = 0 for 
anyq > 0 and for sufficiently large m; hence 
X{F 0 L®"*) = dim //”(X, F 0 L®™) and the 
Snapper polynomial /(F ® L®"*) is also called 
the Hilbert polynomial or the Hilbert charac- 
teristic function of F. If L is the invertible sheaf 
defined by a hyperplane section of X in i.e., 
£ = &x{\), then x(£®'") = dim //°(Y, L®'") = 
dim for sufficiently large m where 0R„ 
denotes the homogeneous coordinate ring of 
X in 

In general, for a complete irreducible variety 
V of dimension Y, we put 

x(v)=xiOv)=t(-'ri<°-- 

q = 0 

(^o-« = dim /f^( K Gy)) and call it the arithmetic 
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genus of V . Classically, the number p,(V) de- 
fined by = = 

•+■... was called the arithmetic genus of 
V, instead of /( V ). When V is a nonsingular 
irreducible curve, p,(V) is the usual tgenus. If 
V is a projective variety, the constant term of 
the Hilbert polynomial is ;^(K) and the coeffi- 
cient of its highest term is (deg V)!r\. 

Let V be a tnormal variety, and D a tdivisor 
on V . If, for each point x G L, we denote by 
&{D)^ the set of the functions /g k( V ) that 
satisfy (f) + D ^ 0 on some neighborhood of x, 
we obtain a coherent algebraic sheaf &{D), and 
we have dim V, 0(D)) = 1(D). if, moreover, V 
is complete, we put;f^,(D) = x(V) — x(^(~ ^)) 
and call it the virtual arithmetic genus of D. 
Classically, the number p,(D) =( — 1)'"‘^(^^(D) 

— 1) was called by that name. When D is effec- 
tive and has no multiple eomponents, Xvi^) 
coincides with the arithmetic genus of D re- 
garded as a variety. In general, Xvi^) stays in- 
variant if D is replaced by a divisor that is 
algebraically equivalent to D (->• Section N). 

If is a Cartier divisor, 0(D) is an inver- 
tible sheaf. For two Cartier divisors D 2 , 
&(D^ -t- D 2 ) = (P(Di) 0 &{D 2 ), and Dj and D 2 
are linearly equivalent if and only if tP{Dj) = 

mi)- 

Let V be an irreducible nonsingular variety 
and let denote the sheaf of germs of regular 
differential forms of degree p (Q*’ = &y). If V is 
complete, then we denote dim V, Q^) by 

Serre’s duality theorem: Let V be a nonsin- 
gular complete irreducible variety of dimen- 
sion r, B an algebraic vector bundle over V, 
and B* the dual vector bundle of B. Denote by 
^ and J'* the sheaves of germs of seetions of 
B and B*, respectively. Then (i) fT( V, Q'') is 
canonically isomorphic to k, and (ii) //“*( V , 
and H' V, J** ® Q'’) are dual to each other as 
linear spaces by means of the +cup product of 
the above spaces with H'{ V, Q’’) = k. In partic- 
ular, V , Q^) is dual to K hence 
we have h^'‘* ~ 

This theorem was extensively generalized 
by Grothendieck in the category of schemes 
[1,331. 

When the field k is of eharaeteristic 0, we 
furthermore have h”'^ = by complex con- 
jugation 232 Kahler Manifolds C); but in 
characteristic p, there are examples for which 
this symmetry does not hold [61], In general, 
is a trelative invariant but not an tab- 
solute invariant; however, as is the dimen- 
sion of the hnear space of the differential forms 
of the first kind with degree p, is an ab- 
solute invariant. Hence is also an absolute 
invariant in the case of characteristic 0. When 
the characteristic is positive, the absolute 
invariance of has not yet been proved. 



F. Simple Points and Singular Points 

Let V be a variety over an algebraically closed 
field k. We say that a point P of V i S simple 
or that V i S nonsingular or smooth at P if the 
local ring Op is a tregular local ring. Since the 
problem is local, we may assume that V i S an 
affine variety in k" . Then the simplicity of P on 

V i S equivalent to the following condition: P is 
contained in only one irreducible component 
of V, and if that component has dimension r 
there exist n-r polynomials Fi(X) in I(V) such 
that rank {dFJdX^^ii)=p = n — r. A point of V 
that is not simple is called a singular point or 

a multiple point. The set of singular points on 

V (called the singular locus of V ) is a proper 
closed subset of V . A variety with no multiple 
points is called smooth or nonsingular. 

This notion can be made relative. A mor- 
phism y: X -4 Y of a loeally Noetherian scheme 
is called smooth if / is flat and locally of finite 
type and all the geometric fibers off are non- 
singular. In the case of an affine morphism 
/: A = Spec(i? [X. , . . . , X,,Mfi , • • • , /J)- 7 = 
Spec(R) of relative dimension r (by which we 
mean the dimension of the general fiber) with 
a Noetherian ring R, the smoothness off 
amounts to a condition that rank ((dfi/6Xj){x)) 
= S at each point x of X. 

When for a point P of a variety V the loeal 
ring Op is '•'normal, P is called a normal point. 

A simple point is normal. The set of normal 
points is a nonempty open subset of V . An 
irreducible variety whose points are all normal 
is called a normal algebraic variety (or simply 
normal variety). The singular locus of a normal 
variety has eodimension ^ 2. For an irreduc- 
ible variety V, there exists a pair (V', f) of a 
normal variety V ' and a birational finite mor- 
phism / : K' V ; V ' is unique up to isomor- 
phisms and is called the derived normal model 
or normalization of V. 

Simplicity and normality for V at a sub- 
variety W are defined in the same way as at 
a point by using the loeal ring Cy 

For a morphism y ; X Y of locally Noe- 
therian schemes, locally of finite type, the fol- 
lowing three conditions are equivalent: (i) f is 
smooth and every fiber off is a discrete set; (ii) 
fis flat and every geometric fiber over Spec(/v) 
off is a union of spectra of fields isomorphic 
to K ; (in) y is flat and every fiber off over 
y G Y is a union of spectra of fields that are 
finite tseparable extensions of k(y). These 
conditions are local with respect to X. If a 
morphism f satisfies these equivalent con- 
ditions, we say that f is etale or X is etale over 
Y. A morphism 

f: X = Spec(P[X„ , XJ/(y„ . ,Q) 

Y= Spec(P) 
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is etale if and only if dQ\{{dfJdXj){x))^^ for all 
XBX. Hence etale morphisms correspond to 
local isomorphisms in the analytic category. 
For a surjective etale morphism /; X -> Y many 
important geometric properties (reduced, 
integral, normal, nonsingular, etc.) hold on 
X if and only if they hold on Y (theory of 
descent). 



G. Dimension Theorems 

Let V be an irreducible variety, and let \J and 
W be irreducible subvarieties of V. Then any 
irreducible component of (7 fl W that is simple 
on Fhas dimension ^ dim U + dim W — dim V. 
When the equality holds, the component is 
called a proper component of the intersection 
(/ n W. If every component of (/ fl W that is 
simple on Vis proper, we say that U and W 
properly intersect on V. Any two subvarieties 
U and W of P”(k) with dim U + dim W^n 
intersect each other. When Vis an irreducible 
r-dimensional variety in P"(/c), the number of 
points of intersection F fl L of F with an {n — 
r) -dimensional linear variety L is independent 
of the choice of L as long as L is in a “gen- 
eral position.” This number is called the degree 
of F and is denoted by deg( V). Letting (P( 1) be 
the fundamental sheaf of P”(k), we have deg F 
= {(9{\J- V) (- Section E). 



H. Group Varieties 

An algebraic variety G is called an talgebraic 
group if it has a group structure and if the 
mapping G x sending (x, y) to xy~^ is 
a morphism. Every algebraic group is quasi- 
projective (Chow). If G is irreducible, then 
it is also called a group variety; a complete 
group variety is called an tAbelian variety (- 
3 Abelian Varieties B, 13 Algebraic Groups B). 
A scheme G over another scheme S equipped 
with morphisms over S:G x ^G-^G, G->G, and 
S->G, called multiplication, inverse, and unit 
section, respectively, which satisfy the relations 
corresponding to the usual axioms of group, is 
called a group scheme (over S). As a point set, 
G is not a group, while, for any scheme T over 
S the set G(T) = Homs( T, G) of the morphisms 
from T to G is a group (-52 Categories and 
Eunctors M). Consider an algebraic group 
scheme G over S = Spec(/c). If the characteristic 
of k is zero, then G is necessarily reduced, so 
an algebraic group scheme over k is essen- 
tially the same as an algebraic group; if k has 
characteristic p, there exist algebraic group 
schemes over k that are not reduced. 
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I. Rational Mappings 

Eet / : V-t F’ be a morphism of varieties. If F 
is not complete, the image f(V) is not always 
closed; the closure off(V) (in V’) is called the 
closed image of F The image f(V) contains an 
open dense subset of the closed image. 

Eet F and IF be irreducible varieties. A 
closed subset T of Fjc IF is called an algebraic 
correspondence of F and W. We say that points 
P e F and Q e IF correspond to each other by T 
if (P, Q)e T. If T is irreducible and the closed 
image of the projection T-^V coincides with 
V, then the function field k(V) can be identified 
with a subfield of k(T); if we have k(V) = k(T) 
with this identification, then T is called a 
rational mapping from F to IF. Moreover, if 
the same conditions are satisfied for IF, then T 
is called a birational mapping (of birational 
correspondence or birational transformation), 
and in this case we have k(V) = k( IF). A mor- 
phism can be considered a special kind of 
rational mapping by taking the graph. If P is a 
rational mapping from F to IF and IFi is the 
closed image of T in IF, then k( VFj) can be 
regarded as a subfield of k(T) = k( V). If k(V) is 
tseparably generated (tpurely inseparable) over 
k( IFi), then T is said to be separable (purely 
inseparable). 

Eet P be a rational mapping from F to IF, 
and let F’ and IF’ be irreducible subvarieties 
of F and IF, respectively. If there exists an 
irreducible subvariety P’ of P whose projec- 
tions have the closed images F’ and IF’, then 
we say F’ and IF’ correspond to each other by 
P. The union of irreducible subvarieties of IF 
that correspond to V by P is a closed subset of 
IF; it is called the proper transform of F’ by P 
and is denoted by P[F']. Note that F" 
does not imply T[F'] D P[ F”/. The set of 
points of IF that correspond to the points of 
F’ is called the total transform of F’ by P and 
is denoted by T{ V’}. Identifying k(T) with 
k(V), we have Cj. j, z) in general; if the 
equality holds, we say that P is regular (or 
defined) along F’, In that case, IF’ is the unique 
irreducible subvariety of IF corresponding to 
F’ by P, If F’ is simple and of codimension 1 
in F, then P is always regular along F’, The set 
V of points of F at which P is regular is a 
nonempty open subset, and the restriction of P 
to U defines a morphism from U to IF. A 
rational mapping can be defined as the closure 
of the graph of a morphism defined on an 
open subset of F, The study of a rational 
mapping can be reduced to that of the bira- 
tional morphism T-*V Zariski’s main theo- 
rem: Eet S : X Y be a birational mapping, 
and assume that the inverse Y-X is 
regular, and that X is normal along an irre- 
ducible subvariety X’. If there exists an irre- 
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ducible component Y’ of S[Y'] with dim 
dim Y’, then S is regular along X’. It follows 
from the above main theorem that, if T:V-* 

W is a rational mapping and if P is a normal 
point of V such that T[P] contains an iso- 
lated point, then T is regular at P. 

For a birational mapping T:V^W between 
complete irreducible varieties, a subvariety V 
of V is said to be fundamental when dim T[ F'] 
> dim V'. When V’ is a point (curve) we say V’ 
is a fundamental point (fundamental curve) with 
respect to T. The most classical example of a 
birational correspondence with fundamental 
points is the quadratic transformation T of a 
projective plane onto itself given by (xq: Xj : X 2 ) 
^(x,X 2 :x 2 Xo;XoX,) with a suitable coordinate 
system. In this case, = identity and the 
points P. defined by Xj = 0 for j / correspond 
to the lines x,- = 0 by T. Let Cr, denote the 
group of birational transformations of P”(k) 
into itself. These transformations are called 
Cremona transformations when n ^ 2. Cr, is 
generated by linear transformations and qua- 
dratic transformations (M. Noether). Recently, 
Cr, has been studied in detail by M. Dema- 
zure [21] and H. Umemura [91]. Let V be a 
complete nonsingular irreducible variety ovCf 
k It is called relatively minimal if every bira- 
tional morphism from V to a complete non- 
singular variety V is an isomorphism. It is 
called minimal if every birational mapping 
from a nonsingular variety V’ to Vis a mor- 
phism. Replacing birational mapping by ra- 
tional mapping, we can define strong (abso- 
lute) minimality. Abelian varieties and com- 
plete nonsingular curves with positive genera 
are strongly minimal. In general, minimality 
implies relative minimality. 

Zariski’s main theorem is closely related to 
the general connectedness theorem due to W. 
Fulton and J. Hansen stated as follows: LetP 
be the product of r copies of P™ and A be the 
diagonal subvariety of P. If X is an irreducible 
projective variety and /: X is a morphism 
with dim f{X)>(r— \)m, then /“^(A) is con- 
nected [22]. The following results are derived 
from this theorem: (1) if X is singular with only 
normal crossings (- Section L), then X can- 
not be imbedded in , where n = dim X. (2) 
Let X be a nonsingular subvariety of which 
is not contained in any hyperplane. Suppose 
that by the projection tt^: P'''->-P'^“’ with cen- 
ter X, X being a general point of P^, X is iso- 
morphic to nJ^X). Then 3 ■ dim X ^ 2{N — 2) 
(Zak’s theorem). (3) Let X be a nonsingular 
subvariety of P^ and H be an arbitrary hyper- 
plane section of X. Then dim(Sing H) < 
codim (J. Roberts). 

By a similar connectedness theorem, it was 
proved that if C is a (reducible) curve with 
normal crossings on P^ over C, then the fun- 



damental group of P^ -C is commutative, 
which was conjectured by 0. Zariski [20]. 

J. Rational Varieties 

An irreducible algebraic variety V over k 
whose function field is purely transcendental 
over k is called a rational variety. A complete 
smooth surface $ over an algebraically closed 
field is rational if and only if = <y(5) = 0 
(the Castelnuovo-Zariski criterion, 15 Alge- 
braic Surfaces E). 

If the function field of V has a finite alge- 
braic extension which is purely transcendental 
over k, then Vis called unirational. A unira- 
tional curve is in fact rational. More generally, 
if the function field k( C) of a curve C over 
k is contained in a field finitely generated 
and purely transcendental over k, then C is 
rational (LUroth’s theorem). A unirational 
surface over an algebraically closed field of 
characteristic zero is rational by virtue of the 
above criterion, but in the case of positive 
characteristic there are unirational surfaces 
which are not rational (Zariski). There are 
nonrational unirational threefolds even of 
characteristic zero; for example, all smooth 
cubic hypersurfaces in P"*^ (C. H. Clemens and 
P. A. Griffiths [16]), and some smooth quartic 
hypersurfaces in P"^ (V. A. Iskovskii and Yu. 1. 
Martin [45]). See [6]. 

K. Monoidal Transformations 

Let y be an irreducible variety and J be a 
sheaf of ideals of 6y. For any affine open set V 
of V, J\U is determined by an ideal a of the 
coordinate ring A of (J, Let a,, a,, , a, be a 
system of generators of a, and let U' be the 
graph of a rational mapping from U to P'”(k) 
such that the points PeV and 
: a^(P))e P"‘(/c) correspond to each other by 
U'. Then U’ is uniquely determined (up to 
isomorphisms) by U and a only. Suppose 
V has a covering by affine open sets Uj. We 
obtain L/ over (7; as before. By patching them 
together, we get a birational morphism T 
K'-> V, which is unique up to isomorphisms. 
This T with V’ is called the monoidal trans- 
formation or blowing up of Vby the ideal 
sheaf Note that in some references, the 
inverse transformation T * is said to be the 
monoidal transformation. The inverse image 
ideal T“^'(,/) is an invertible sheaf, that is 
relatively ample over V. If W is the support of 
and W# V, then T-‘(W) has codimen- 
sion 1 and T gives rise to the isomorphism 
from V’ — T~^{W) onto V- W. Thus one can 
say that V’ is obtained from V by replacing W 
with T-‘(W), which is locally defined by prin- 
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cipal ideals. If every point of is a nonsin- 
gular point of V and [V is itself nonsingular, 
T~^{W) is the projective bundle over W whose 
fiber is the projective r-space where r = dim V 
— dim W— 1. In general, if ly is a subvariety 
defined by a sheaf of ideals J, then T is said to 
be the monoidal transformation with center W. 
In particular, if W is a point, T is called the 
(locally) quadratic transformation. 



L. Resolution of Singularities 

Given an arbitrary irreducible variety V, we 
have the problem of finding out a nonsingular 
projective variety V' birationally equivalent 
to V . This is called the problem of resolution 
of singularities. In the case of characteristic 
zero, this problem was solved by Zariski 
(1944) for dimension ^ 3 and by H. Hironaka 
(1964) [37] for any dimension. In the case of 
characteristic p, S. Abhyankar solved the 2- 
dimensional case (1956) and the 3-dimensional 
case (1966). Hironaka’ s theorem of resolution 
of singularities is stated as follows: Let V be a 
variety over a field of characteristic zero. Then 
there exists a finite sequence of morphisms of 
varieties: ->■. Lq = V such that (1) 

has no singular points, (2) each 1] is a 
monoidal transformation of with center Dj, 
(3) each £); is a nonsingular subvariety, (4) 
each Vf is normally flat along D^. Here, Vis 
said to be normally flat along a subscheme D 
of V defined by the sheaf of ideals if the 
quotient modules are flat (9q^^- 

modules for all p and all points x of D. 

Let 1^ be a nonsingular variety and D an 
effective divisor on V. D is said to be a divisor 
with (only) normal crossings at x e K, if D is 
defined by /j . such that ( /j , ,/,) is a part 
of a tsystem of local coordinates around x. D 
is said to be a divisor with only normal cross- 
ings, if it is so everywhere. For any subvariety 
W on F, there exist /j : F -> V that is a composi- 
tion of monoidal transformations with non- 
singular centers, such that h~^{W) has only 
normal crossings. This results from Hironaka’s 
main theorem II [37]. The normal crossing 
divisor is also defined for complex manifolds, 
and similar results hold. 



M. Cycles and Divisors 

Let V be an irreducible variety. We denote by 
the set of r-dimensional irreducible sub- 
varieties of V that are simple on V (i.e., are not 
contained in the singular locus of V), and by 
3r(F) the free Abelian group with basis 23,. 
Elements of 3r( V) are called cycles of dimen- 
sion Y (or r-cycles) on V . Let A and B be r- 
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cycles; A = Z A,-, B = S A, (A; 6 ^ 

if i If Uf ^ nii for all i, then we write A ^ B. 

If A ^ 0, then A is said to be a positive cycle. 

For a 0-cycle A the integer deg(A) = 

Z H; is called the degree of A . 

A cycle of codimension 1 is called a divisor. 

A divisor ^ 0 is usually called effective in- 
stead of positive. If Vis of dimension d and if 
W • 23,-, , the local ring is a (discrete 

valuation ring. The tnormalized valuation 
defined by it is denoted by ). For a func- 
tion /€ k( V), we say that W is a zero of order n 
if = n > 0, and that W is a pole of order 
- n if Vfy(f ) = n<0. Any function /e k( V ), 
other than the constant 0, has at most a finite 
number of zeros and poles. We denote by 
(/)q the sum Z Vfy{f) W extended over all the 
zeros W off, and put (/"‘)o = (/)oo and (/)q- 
(f),=(f). We call (/)o, (/)oo, and (/) the 
zero divisor, the pole divisor, and the divisor 
off; respectively. The divisor (f) is equal to 
2 /) 1^5 where the summation is taken over 

lF£©rf_j, and we have (/^) = (/) + (^). When V 
is complete and the singular locus of V has 
codimension > 1, then fis constant if and only 
if(f), =0 (or (/)q = 0). Fet Dj and D 2 be divi- 
sors; if there exists a function /( #0)e k(V) 
such that -D 2 =(/), then and D 2 are 
called linearly equivalent to each other and we 
write Dj ~ D 2 . The linear equivalence class 
containing a divisor D is denoted by cl(D). A 
divisor which is linearly equivalent to 0 on a 
neighborhood of each point of V is called a 
Cartier divisor (some authors call a Cartier 
divisor simply a divisor). If V is smooth, then 
any divisor is a Cartier divisor. If a divisor D 
can be written as D =(f) on an open set U, 
then the function fis called a local equation of 
D on U. Fet T: F' ^ V be a rational mapping 
from a normal variety V ’ to a complete variety 
V , let D be a Cartier divisor on V , and assume 
that the closed image of T is not contained in 
D . Since T is regular on some open set U such 
that codim( V ’ L) > 1, we have a morphism 
(p = T\u and the pullback (/)*(£)) defined by 
composing the local equations of D with (p. 
Taking the closure of this divisor in F', we 
obtain a divisor on V’, which is denoted by 
T*(D). 



N. Divisors and Linear Systems 

Fet V be a complete irreducible variety, /q, 

,/i) ...)/„ be elements of the function field k(V) 
of F, and D be a divisor on V satisfying (fi) + 

D ^ 0 for each i- Then the set of the divisors 
of the form (Z aj^) + D, where the are 
elements of k and not all zero, is called a linear 
system. The linear space fc/o + /(/,+ ■ +kf„ is 
called a defming module of 2^. The divisors in 
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2T are positive and are linearly equivalent to 
each other; if every positive divisor that is 
linearly equivalent to a member of Z belongs 
to then iT is said to be a complete linear 
system. For any linear system there exists a 
unique complete linear system containing it, 
which is denoted by IZ"!. The maximal positive 
divisor Dq that is contained in all divisors of E 
is called the fixed component of E, and for 
each D E C we call D —Dq the variable compo- 
nent of D (or of E). A point P of F is called a 
base point of a linear system 27 if P is on each 
variable component of E. A linear system E 
is called irreducible if its generic member is 
irreducible; otherwise it is called reducible. The 
dimension of a defining module of a linear 
system E is denoted by 1(C); we call 1(C)- 1 the 
dimension of E and denote it by dim E. A 
linear system of dimension 1 is called a linear 
pencil. 

A defining module of a linear system E is 
determined uniquely up to k-isomorphisms. 
Let L be a defming module, and let /q,/i, • ,/„ 
be a linearly independent basis of L over 
k. If we associate to each point P of V the 
point Q = {fo{P):fi{P):...:f„{P)) of the n- 
dimensional projective space, then we obtain 
a rational mapping 02- from V to another 
variety V ' . Outside the base points of E, the 
rational mapping regular; and the base 

points are the fundamental points of We 
say that 0^ is the rational mapping defined 
by the linear system E. When Chas no tixed 
components and is a closed immersion, 

E is said to be very ample (or ample). For a 
divisor U, the set of positive divisors that are 
linearly equivalent to D is a linear system, 
which is called the complete linear system and 
is denoted by |£)|. We usually write 1{D) in- 
stead of /( |Z)|). If |D| is very ample, we say that 
D is very ample. We say that D is ample (or 
nondegenerate) if mD is very ample for some 
m >0. 



0. Differential Forms 

Let V be an n-dimensional irreducible variety, 
and let A = k( V ) be its function field. We de- 
note by D* the set of derivations of over 
k, i.e., the k- linear mappings D : satisfy- 

ing D{fg) = D{f)g+fD{g). Then T)* is an n- 
dimensional linear space over R. Let X) denote 
the tdual space of I)* over R. For each f 
let df be an element of X) defined by {df, D} = 
D(f) (D G D*). Let X 1 ,^ 2 , . . . , x„ be a separat- 
ing transcendence basis of 51 over k, in the 
sense that Xj, ,x„ are algebraically inde- 
pendent over k and 5^ is a tseparable algebraic 
extension over k(x j, ,x„). (Such a basis exists 
under the weaker hypothesis that kis tperfect.) 



Then dx,, . . . . dx, form a basis of X) OVer R. 

The homogeneous elements of degree r of the 
tGrassmann algebra of X) over 51 are called 
differential forms of degree r on V (or rational 
r-forms). The set of the differential forms of 
degree n is a 1 -dimensional linear space over 51 
spanned by dx, A dx, A . A dx,. 

A set of n functions /i, ,/„ in 5^ is called a 
system of local coordinates on an open set JJ of 
V if/i -fiiP), • J„-f„{P) is a tregular system 
of parameters of the local ring Op for each 
Pe V. In that case, /j, ,/„ is also a separat- 
ing transcendence basis of 5^. If P is a simple 
point of V, then there exists a system of local 
coordinates on a suitable neighborhood of P. 
Let ft) be a differential form of degree r on V, 
and write (o = Ef, < a a 

where (/i , . . ,/„) is a system of local coordi- 
nates around P ■ If the coefficients are 
regular at P , then ft) is said to be regular at P . 

When Visa complete variety without sin- 
gular points, a differential form that is every- 
where regular on V is called a differential form 
of the first kind (or a regular form); the dif- 
ferential forms of the first kind are determined 
by the function field A and are independent of 
the choice of the nonsingular model V . The 
number of linearly independent differential 
forms of the first kind, of degree n, is denoted 
by Pg and is called the geometric genus of V. 

Let V be a complete variety, W an irreduc- 
ible subvariety of V of codimension 1, and P 
a point of W that is simple on V . Choose a 
system of local coordinates (fi). Given a dif- 
ferential form CO on V, we write it as a “poly- 
nomial” in the df^, and denote by v^{co} the 
minimum of the values of the coefficients for 
the tvaluation 0jy( ). The number ^^.(co) is 
determined by co and W, and it is independent 
of the choice of P and of the local coordinates. 
Then co defines a divisor (w) = Eif t»(^(co) W on 
V, which is called the divisor of a differential 
form (0, The divisor of a differential form of 
degree n (= dim V ) is called a canonical divisor 
and is usually denoted by K. The canonical 
divisors form a linear equivalence class of 
divisors. 



P. Albanese Variety, Picard Variety, Neron- 
Severi Group 

Let V be an irreducible variety. Then we can 
construct a couple (A,f ) consisting of an 
^Abelian variety called the Albanese variety of 
V and a rational mapping V +A (called an 
Albanese mapping) such that: (i) the image of 
f generates A, i.e., the sum off with itself n 
times, F: K"->A, is generically surjective for 
sufticiently large n ; (ii) for every rational map- 
ping g: B of V into an Abelian variety B, 
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there exist a homomorphism h \ A B and a 
point beB such that g — h-f+b. The Albanese 
variety is uniquely determined up to isomor- 
phisms and f is determined up to translations. 

In the case of A: = C, if V is a complete non- 
singular variety and if q is the dimension of the 
linear space of differential 1 -forms of the first 
kind on V, then the first tBetti number i?, is 
equal to 2q. Let m, , , be a basis of the 
linear space and let 7i , , y 2 q be a basis of 
the first homology group modulo torsion. Put 

= |y, u>; and ocj = (a^j, , Then the period 
vectors ccj (1 ^ 29) are linearly independent 

over R in C^. If F denotes the discrete sub- 
group of generated by the dj, then the quo- 
tient group C^F is the Albanese variety of 
V. The Albanese mapping is given by the map- 
ping (modF), where P 

is a variable point on V and Q is a fixed point 
on y (- 232 Kahler Manifolds C). 

Replacing the term “rational mapping” by 
“morphism” in the definition, we can define the 
strict Albanese variety of V and prove its 
existence. It is a quotient Abelian variety of 
the Albanese variety of V. If V is nonsingular, 
both coincide by virtue of the tstrong mini- 
mality of an Abelian variety (- Section I). 

Let y be a complete normal variety, U the 
set of the simple points of V, and D a divisor 
on y. Then D is said to be algebraically equiva- 
lent to 0 if there exist a nonsingular curvC C, 
a divisor F on 1/ x C, and two points P and 
Q on C such that D can be written as D = 

(p*(F) — (Pq(F), where (Pp and <Pq are the mor- 
phisms (pp'.U^U x P^V x C and (pq: 

U X Q-U X C. We denote by (5(K), 05a{K), 
and (S;(F) the set of all divisors on V, the set 
of divisors that are algebraically equivalent to 0, 
and the set of divisors that are linearly equiva- 
lent to 0, respectively. We can introduce a 
canonical structure of an Abelian variety into 
©a(F)/(5,(y), which is called the Picard variety 
of y. The dimension q of the Picard variety is 
called the number of irregularity of V; if = 0 
we say that Vis regular. 

The Albanese variety and the Picard variety 
of y are tisogeneous to each other, and each 
one is the Picard variety of the other. If Vis 
a curve, they are isomorphic and called the 
Jacobian variety (-^ 9 Algebraic Curves E). 

Using Cartier divisors instead of divisors, we 
get an analogous theory to construct another 
kind of Picard variety which turns out to be 
isomorphic to the Picard variety of the strict 
Albanese variety of V. The group of the linear 
equivalence classes of Cartier divisors can be 
identified with H^{V, (9*), where &* is the sheaf 
of multiplicative groups of the invertible ele- 
ments in 0,. From this point of view, we can 
generalize the theory of Picard variety to the 
case of schemes also. The theory thus obtained 
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is called the theory of Picard schemes [60]. 

The quotient group NS(V) = (5(y)/(Sg( V) is 
finitely generated [66] and is called the Neron- 
Severi group of V. We call the rank of NS(V) 
the Picard number of V and denote it by p(V). 
In the case of a nonsingular projective vari- 
ety over k = C we have an inequality p(V) ^ 
h^’UV) ( = dim, (V, and the Lefschetz 
number S 2 ( V) — p{ V) is a birational invariant 
(where B,(V) is the second Betti number of 
V) [39]. For the positive characteristic case, 
however, the above inequality does not hold 
in general [61]. 

The torsion part of NS(V) is (5^{ y)/©^( V), 
where ©„( V) denotes the group of divisors 
numerically equivalent to zero (- Section Q) 
(T. Matsusaka). The last fact cannot be gen- 
eralized for higher codimensional cycles [25]. 

Q. General Intersection Theory 

Fet ybe an irreducible variety of dimension n, 
and let A and B be irreducible subvarieties of 
y of dimension r and s, respectively. If C is a 
proper component of A fl B, we can define the 
intersection multiplicity i{A B, C; yj of A and 
B along C on V, which has properties consis- 
tent with our geometric intuitions. In partic- 
ular, it is invariant under biregular mappings. 
If A and B intersect properly on V and if 
c„ , c„ are the proper components of A FI 6, 
we define ar\ {r + s- n)-cycle A- Bhy A- B = 
i{A . B, Cj,; V) Q and call it the intersection 
product of A and B. If each component of 
an r-cycle X = and each component Ij, 

of an s-cycle Y = Z/j mp Y, intersect properly, 
we define 

X- Yp. 

Then we have the associative law (X yj ■ Z = 

A • ( y • Z) for cycles X, Y, and Z whenever 
both sides are defined. Two r-cycles and X 2 
are said to be numerically equivalent if for 
every (n — r)-cycle Y that intersects them pro- 
perly we have deg(X, Y) = deg(X2 Y). 

The theory of intersection is one of the most 
basis theories in algebraic geometry, for the 
other theories can be constructed from it [92]. 

R. Chow Rings 

Fet U and T be nonsingular irreducible vari- 
eties. If Z is a cycle on U x T such that Z • 

(U xt) is defined for every point { of T, we 
put Z -{U X t) = X{t)xt, and we obtain a 
family {X(t)} of cycles on (J parametrized by 
the points of T. Such a family is called an 
algebraic family of cycles. Two cycles and 
X 2 whose difference is equal to the difference 
of two cycles in an algebraic family are said to 
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be algebraically equivalent. In particular, if 
— X 2 can be expressed as the difference of two 
eyeles of an algebraie family parametrized by 
the points of the affine line, then X, and X 2 
are said to be rationally equivalent. The set of 
the eyeles that are algebraieally (rationally) 
equivalent to 0 is a subgroup 3a( f^)(3rat( U)) of 
J(U). For divisors, rational equivalence coin- 
cides with linear equivalence. 

Let /■; F-v L be a morphism between non- 
singular irreducible varieties. For an irre- 
ducible subvariety W of V, let W denote the 
closure of /( W), and put /j( IF) = 0 if dim W > 
dim W’ and /^(IF) = m W’ if dim W = dim W’, 
where m= [k(W): k( W’)] is the degree of the 
morphism lF-> W’. Extending by linearity, 
we obtain a module homomorphism 3( 
->3(t/).if/ is proper, then induces a 
module homomorphism from 3(f^)/3rat(f^) 
=A(V) to A(U). 

Let U and V be nonsingular irreducible 
projective varieties, and /: V+U he a mor- 
phism with graph F. If ye3( U) such that 
r • (F X Y) is defined, we denote by Y) the 
image of F- (Vx Y) under the induced isomor- 
phism F-> y. Each class of the rational equiva- 
lence class group A(U) = 3(f^)/3rat( U) contains 
a cycle for which f * is defined. Hence we can 
define f* : A(U)^A{V). Let A: x [/ be 

the diagonal morphism, and define x • y = 

A*(x X y) for X, ye A(U). Then A(U) is a ring 
with respect to this product; moreover, it is a 
■•■graded ring with the grading by codimension. 
This graded ring A(U) is called the Chow ring 
of U, and the mappingf*: A(U)+ A( V) is a 
ring homomorphism. If f is proper, we have 
/*W) =/*{>’) -^ for xeA(U), yeA{V). 

Zero Cycles. Let y be a nonsingular irreduc- 
ible projective variety over an uncountable 
algebraically closed field of characteristic 
zero (say C). Denote by A,/yj the group of 
classes of 0-cycles of degree 0 on y modulo 
rational equivalence. We say A,(V) is finite- 
dimensional if the mapping V” x y”-i-A,(yj 
sending (u 1 , . . . , h 1 , . . . , h„) to S Uj - S h; is 
surjective for a certain n. 

If A,(yj is finite-dimensional, then the 
canonical mapping A,( y)->Alb( V) induced 
from y~>Alb( V) is bijective (A. A. Rojtman). 

In general, the torsion part of A,(yj is isomor- 
phic to that of Alb( V) (Rojtman [75]). If 
/jP.O( yj > 0 for some p > 1, then A,(yj is not 
finite-dimensional (Mumford, Rojtman [74]). 
If Vis a surface which is not of general type 
and with h^’^{ V) = 0, then Aq(K) is isomorphic 
to Alb( V) (S. Bloch, A. S. Kas, D. Lieberman 
[13]). There exist surfaces of general type with 
=0 such that A,(yj =0. Lor instance, 
Godeaux surfaces are such surfaces (H. Inose 

[ 43 ]). 



One Cycles. Let y be a nonsingular projective 
irreducible variety and N(V) denote {numer- 
ical equivalence classes of 1-cycles on V} 

■Via the intersection pairing, N(V) is dual to 
the Neron-Severi group tensored by R (- 
Section P); and let NECV) denote the smallest 
cone in NCV) containing all effective 1-cycles. 
S. Mori studied the structure of NE(V) in 
detail. One of his results is stated as follows: If 
the anticanonical divisor is ample, then there 
exist rational irreducible CUrves , , If SUCh 
that (1) -{l..Ky)<n+ 1 and (2) NE(V)= 

R+ [^] "F • • • + R+ [^r]> where n = dim V,Kyis 
the canonical divisor on V, and [/J denotes 
the class represented by 1,. Moreover, Mori 
has proved that (1) if Kyis not numerically 
effective, e.g., if — Ky is ample, then V con- 
tains a rational curve, and (2) if the tangent 
bundle of V is ■•■ample, then Vis isomorphic 
to the projective space [59]. (This is called 
the Hartshorne conjecture.) 



S. Chow Coordinates, Hilbert Schemes 

Consider an irreducible algebraic correspon- 
dence T between irreducible varieties V and 
W. Let V’ and W’ be the closed images of the 
projections of T to V and W, and let a and c 
be their dimensions, respectively. Take generic 
points P, Q of V’, W’, and consider the total 
transforms r(P} of P to W and T(Q} of Q 
to y. Denoting the dimensions of T( Pj and 
T( QI hy b and d, respectively, we have a - 1 - 
b = C + d, where both sides are equal to the 
dimension of T. This property is called the 
principle of counting constants. 

This simple principle has wide application. 
Lor instance, let V be an r-dimensional variety 
in P”(k), and let Sj = 0 (0 ^ i ^ r) be the 
equations of r -f- 1 hyperplanes The con- 
dition VC\ {Hq C\ r\ H^) ^ 0 defines an irreduc- 
ible algebraic correspondence T between V 
and the multiprojective space W = E’“(k) x x 
P”(k) with u^J as coordinates: T=((x, u)|xeK, 
Z UijXj = 0}. In this case, we have a = r,b = 

(n l)(r + 1), and d =0 in the notation intro- 
duced previously, so that c = n{r +1)— 1. This 
implies that W’ is of codimension 1 in W; 
hence W’ is defined by a single equation P(Uy) 
= 0. This form F is the associated form of V of 
B. L. van der Waerden and W. L. Chow. It is 
a homogeneous form of degree d (d = deg( yj) 
in each (u,o, , uj and is symmetric in the 
indices i. More generally, for a positive cycle 
X = Z of dimension r and of degree d 
( = Z deg( yj) in P"(k), the product H F"“ of 
the associated forms of is called the as- 
sociated form of X. The coefficients of F, 
arranged in a fixed order and regarded as the 
homogeneous coordinates of a point of a pro- 
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jective space, are called the Chow coordinates 
of X. This is a natural generalization of the 
iPliicker coordinates. Given r, d and a projec- 
tive variety C/( e P”(k)), the set of Chow coor- 
dinates of the positive cycles that are con- 
tained in V, whose dimension is r and whose 
degree is d, is a projective variety called a 
Chow variety. 

In scheme theory, the Hilbert scheme is 
introduced in the following way [60]. Let P£ 
be the n-dimensional projective space over 
Spec Z. For a locally Noetherian scheme S, 
and for a closed subscheme Z of x S, we 
have the composition of the immersion of Z 
and the projection P^ x 5 to S; this is denoted 
by For any point s of S, f£^{s) is 

a closed subscheme of and the restric- 
tion of 6{\) to /2 ^(s) denoted by Lj. Put 
.^(5) = {Z c P^ X S} 1/2 is flat}. Then M be- 
comes a contravariant functor to the cate- 
gory of sets, which is representable, i.e., there 
exists a locally Noetherian scheme M such 
that ^ is naturally isomorphic to the functor 
Hom( — , M). M is written as Hilb(P^) and 
is called the Hilhert scheme. There exists a 
closed subscheme IT of P^ x Hilb(P2) with flat 
such that for any ZeM{S), there exists a 
unique (p: S->M = HilbjP^) in such a way that 
Z = IT X ^S. In particular, if X is a closed sub- 
scheme of P^ over a field k , there exist se 
Hilb(P^) and a field extension k/k{s) such that 
X = /rr/s) ® k{s)k- Thus Hilb(Pz) parametrizes 
all closed subschemes of P^. Fix a polynomial 
P and define to be {ZeMiS)\x(Lf’") 

= P(m) for all s}. J{^ is also a contravariant 
functor, represented by a scheme Hilb^(Pz) 
which is projective over Spec Z. The direct 
sum of all Hilb^(P 2 ) is just HilblP^). 



T. Algebraic Geometry and Complex Analytic 
Geometry 

When C = C, an algebraic variety is called a 
complex algebraic variety, and it has the struc- 
ture of a tcomplex analytic manifold or (if it 
has singular points) of an tanalytic space. If we 
denote by ^ the ring of holomorphic func- 
tions at a point x of V, then (9^ ^ c and 
their tcompletions coincide. If x is a simple 
point of V, then ^ is the tring of convergent 
power series, and its completion is the Iring of 
formal power series. The prime ideals of 
remain prime under completion (M. Nagata). 
Because of this, the analytic behavior of Vin a 
neighborhood of x can be investigated alge- 
braically through the completion of (Py^^- 
If Vis complete, then the analytic coher- 
ent sheaves on V and the algebraic coherent 
sheaves on V correspond to each other bijec- 
tively; consequently, for propositions that can 
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be stated in terms of coherent sheaves, the 
results in the analytic sense remain valid in the 
algebraic sense also, and vice versa (J.-P. Serre 
[82]) (- 72 Complex Manifolds E). 

U. Topology of Algebraic Varieties 

Every algebraic variety defined over R (or C) 
can be triangulated by real analytic cells [46]. 
Eet y be a nonsingular connected algebraic 
variety defined over C. Eor an algebraic auto- 
morphism (7 of C we can define F'" by letting 
a operate on the coefficients of the defming 
equations of open affine coverings of V. V and 
are not necessarily homeomorphic or even 
of the samehomotopy type [83].Grothen- 
dieck has shown that there is a tspectral se- 
quence V, Q^)=>//'’''’^( V, C), called a 

Hodge spectral sequence. This spectral se- 
quence can be defined in a purely algebraic 
way and V, C) can be considered as the 
thypercohomology of the de Rham complex 
{ F( y Q'^.), d} of y. If y is projective, V carries a 
Kahler metric and by the theory of tharmonic 
integrals the Hodge spectral sequence degener- 
ates and is C-isomorphic to the complex 
conjugate of (- 232 Kahler Manifolds B). 
This is also the case if V is complete. 

The topology of a nonsingular projective 
surface was studied by Eefschetz using the 
method of Eefschetz pencils. Eor a projective 
nonsingular irreducible variety V of dimension 
n defined over C, a Eefschetz pencil { Ililtep' of 
y is, by definition, a linear pencil consisting of 
hyperplane sections Wj of y such that: (i) for 
a\\ t e U = Pq — {t tj, , tj}, W, is nonsingular; 
(ii) each W,. has only one singular point that is 
an ordinary double point; and (iii) Wq D 
is nonsingular, where we assume 0 , 00 G C7. 
Embed V into by a high multiple of a 
hyperplane of V and take a generic linear 
pencil {//,} of hyperplanes in P'^. Then {W,= 

Hf n y/ is a Eefschetz pencil of V. By blowing 
up y along Wq H W^ we obtain a smooth 
variety V and a surjective morphism n: F->P'. 
Eet W—n ^(0), 7iy = n\n^^{U). R‘’nu*Q is a 
local system attached to the monodromy 
representation (pp:7ii{U,0)->^GL{H'’{W,Q)). (Pp 
is trivial if p^n — I - Eor each point t ; there 
corresponds a cocycle diof (W,Q) called a 
vanishing cocycle such that if y; is a loop based 
at 0 going once (counterclockwise) around t,-, 
we have, for each xe 

X ± (x, Sfydi, where ( ) is the intersection 
pairing of //""*( IT, Q). (Ppiyd is called a Picard- 
Lefschetz transformation. The main results due 
to Eefschetz are restated as follows. (1) (Weak 
Lefschetz theorem). The natural homomor- 
phism H'{ V, W, Q) is an isomor- 

phism for 0 ^ i ^ n — 2 and is an injection for 
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i = n " 1, or equivalently Hi{V,W, Q) = 0 for 
0 ^ ^ M ~ 1* (2) (Strong Lefschetz theorem). Let 
be the cohomology class of V, Q) corre- 
sponding to the hyperplane section and let 
L:H*(V,Q)^H*^^{V,Q) be the homomor- 
phism detined by the cup product with Then 
foreach/^tl,L"^^: V, Q) is an 

isomorphism. The weak Lefschetz theorem is 
true for a cohomology with integral coeffi- 
cients. In fact, V-W has the homotopy type 
of a real n-dimensional finite CW complex, 
and 7 t,( V, W) = 0 for Y < n. The strong Lefschetz 
theorem is equivalent to the statement that 
fV, Q) is the direct sum of the vector 
space spanned by the vanishing cocycles 3^ 
and the vector space spanned by the invar- 
iant cocyles (i.e., = 1, ,d). 

Lefschetz’ s original proof of this statement is 
incomplete, and no direct topological proof is 
known. The transcendental proof of (2) is 
given by the theory of harmonic integrals. A 
version of Lefschetz pencils is a proper mor- 
phism /: A ^£)={z||z|<e} of a complex mani- 
fold X onto a disk D such that /* 

0* — D — {0} , is of maximal rank at every point 
of /“^(D*). Fix a point seD*. oper- 
ates on H^{ W, Z), W and we have a 

representation (Pj:n,{D*,s)^GLiHj{W,Z)). 

For a loop y based at s and going once around 
0 the Picard-Lefschetz transformation (Pj{y) is 
quasiunipotent (i.e., for a certain integer m, 
(Pj{y)"' is unipotent). 

For a nonsingular projective variety detined 
over a field k with characteristic p > 0, the 
above Lefschetz theorems hold for an /-adic 
cohomology (l^p) [19, 30 (SGA 7)] (- Sec- 
tion AA). Using the theory of finite etale cover- 
ings of an algebraic variety detined over a field 
k, we can define the algebraic fundamental 
group and the algebraic homotopy groups, 
which are profinite completions of the topo- 
logical fundamental group and the topological 
homotopy groups, respectively, where k = C 
[5, 30 (SGA 1)]. Let (X,x) be a germ of a 
complex space with isolated singular point x. 
(X,x) is always algebraizable, i.e., the com- 
pletion 0^ ^ of the analytic local ring (9x,x is 
isomorphic to the completion of the local ring 
of a closed point of an algebraic variety de- 
fined over C [3], For topological type of al- 
gebraic surfaces -»■ Moishezon [57]. 



V. Hodge Theory 

Let //jj be a finite-dimensional real vector 
space containing a lattice and let H = 
Hr 0rC be its complexification. A Hodge 
structure of weight m on H (or Hr) is, by de- 
finition, a direct sum decomposion H = 

H‘^ P^ where H'’’'' is a 



complex vector subspace and the overbar de- 
notes complex conjugation. If H and H' carry 
Hodges structures of weight m and m’, respec- 
tively, then 0 Homc(H, fi'j, /\^H, and 
H* carry Hodge structures of weight m -h m’, 
m' — m, pm, and —m, respectively. For a Hodge 
structure H ofweight m, f^H= 0t^pH'‘’'"~'‘, 
p = 0, 1 , . , induces a decreasing filtration. 

Let H be a Hodge structure of weight m, and 
let Q be a bilinear form on H . If the follow- 
ing three conditions are satisfied, the Hodge 
structure H is said to be polarized by Q. (i) Q 
is detined over Q and is symmetric (skew- 
symmetric) if m is even (odd), (ii) Q HP ■«) 

= 0 unless p = p',q = q'. (hi) ( v) > 

0 for nonzero VE H^’^. Let V be a compact 
Kahler manifold. Then H = H”{ V, C) carries 
the Hodge structure induced by the type (p, q)- 
decomposition (^232 Kahler manifolds B). 
This is also the case if V is a compact complex 
manifold which is the image of a holomorphic 
mapping from a compact Kahler manifold of 
the same dimension. Moreover, if Vis projec- 
tive, the Hodge-Riemann bilinear relations 
define a natural polarization on the subspace 
P of H consisting of all primitive cohomology 
classes. 

Each algebraic cycle W of codimension 
s determines a cohomology class [W] e 
H^®( V, Q), which belongs to H*’*( V) . (Such 
a class is called an algebraic cycle.) The con- 
verse of this fact is called the Hodge conjecture, 
which says that H^*( V, Q)n H*’'*(K) is spanned 
by algebraic cycles. The case s = 1 has been 
verified by Hodge., Lefschetz, and Kodaira. 

Let V(W) be a smooth irreducible algebraic 
variety detined OVCr C (complex manifold), and 
let (p; IV be a projective smooth morphism 
with connected fibers. Then H= C is a 

flat vector bundle on W with the flat connec- 
tion V. V is often called the Gauss-Manin 
connection, and it can be detined algebraically 
if W is also algebraic. 

For each fiber K = 9 ’ (■^X se W, the filtra- 
tion FPH'"(K,, C)=0^^pH'‘-'”-'‘( F,) induces a 
complex subbundle F'’ and the connection V 
has the property V(tP(F^)) c fP(F^"'0T*), 
where T is the tangent bundle of W [26]. 
Moreover, if W is algebraic, V is a differential 
equation with regular singular points on W 
where PV is a smooth compactification of W, 
such that W ^ W is a divisor with normal 
crossings (-^ Section L). If we consider the 
subbundle P of H consisting of all primitive 
cohomology classes, the polarization on each 
fiber induCCS a Hermitian pseudometric on 
P. Curvatures of bundles P Fl F'’ have been 
studied by Griffiths [26]. There exists a classi- 
fying space D for polarized Hodge structures 
and there exists a holomorphic mapping of the 
universal covering IF of PV into D, usually 
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called a period mapping. D may not be a 
bounded symmetric domain but has several 
interesting properties [24,27]. In some cases 
D/r with a suitable discrete subgroup Y is the 
moduli space of polarized algebraic varieties 
(e.g., curves. Abelian varieties). 

P. Deligne [18] has generalized Hodge 
theory to arbitrary algebraic varieties (more 
generally schemes of finite type over C). The 
simplest case is the Hodge theory of a smooth 
noncomplete irreducible variety X. By Na- 
gata’s embedding theorem [63] there exists a 
complete algebraic variety X such that Y = 

X-X is a subvariety. By virtue of Hironaka’s 
resolution theorem we can assume that X 
is nonsingular and that Y is a divisor with 
normal crossings. Let Q|(log Y) be a sheaf 
of germs of meromorphic 1 -forms with loga- 
rithmic pole along Y, i.e., locally written as 
aj{x)dxj, where 

(xj, , x„) is a system of local coordinates with 
center peY in X such that x x^ = 0 is a local 
equation of Y and «i(x), aj(x) are holomor- 
phic at p. Using the complex {QJ(log Y)= 
/\^flj(log Y), d}, with a suitable filtration, 
Deligne has shown that H = C) carries 
a mixed Hodge structure and this structure is 
independent of the choice of X. The mixed 
Hodge structure on H consists of two finite 
filtrations, i.e., Oc.. c W„ ^ c H , the 

weight filtration which is detined over Q, and 0 
c. cT'’c:F'’'^'c.cH, the Hodge filtration 
such that induces on W„/W„^i a Hodge 
structure of weight n. As a corollary he has 
shown that a meromorphic p-form on X with 
logarithmic pole along Y (i.e., a section of 
n^(iogY) is (i-closed on X, and m = 0 if and 
only if is zero in H’’{X, C). An important 
application of the theory of this mixed Hodge 
structure on C) is the following. Let 

V and W be smooth irreducible varieties, 
and let W be a smooth projective mor- 

phism. If L is a smooth compactification of 

V, the canonical homomorphism H'”(V, Q)-» 

W, h=Q) surjective. Fix a point se 

W. 71 1 (W, s) operates on //'"( 1^,Q). Then this 
action is semisimple [18]. 

W. Deformations, Moduli, Algebraic Spaces 

In this section for simplicity the field k is as- 
sumed to be algebraically closed. Let X be an 
algebraic scheme over k. A (flat) deformation of 
X over a connected scheme S over k with base 
point Sq consists of the following data: (1) A 
morphism p:X-^S that is flat and of finite 
type. If X is complete, p is also proper. (2) A 
closed point Sg e S such that the fiber x ^ /c(sq) 
is isomorphic to X. For any closed point s e S, 
the liber X,=X k(s) is called a flat deforma- 
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tion of X. If X is smooth and complete, we 
assume further that p is smooth. Similarly, we 
can define a deformation of a polarized alge- 
braic manifold, an embedding deformation 
of X in an algebraic scheme Y over k, a de- 
formation of an affine scheme with isolated 
singular points, and a deformation of vector 
bundles on a fixed algebraic scheme over k, 
etc. The theory has two aspects: local theory 
and global theory. 

Let (R, m) be a complete Noetherian local 
ring such that R/m = k. Set = i?/m". A for- 
mal deformation X^ of X is a sequence {X„} 
such that (i) is a deformation of X over 
Spec(K„) and (ii) there is a compatible se- 
quence of isomorphisms X„ 0 r^R„-i 
for any n. Let (FLA/k) be the category of 
finite-dimensional commutative local k- 
algebras. The local theory of deformation is 
the study of the covariant functor f of (FLA//c) 
to (Set), where, for zle(FLA/k), F(A) is the set 
of isomorphism classes of deformations of X 
over Spec (A). The functor is in general neither 
representable nor prorepresentable (i.e., there 
exists a formal deformation X^ of X such that 
F(A) = Hom^.jjjg(R, A)). But, under reasonably 
mild conditions on F, F has the hull R [78]. 
That is, there is a formal deformation Xr of X 
and a natural transformation j: G-F, where 
G(A) = Hom^.^jg(K, A), such that j is for- 
mally smooth (i.e., for any surjection A’-i- A in 
(FLA/k), G{A’)^G{A) X p^^^F{A') is surjective) 
and G(k[fi])^F(k[e]) is bijective for the ring 
of dual numbers k[a]. The formal deformation 
Xr is called a versal deformation of X. The 
hull R is unique up to noncanonical isomor- 
phism. The deformation functor F has the hull 
R if X is (i) a complete algebraic scheme over 
k, (ii) an affine scheme with isolated singular 
points, (iii) a polarized algebraic variety over 
k, or (iv) a vector bundle on a complete alge- 
braic scheme over k, etc. If there exists a de- 
formation Tt; of X over a scheme S 
with base point over k such that R = 

^ 3 0 K„, the formal deformation {n„ : 
X„->Spec(/?„)} is called algebraizable. Alge- 
braizability of the versal deformation has been 
studied by M. Artin. Since the assumption that 
S is a scheme is rather restrictive, Artin has 
introduced the notion of algebraic spaces and 
has considered algebraizability in the category 
of algebraic spaces [2-4]. For a complete 
algebraic variety, the versal deformation is not 
necessarily algebraizable and we need to con- 
sider deformations of polarized algebraic 
varieties. The versal deformation of an affine 
variety with an isolated singularity is alge- 
braizable in the category of algebraic spaces. 
For the global theory of deformations, we 
need the projectivity assumption, and the 
theory is essentially reduced to the theory of 




16 W 

Algebraic Varieties 



82 



Hilbert schemes (or Chow varieties) (->■ Sec- 
tion S). The problem of moduli is considered 
as the study of the set A4 of all isomorphism 
classes of deformations of X. Usually we con- 
sider the moduli of polarized varieties. 

Let (Sch) be the category of Noetherian 
schemes and Ji a contravariant functor from 
(Sch) to (Set) delined by = {isomor- 
phism classes of fa mi lies of polarized varieties 
parametrized by X eOb(Sch), e.g., families 
of polarized Abelian varieties with(out) addi- 
tional conditions } . The functor .M is called 
a moduli functor. If it is represented by a 
scheme M, M is called a fine moduli scheme 
[60]. In this case there is the universal family 
n-.i^ ->M. In many cases the moduli functor 
is not representable. A coarse moduli scheme 
for a given moduli problem is a scheme A4 
together with a natural transformation 

-,M)such that (1) (;^i(Spec(/c)) : 
M(Spec(/c))^Hom{Spec(fc), M) is bijective for 
any algebrically closed field k, and (2) for 
any scheme N and any natural transformation 
,N), there is a unique natural 
transformation X'. Hom( — , M)->Hom( — , N) 
with ijj = X- (j). A coarse moduli scheme is called 
a moduli space or a moduli scheme. 

In many cases, the moduli space can be 
obtained as the quotient space of a certain 
(locally closed) subset H of a Hilbert scheme 
by the following equivalence relation: s-s’ eH 
if and only if as polarized varieties, 

where 7 t : ^ H , jl ~*{s) = X, (T. Matsusaka 
[53]). This equivalence relation is often in- 
duced by an action of a Ireductive algebraic 
group G. Suppose that a reductive algebraic 
group G operates on an algebraic k-scheme Z. 
A G-invariant morphism f Y (i.e., for the 
trivial action of G on Y, f is a G-equivariant 
morphism) is called a geometric quotient if (1) 
/is a surjective affine morphism and/^jd^'^)^ 

= &Y, (2) if X is a G-stable closed subset of Z, 
then f{X) is closed in Y, and (3) for x j , X 2 e Z, 
f(^i) = fi^i ) if if G-orbits of 

and X 2 are the same. Let G be a reductive 
algebraic group, x: G->^Aut( V) a rational rep- 
resentation on a finite-dimensional vector 
space V over k, and a G-invariant point. 

Then there exists a G-invariant homogeneous 
polynomial F of degree ^ 1 on K such that 
F{Vq) ^ 0 (W. J. Haboush [32]). This implies 
that if a reductive group G operates on an 
algebraic k-affine scheme Spec(/4), then the 
invariant ring is a finitely generated k- 
algebra and, moreover, if any G-orbit in 
Spec(/1) is closed, the natural morphism 
Spec(/1)-+Spec(zl‘^) is a geometric quotient. 

For a quasiprojective scheme Z over k with 
an action of a reductive group G we need a 
notion of stable points [60, 62]. The subset 
of stable points of Z consists of all geometric 



points of a G-stable open subscheme of Z, 
and there exists a geometric quotient f\Z^^Y 
where Y is quasiprojective (Mumford [60], C. 

S. Seshadri [79], Haboush [32]). In this way 
Mumford has shown the existence of coarse 
moduli schemes of nonsingular complete 
irreducible algebraic curves and polarized 
Abelian varieties. But, in general, analysis of 
stable points is very difficult and it is desirable 
to extend the category of schemes so that it 
becomes easier to obtain a quotient. MatSU- 
saka has introduced the notion of a Q-variety 
[52]. M. Artin has introduced the notion of an 
algebraic space (D. Knutson [49]). 

An algebraic space X of finite type consists 
of an affine scheme U and a closed subscheme 
R oUx if such that (1) R is an tequi valence 
relation, and (2) the projections p,: R+U (i = 
1,2) are etale. (These are often written as 
U->X.) A morphism g: F->X of an affine 
scheme F to an algebraic space X consists of 
a closed subscheme Wc U x V such that (1) 
the projection IF-)- F is etale and surjective, 
and (2) the two closed subschemes R x u W, 
IFx^IFof UxUx Fareequal. LetSr^F^Y 
be an algebraic space. Then Hom( Y, X) is de- 
fined as the kernel of HomfF, X)=tHom(S, X). 
If Y is an affine scheme, this definition of 
Hom( Y, X) is equivalent to the previous defi- 
nition by virtue of the etaie descent. Thus 
algebraic spaces form a category which con- 
tains the category of schemes. We can define 
the structure sheaf of an algebraic space and 
construct a cohomology theory. Many impor- 
tant notions and theorems for schemes can 
be generalized to those for algebraic spaces. 
Every algebraic space has a dense open subset 
that is an affine scheme. A group algebraic 
space is a group scheme (J. P. Murre). Suppose 
k = C. If an algebraic group G operates on an 
algebraic k-scheme properly with a finite stabi- 
lizer group, the quotient space exists as an 
algebraic space. In this way H. Popp has 
shown the existence of moduli spaces of alge- 
braic surfaces of general type as algebraic 
spaces [70,71] (- 15 Algebraic Surfaces; also 
[54]). Moreover, every separated algebraic 
space X of finite type over C carries a natural 
structure X”” of an analytic space. If there 
exists a proper modification morphism / : X”” 
-)• Y of a separated algebraic space X onto an 
analytic space Y, then Y carries a structure of 
an algebraic space and / becomes a morphism 
of algebraic spaces (Artin [2]). For any alge- 
braically closed field k, Artin has introduced 
the notion of formal algebraic space and 
formal contraction and has obtained results 
similar to those for algebraic spaces [49]. 

An irreducible compact complex space X 
whose algebraic dimension (^72 Complex 
Manifolds F) is equal to dim X is called a 
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Moishezon space. As a corollary of the above 
theorem, any Moishezon space X carries a 
structure M of a compact algebraic space such 
that X ^ M”“. 

X. Formal Schemes 

Let A be a ring which we assume to be Noe- 
therian, for simplicity, and / an ideal of A. 
Taking | /"}„>q as a fundamental system of 
neighborhoods of 0, we can introduce a struc- 
ture of a topological ring into A called /-adic 
topology. The tcompletion of A with /-adic to- 
pology is isomorphic to the projective limit A 
= lim„>oA/7” (here A/F are regarded as dis- 
crete topological rings) and called the comple- 
tion of A along /. If A is Noetherian, then A is 
again Noetherian. There is a canonical con- 
tinuous homomorphism i: A-^A whose kernel 
comprises the zero divisors a with a — 1 £ / 
(intersection theorem of Krull; 284 Noeth- 
erian Rings B). If i is an isomorphism, we say 
A is complete with respect to I. The topology 
of A is the /-adic topology where I~ i{I)A and 
A is complete with respect to /. Take a Noeth- 
erian ring A complete with respect to / which 
we consider as an /-adic topological ring by 
identifying its completion along / with A. 
OnaC=F(/)cSpec(A) we can define a sheaf 
of topological rings 0^ by V{^{f), 0,) = 
^//^"^/ for ^(/) = £>(/) n I with 
fe A. We call (X, (9^) the format spectrum of A 
and write Spf(A). / is called a defining ideal of 
Spf(A). A (locally Noetherian) formal scheme is 
by definition a topological local ringed space 
which is locally isomorphic to a formal spec- 
trum (of a Noetherian ring). If we define mor- 
phisms between two formal schemes by those 
in the category of topological local ringed 
spaces, the formal schemes form a category. 

For two formal spectra Spf(A) and Spf(B) 
with deiining ideals / and J, respectively, the 
direct product Spf(A) x Spf(B) in the category 
of formal schemes is the formal spectrum of 
the completion of A 0 B along the image of 
I ® B + A® J. Similarly, we can construct a 
fiber product of formal schemes. A formal 
scheme 3E is called separated if the image of the 
diagonal morphism X is closcd 

Section D). 

For a Noetherian ring A with an ideal /, the 
formal spectrum Spf(A) (with a defining ideal 
/) is called the completion of Spec(A) along 
P’(I)- Similarly for a Noetherian scheme X and 
a closed subscheme X’ we can define the com- 
pletion of X along X’ . Every completion of 
a separated scheme is separated. For a coher- 
ent sheaf F on X one can define its completion 
Fix' along X’, which is again coherent (under 
the assumption that X is locally Noetherian). 



Thus we can develop a theory of formal 
schemes in a way similar to that of schemes, 
which we call “formal geometry” (for the more 
general definitions and further discussions see 
[2; 29, 1, III; 303). Roughly speaking, a function 
on is a formal Taylor series with respect 
to the direction normal to X’ whose coeffi- 
cients are regular functions on X’ . The method 
of formal completion enables us to introduce 
“analytic” or “intinitesimal” methods in alge- 
braic geometry. Among many important 
theorems, we state here the following two 
theorems. (1) The fundamental theorem of 
proper mapping: Let /: Y be a proper mor- 

phism of locally Noetherian schemes, Y’ a 
closed subscheme of Y, and X’ = X x y Y’ the 
inverse image of Y’. Denote the respective 
completions of X and Y along X’ and Y’ by 
X and Y, respectively. We have the induced 
proper morphism of formal schemes f:X-^ 

Y. Then we have canonical isomorphisms, 
{F%{F))iy- ^ R%{Fix), n^o, for every coher- 
ent, (P;y-Module F on X. This theorem can 
be applied to prove Zariski’s connectedness 
theorem: for a proper morphism f:X^Y of 
locally Noetherian schemes with f^{0x) = 
every fiber /“' (y) off is conneeted and non- 
empty for ye Y. (2) We use the same nota- 
tion as in (1) and assume, moreover, that Y = 
Spee(A) for a Noetherian ring A, complete 
with respect to an ideal /, and Y’ = V(I). Then 
the correspondence F -^Fix' gives an equiva- 
lence between the category of coherent &x~ 
Modules with proper support over Y and the 
category of coherent (P;f-Modules with proper 
support over Y. This theorem plays an impor- 
tant role in the theory of Reformations of 
algebraic varieties. 



Y. Algebraic Vector Bundles 

In this section a vector bundle is a locally free 
sheaf of finite constant rank (- Section E). A 
quotient sheaf of a vector bundle is called a 
quotient bundle if it is a vector bundle. A sub- 
sheaf F of a vector bundle £ is a subbundle 
when both F and E/F are vector bundles. A 
vector bundle is said to be indecomposable 
unless it is a direct sum of proper subbundles. 
Every vector bundle £ on P Ms a direct sum of 
line bundles, that is, £ ^ 0 (()(«;) (Grothen- 
dieck). This property characterizes in the 
category of nonsingular projective varieties. In 
fact, if X is a nonsingular projective variety 
with dim X > 0 and X ^ and if r is an in- 
teger with r ^ dim X, there are stable (see 
below), a fortiori, indecomposable vector 
bundles on X of rank r (J. Simonis and M. 
Maruyama). Vector bundles are closely related 
to subschemes of the base variety. Eet £ be a 
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vector bundle of rank 2 on a nonsingular 
quasiprojective variety X and 1) an tample 
invertible sheaf on X. For n»0 and general 
sGH%X,E{n]), E{n)js(9x = L® I with L a line 
bundle and / an ideal in which defines a 
smooth subscheme Y= (s)^ of codimension 2. 
When X is, for example, P” (n ^ 3) or affine, 
the converse holds. Let Y be a subscheme of X 
purely of codimension 2 and locally of com- 
plete intersection. If coj, ^ A4 (x) C)y for an inver- 
tible sheaf M on X, there is a vector bundle 
E of rank 2 and sgH*^{X, E) with (s)q = Y. 
Moreover, E is decomposable if and only if Y 
is globally a complete intersection. An Abelian 
surface can be embedded in P'^(C), and hence 
we have an indecomposable vector bundle of 
rank 2 on P''^(C) [40]. For n ^ 5, we have no 
examples of indecomposable vector bundles of 
rank 2 on P”(C) [36]. Every vector bundle on 
A” = k" is trivial (D. Quillen [72], A. Suslin). 
From this and the fact stated above several 
results can be deduced; every nonsingular 
curve in is set-theoretically a complete 
intersection (L. Szpiro). 

For a vector bundle F on a complete 
scheme, the following are equivalent: (i) for 
every coherent tP^-module F,F® S”(E) is 
generated by its global sections for n » 0, (ii) 
for every coherent sheaf F on X, H‘{X, F ® 
S”(E)) = 0 for all i > 0 and n » 0, (hi) the ttauto- 
logical line bundle 1) on P(E) is ample. A 
vector bundle having these properties is said 
to be ample. This is a generalization of the 
notion of ampleness of invertible sheaves 
(- Section E). The set of ample vector bundles 
is closed under several operations [34]. No 
good criterion such as Nakai-Moishezon’s 
for the ampleness of invertible sheaves (->■ 
Section E) is yet known. The tangent bundle 
of a nonsingular complete variety X is ample 
if and only if X ^ P" Section R; S. Mori 

[59]). 

Eet Y be a nonsingular projective variety 
over an algebraically closed field and Oyi 1) an 
ample invertible sheaf on Y. A coherent sheaf 
£ on Y is said to be stable (or, semistable) with 
respect to ^)j,( 1) if E is torsion- free and if for 
every coherent subsheaf F of E with 0 / E ^E, 
X{F{m))lr{F)<{ox ^)x{E{m))lr{E) for allw»0, 
where r(*) denotes the rank. If E is stable (or 
semistable) and locally free, it is called a stable 
(resp. semistable) vector bundle. Let f\ X^S be 
a ^’smooth, projective, geometrically integral 
morphism and (p^( 1) an f-ample invertible 
sheaf on X. For a numerical polynomial H 
and an S-scheme T, set Zx/s( ~ F is a 
coherent sheaf on Xj with the properties (a) 
and (b)}/ ~ ; (a) F is T-flat, (b) for every geo- 
metric fiber E, ^ E ^ is stable with re- 
spect to &xi 1) X, x(E,{m)) = H (m), and if 



£i = F 2 ® f*{E) with L an invertible sheaf on 
T, Fj ^x/s is ^ tcontravariant functor of 
the category of tlocally Noetherian S-schemes 
to the category of sets. If S is of tfinite type 
over a ■''universally Japanese ring A, Xx/s 
a coarse moduli scheme Mx/siH) (-^ Section 
W) and it is tlocally of finite type over S (E>. 
Mumford, Seshadri, D. Gieseker, Maruyama 
[23, 51]). The set of the classes of semistable 
sheaves under a suitable equivalence relation 
(S-equivalence) on geometric libers of X over S 
also has a coarse moduli scheme Mxjs{H) and 

III (H) is its open subscheme. Moreover, it is 
known that M „,(H ) is projective over S in 
some cases, for example, when A is a field of 
characteristic zero. When dimX/S= 1, 
or P^, the structures of M ,„(H ) have been 
extensively studied [ 11,69]. Theories of vector 
bundles are used in theoretical physics [9]. 

Z. Torus Embeddings 

Over an algebraically closed field ki a torus 
embedding, or a toric variety, is a normal 
scheme X locally of finite type over k on which 
an algebraic torus T acts with a dense open 
orbit isomorphic to T. Such X’s, as well as 
many of their algebrogeometric properties, can 
be described very simply in terms of cones in 
real affine spaces, as Demazure [21] first saw 
in the nonsingular case in connection with 
algebraic subgroups of the Cremona trans- 
formation group, and then as D. Mumford et 
al. [47] as well as K. Miyake and T. Oda [67] 
saw in the general case immediately after H. 
Sumihiro [SS] proved a basic theorem on 
linear algebraic group actions. 

The group N, written additively, of one- 
parameter subgroups of an r-dimensional 
algebraic torus T is a free Abelian group of 
rank r. A convex rational polyhedral cone G in 
Ar = R ® 2 ^ N with 0 as the vertex is the set of 
nonnegative linear combinations of a finite 
number of elements of N such that a fl ( —a) = 

{ O } . A subset T of (T is called a face, and is 
denoted by t < cr, if there exists a linear func- 
tional m on Nf^ having nonnegative values on 
G and T = {y6o-|m(y) = 0}. A fan A in Ar is a 
collection of such c’s satisfying the conditions 
(i) Abct, ( 7 >T=>A 3 Tand (ii) Abo", g'=>o> 
gC\g'< g’. 

Each O' 6 A gives rise to an affine torus em- 
bedding (7^ as follows: Eet M be the Z-module 
dual to N; hence A4 is the group of 'I'characters 
of r. For O’ 6 A, the set Mnd- = {meM|m(y)^0 
for all ye O’} is seen to be a linitely generated 
additive sub semigroup of M containing 0 
and generating M as a group. is then the 
spectrum of the semigroup algebra /c[M H d]. 
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Thanks to the condition (ii) above, the UJs 
can be naturally pasted together to produce 
a torus embedding X = U„. 

Every torus embedding is obtained in this 
way. Equivariant dominant morphisms be- 
tween torus embeddings can be described in 
terms of Z-linear maps between N’s which 
send a fan to another. 

Many algebrogeometric properties of X can 
be described in terms of A, e.g., X is nonsin- 
gular if and only if every cr e A can be spanned 
by a part of a Z-basis of N. X is complete 
(proper over k) if and only if the union of c’s in 
A coincides with N,. X has at worst cyclic 
quotient singularities if and only if every o' e A 
is simplicial, i.e., if it is spanned by R-linearly 
independent elements of N. An equivariant 
resolution of singularities of X exists and is 
obtained by a suitable subdivision of A (Mum- 
ford et al. [47] ). X has only rational singular- 
ities, hence is Cohen-Macaulay (M. Hochster 
[38] and Mumford et al. [47]). The set of f- 
orbits in X is in one-to-one correspondence 
with a 6 A. A reduced T-invariant subscheme Y 
of X is the union of T-orbits, and hence corre- 
sponds to a subset ^ of A. M.-N. Ishida [44] 
determined when Y is Cohen-Macaulay or 
Gorenstein in terms of the combinatorics of 27. 

A T-invariant Cartier divisor D on X cor- 
responds to a support function h, which is a 
continuous R-valued function on the union 
IJ, 7 sa ^ which is (i) positively homogeneous, 
i.e., h{Xy)^Xh{y)for />0 and yE[j„^^a, (ii) 
Z-valued on N fl ( g ) and (iii) linear 
when restricted to each ere A. Various prop- 
erties of the invertible sheaf {D){- Section 
E) can be described in terms of h. Eor in- 
stance, when X is complete, Ox{D) is +ample if 
and only if h is upper convex, i.e., h(y) -i- h{y') ^ 
h{y + y’), and moreover, A is the coarest fan 
with the property (iii) above. The cohomology 
of &x{D) can be calculated by means of h 
(Demazure [21] and V. Danilov [ 17]). 

A support function h, on the other hand, 
gives rise to a eonvex polyhedron in Mr with 
vertiees in M. in this way, certain aspects of 
the geometry of convex sets can be thought of 
as a part of the theory of projective varieties 
(R. Stanley [87], B. Teissier [89]). 

Mumford et al. [47] introduced a more gen- 
eral concept of toroidal embedding: A normal 
algebraic variety Y and a nonsingular Zariski 
open subset U such that Y z>U is formally 
isomorphic at each point to a torus embedding 
X D r. This concept has been used very effec- 
tively to prove important theorems: (1) Sys- 
tematic nice compactitications of arithmetic 
quotients of bounded symmetric domains (D. 
Mumford et al. [7], 1. Satake [77]). Y. Nami- 
kawa [65] worked out the details in the case 



of the Siegel upper planes, improving their 
earlier Satake compactifications studied by I. 
Satake, W. Baily, J.-I. Igusa, and A. Borel [12]. 
(2) Semistable reduction theorem (Mumford et 
al. [47] ). Eet f-.V—yChe a flat morphism from 
a nonsingular variety V to a nonsingular 
curve C in characteristic 0. After a finite base 
extension C'-y C and a modification V'^ 

VXc C, we can get f C whose singular 

fibers are reduced with only nonsingular com- 
ponents crossing normally with each other 
(- Section E). Without the reducedness 
requirement, the existence is derived from a 
result of Hironaka [37]. 

A A. Etale Topology 

Eet y be a category. We say that a Gro- 
thendieck topology on Y is given if, for each 
Se Ob(S), families of morphisms (covering 
families of S) are given and satisfy the follow- 
ing conditions: (1) If (p\T-yS is an isomor- 
phism, is a covering family, (2) if 

{(Pj-. is a covering family, for any 

morphism cp':S'^S, the fiber product R[ = 

R. X ^ S' exists and the induced family { : 

R--yS'j is a covering family of S’, (3) if {(pd 
is a covering family and if for each 
16 /, is a covering family, 

then is a covering 

family. In general, it is more convenient to 
use a notion of sieves to define a Grothen- 
dieck topology [30 (SGA 4)].) A category with 
Grothendieck topology is called a site. Eet X 
be a scheme and Et/Y the category whose 
objects consists of schemes etale over X. If we 
choose a family of morphisms 
with Ui,/ ^;( y;.) = Y, as a covering family of 
Y 6 Ob(Et/X), this defines the etale topology 
on X and the etale site Similarly, one can 
define the Zariski site (resp. the flat site 
Xfi), using open immersions (resp. flat mor- 
phisms locally of finite type). A presheaf on a 
site Y is a contravariant functor from Y to 
(Set). A presheaf f is a sheaf if, for any cover- 
ing family {(j>,:Ri^R}i^j, F{R)^Yl,eiF{Ri)z^ 
rii.je/ f (R; X nRj) is exact. If Y has a final ob- 
ject X, the functor F\—yF{X) is left exact on 
the category of Abelian sheaves on ^ and 
the cohomology groups FI • (X, F) are de- 
fined as the right derived functors. Using 
covering families, one Can define the Cech 
cohomology H ■ (X, F) as usual. In the follow- 
ing, a sheaf means an Abelian sheaf. Eor a 
■•■geometric point i:x-yX of a scheme, an etale 
neighborhood U of x consists of an etale mor- 
phism f\U-^X and a morphism j: x-y U such 
that f oj = i. Eor a sheaf F on X^p the stalk 
F^ at a geometric point x is defined by F^ = 
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lim r{ (7, F), U being the etale neighborhoods 
of X. On X^^ the sheaf is defined by G,(S)= 
r(S, for each SeOb(Et/A'). The kernel of 
the nth power homomorphism is 

denoted by The sequence 

G„^ 1 is exact in if p is prime to resid- 
ual characteristics of X, but not necessarily 
exact in There are canonical isomor- 

phisms Pic(X)= 

G,). For X = Spec(k), k a field, the 
etale cohomology theory of sheaves on is 
equivalent to the tGalois cohomology theory 
over k; hence H‘{X^^, F)^H‘(G, f^), where 
G = Ga\{k/k), k the tseparable closure of k and 
fjT is the stalk of F at the geometric point 
of X. For a morphism /: X -> Y of schemes 
and a sheaf f on the direct image sheaf 
F of F is defined by F(S) = F{X x j,S), 

SeOb(Et/Y), and higher direct-image sheaves 
R'f^F are detined by the right-derived functor. 
Let X be a separated scheme of linite type 
over a field k. By a theorem of Nagata [63], 
there exists a scheme X proper over k and 
an open immersion j: X->X For a torsion 
sheaf F on X^^, let j,F be a sheaf on X^^ ex- 
tended by 0 outside X. The cohomology 
with compact support H^{X^^, F) is defined 
by //«(!.„ F) = ff^(X„J-n This is inde- 
pendent of compactifications. Similarly, for a 
separated morphism /: X of finite type of 
schemes and a torsion sheaf F on X^i, one can 
define higher direct image sheaf with compact 
support R%F. For the etale topology, torsion 
sheaves are important. All torsion sheaves on 
X . on a Noetherian scheme X are inductive 
limits of constructible sheaves. A sheaf F on 
is called locally constructible (constant) if F 
is represented by an etale covering of X. A 
sheaf F on Xg, is called constructible if there 
exists a linite surjective family of subscheme X; 
of X such that the restriction of F to X,- is 
locally constructible (constant). This is equiva- 
lent to saying that every irreducible closed 
subscheme Z of X contains a nonempty open 
subscheme U such that the restriction F to L 
is locally constant and has finite stalks, i.e., 
there is a covering family such that 

F Ui is constant and the stalk is a finite 
Abelian group for all geometric points x of U. 
The cohomology of a torsion sheaf or a con- 
structible sheaf has properties similar to those 
of the classical cohomology. Let /: X^S be a 
proper morphism and F a torsion sheaf on 
X^(. Then the stalk R^f^F at a geometric 
point 5 of S is isomorphic to H‘^{X^ F), 

where X,=X x j Spec k(s). If / is a separated 
morphism of linite type of Noetherian schemes, 
a similar fact holds for the cohomology with 
compact support. Moreover, if F is a con- 
structible sheaf, R^f F is also constructible 
(finiteness theorem). For an affine scheme X of 



finite type over a separably closed field and F 
a torsion sheaf on X^j, one has //^(Xj,, F) =0 
for q > dim X. Let f ; X -^S be a separated 
morphism of schemes of finite type on C, Fa 
torsion sheaf on X^,. One has a canonical 
isomorphism (R‘^f Ff''2:iR^ff"F‘‘". In partic- 
ular, if X is proper over C, one has W{X^^, 
Z/{n))2^W{X‘‘”, Z/(n)). On the other hand, 
for a nonsingular complete curve C over an 
algebraically closed field, one always has 
//*(Q,, Z) = 0. For a geometric point s of S 
the strict localization of S at s is the ring 
^ = lim E( U, &u), U being etale neighbor- 
hoods of s. A geometric point t of S is called a 
generalization of s if t is detined by an alge- 
braic closure of the residue field of a point of 
Spec this case, s is called a special- 

ization of t. If / : X -^S is a proper smooth 
morphism the sheaf R^f^Z/{n) is locally con- 
structible (constant), and ifs is a specializa- 
tion of t one can define a cospecialization of 
H‘’{X^ ^^, Z/(n)) to //^(Xg gi, Z/(n)) which is 
bijective. Let X be a scheme of finite type over 
a field k. If / #ch(k) is prime, one can define the 
Fadic cohomology W{X, Q,) = lim„FF'(X,Z/ 
(!")) ®z,Q;-The /-adic cohomology has as 
many good properties as the classical co- 
homology (-^ Section U). 
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Algebroidal Functions 

A. General Remarks 

If an analytic function / satisfies an tjrreduc- 
ible algebraic equation 

Ao(z)/*^ + ^1 (z).r~‘ + • • • + Mz) = 0 ( 1 ) 

with single-valued ^meromorphic functions 
Aj{z) in a domain G in the complex z-plane, 
then / is called a /c-valued algebroidal function 
in G. With no loss of generality, we can as- 
sume that there is no common zero among the 
Aj{z) and that all the Afj) are tholomorphic in 
G. When k = 1, the solution of (1) is a single- 
valued meromorphic function in G. If all the 
A^{z) are polynomials, then / is an talgebraic 
function. Thus algebroidal functions can be 
regarded as extensions of single-valued to 
multiple-valued functions and also as exten- 
sions of algebraic to Transcendental functions. 
Since (1) is irreducible, its discriminant D(z) 
does not vanish identically. Lor all the points 
a satisfying D(a)^0, /lo(n)^0, neG, equation 
(1) determines k holomorphic function ele- 



ments /i(z), /fc(z) in a suitable neighborhood 
of a that determine the analytic function f. 
They can be tprolonged analytically in G in 
the wider sense. At any point satisfying A,(z) 

= 0, at least one element has a pole; and at any 
point satisfying D{z) = 0, there may appear 
ramified elements. Therefore an algebroidal 
function can be defined as a finitely multiple- 
valued analytic function in G with the excep- 
tion of poles and ^algebraic branch points. 
Every algebroidal function f(z) determines a 
tRiemann surface, which may be considered a 
tcovering surface 3 of G. This surface ak- 
sheeted '•'covering surface over G with no sin- 
gular point except for algebraic branch points. 
Also, f reduces to a single-valued meromor- 
phic function on 3 , and all the function ele- 
ments over a point z are different. A fc-valued 
algebroidal function can also be characterized 
by this property. 

These two (equivalent) delinitions of alge- 
broidal functions give rise to two distinct 
methods of studying these functions. In adopt- 
ing the first definition, we can make use of 
results in the single-valued case, as did G. 
Remoundos and G. Valiron. When the lat- 
ter definition is adopted, we can use several 
methods that are also applicable in the single- 
valued case, as did H. Selberg and E. Ullrich. 

Research on algebroidal functions has been 
carried out mainly for the case where G is the 
finite plane |z| < 00 or the unit disk |z| < 1. 
Almost all the known results for algebroidal 
functions are extensions of those on single- 
valued functions, but several results particu- 
larly relevant to algebroidal functions have 
been discovered. The existence of branch 
points makes it difficult to investigate alge- 
broidal functions in some cases. 



B. Absolute Value 

Among several results on the absolute values 
of algebroidal functions, the tmaximum prin- 
ciple, one of the basic principles, holds on the 
Riemann surface 3- The following relation 
holds among the Aj{z) and |/v(z)|. Assume that 
(1) has the form 

r + A,{z)f'‘-^ + ... + A,{z) = 0. (!') 

Then log( 1 -f /4(z))/log( 1 -|- F(z)) is bounded, 
where v4(z) = max|/13z)l and f(z) = max |/3z)|. 

An algebroidal function that has no pole 
in z I < 00 is called an entire algebroidal func- 
tion. The successive derivatives of an entire 
algebroidal function may have poles at every 
branch point, which is a departure from the 
case of single-valued integral functions. An 
algebroidal function defined by (1) or (!’) with 
entire functions /lj(z) and zero-free ,4„(z) is 
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entire. Dividing both sides of (1) by A,(z), we 
obtain equation (!'), and all its coefficients are 
entire. In equation (1'), the border, type, and 
class of an entire algebroidal function coincide 
with those of the largest Aj{z) (- 429 Tran- 
scendental Entire Functions). 

For an entire algebroidal function of order 
less than 1/2, F(z) tends to infinity uniformly 
along a sequence of concentric circles |z| 

(r„-> 00 ). However, it can be shown that not 
all the branches of f[z) necessarily tend to in- 
finity on the Riemann surface in this sense, 
IWiman’s theorem does not remain true. But, 
using min|j|=,.maxJ/v(z)|, one can obtain some 
extensions of the generalized Wiman theorem. 



C. Picard’s Theorem and Its Extension 

Remoundos first extended iPicard’s theorem 
and tBorel’s theorem to an algebroidal func- 
tion. Every k-valued transcendental alge- 
broidal function in the finite plane takes on 
every value inlinitely often with at most 2k 
exceptional values. There are examples where 
2k values are actually omitted. Hence the 
theorem is the best one possible in this sense. 

T. Varopoulos (Bull. Soc. Math. France, 53 
(1925), 23-34) introduced the degeneracy index 

z = dim{(co,Ci,...,c3eC*+Hco^o + ^^i^i + ••■ 

(0^ A ^ k — 1) into equation (1) and showed 
that the number of Picard’s exceptional values 
off is at most k -f A -|- 1 . There is no single- 
valued meromorphic function with A > 0, so 
this result is relevant only for algebroidal 
functions. Extending this idea further, J. Du- 
fresnoy and others obtained more precise 
results. The situation is the same for the Borel 
exceptional values. That is, the -FConvergence 
exponent of f(z) — w = 0 coincides with the 
'•'order off except for at most 2k values, for 
which the convergence exponents off are 
less than the order off (E. Borel). There are 
at most 2k polynomials P(z) for which f(z) — 
P(z) = 0 has at most a finite number of roots 
(Borel), and furthermore, using the degeneracy 
index, we can give more precise results. 

Selberg was the first to extend the tNevan- 
linna theory of meromorphic functions to 
algebroidal functions (-^ 272 Meromorphic 
Eunctions). Almost simultaneously, 'Valiron 
obtained the same results starting from the 
coefficients of (1); then Ullrich improved the 
results by considering the effect of branch 
points of 3- 

Eet 3r be the part of 3 over [z| < r, let n{r, w) 
be the number of roots of f(z) — w = 0 in 3r? 
and let n{r, 3) be the number of branch points 



of 3r- With these notations we write 



1 ^ dttn(^(0,w) 

N(r,w) — - (n(t, w) — n(0, w)) — I — log A 

kj 0 t k 

m{r, w) 



log 



|/(re‘^)-w| 



d(p, W^GO, 



T{r, w) = m{r, w) -i- N{r, w). 



Eet T(r,/) be the tlogarithmic integral of the 
spherical area of the image of 3, under w = 

.m. 



T{r,f) 



J 'dt rr 



t dt dip, 



and let N{r, 3) be the logarithmic integral of 
n{r, 3). Then we have T(r, w) = T(r,/) + 0(1) 
and the ramification theorem: 



N{r,3)<i2k-2)TirJ) + 0{\). 

Also, T(r, /) = O(logr) holds if and only if / is 
algebraic, Eet .4(z) be the maximum of \Aj{z)\, 
and let 



] 2rt 

^l{r)=^zr^ logA(re'’’)d(p. 

.ZrCTC Q 

I 

Then T{r, /) = p(r) -i- 0( 1). As the second funda- 
mental theorem we have 



X N{r,w,)>{q-2)T{r,f)-N{r,3) 

V = 1 

E Z ^1 (r. wj +0(logrT) 

v^l 

>{q-2k)T{r,f) 

+ Yu Niif,w3 + 0(\ogrT), 

V = 1 

where N, (r, w) is the logarithmic integral of 
Hi (a vv) which is the sum of the tmultiplicity 
minus one of all the roots of f(x) — w = 0 in 3r- 
Furthermore, the deficiency, ramification index 
of f(z), and ramification index of the surface 3 
are defined by 

6(w)= 1 — limsupA(r, w)/T(r,/) 

= lim inf m (r, w)/T(r, /), 

D (w) = lim inf (r, vv)/ T (r, f ), 

(^ = liminfA(r,3)/T(r,/). 

'With these notations, we have 

lS{w,)+lT>{«,H2 + U2k. 

These results contain the Picard theorem 
and the Borel theorem. Eurthermore, by con- 
sidering the effect of branch points, the Ahlfors 
theory of covering surfaces can be extended to 
algebroidal functions (Y. Tumura). By using 
this result, tBloch’s theorem can be obtained 
very simply. 
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But these results do not contain the Varo- 
poulos result. Taking into consideration the 
degeneracy index j,, H. Cartan proved the 
following inequality when /i = 0: 

v = l 

and for general A he conjectured that the 
number q-k — 1 may be changed to the num- 
ber q-k-l— 1. This conjecture is still un- 
solved except for some special cases where for 
example, q = k+ 1 + 2 (N. Toda, Nagoya Math. 
J.,91 (1983), 37-47). 

Interesting results follow upon a study of 
this degeneracy index. For example, when f is 
an entire algebroidal function, if Sv=i’ ^(w^) 

> 2k 2 (w,, ^oo), there are at least k ™ 1 
tPicard’s exceptional values in {w^,} (Niino, 
Ozawa, and Toda). 

With respect to the relation between the 
number of exceptional values and the order of 
fin |z| < 00 , there are some results similar to 
those for single-valued functions. For example, 
if / has k -I- 1 Picards exceptional values, the 
order off must be a positive integer or oo. 



D. Asymptotic Values and Other Results 

In the single-valued case, Valiron, L. Ahlfors, 
W. H. J. Fuchs, A. Edrei, W. K. Hayman, and 
others studied the 1'Borel direction, the number 
of lasymptotic values, the relation between the 
deficiency values and the asymptotic values, 
etc. However, almost none of the correspond- 
ing results holds for the algebroidal case as 
shown by several counterexamples. 

There is no relationship between the order 
of an entire algebroidal function and the num- 
ber of its finite tasymptotic values, which is 
quite different from the single-valued case. 
Furthermore, it is possible to have an infinite 
number of asymptotic values even if the order 
is equal to zero. If an algebroidal function f 
satisfies lim inf T{r, /)/(log rf < -l- oo, then it 
has at most k asymptotic values (Valiron- 
Tumura). The Ahlfors theorem, which is con- 
cerned with the number of tdirect transcen- 
dental singular points of the inverse function 
and the order of a meromorphic function in 
the single-valued case, was extended to the 
algebroidal function by Lii Yinian (Scientia 
Sinica, 23 (1980)). 

The t Julia direction or the Borel direction 
for an algebroidal function is defined not on 3 
but on |z| < cc because of the appearance of 
branch points, with respect to the Julia direc- 
tion, there are results similar to those for the 
single-valued case, but it is unknown in gen- 
eral whether the Borel direction exists. A. 
Rauch proved that if T{r,f)jr^*^ dr = cjo, 
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then there is a sector with an angle of at least 
njp in which L(^) = |*log'^f(re“^)/r'’‘^^dr 
diverges. Apart from the theory of distribution 
of values, Selberg obtained some conditions 
under which the inverse functions of t Abelian 
integrals of a special kind reduce to alge- 
broidal functions. 
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A. History 

The theory of almost periodic functions was 
originated by H. Bohr in 1924 as a result of his 
study of tDirichlet series. The theory provides 
a method of studying a wide class of trig- 
onometric series (-^ 159 Eourier Series) of 
general type. Eurther generalizations were 
made by N. Wiener, V. V. Stepanov, A. S. 
Besikovich, S. Bochner, and others. H. Weyl, 

J. von Neumann, and others clarified the rela- 
tions between this theory and trepresentation 
of groups, specifically, the relations between 
almost periodic functions in a ^topological 
group and representation theory of a tcompact 
group. 

B. Almost Periodic Functions in the Sense of 
Bohr 

Eet f(x) be a complex-valued continuous 
function defined for all real values of x. A 
number t is called a translation number of f(x) 
belonging to £ > 0 if 

sup \f{x + x)-f{x)\^E. 

-00<x<« 

If for any e> 0 there exists a number /(e) > 0 
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such that any interval of length 1(e) contains a 
translation number off belonging to e, then 
f(x) is called almost periodic in the sense of 
Bohr. We denote by B the set of all almost 
periodie funetions in the sense of Bohr. 

If f(x) is Aperiodic with a period p, then 
/eB, beeause each number I ^ p plays the role 
of /(fi) for any g > 0. Any f e B is bounded and 
uniformly eontinuous. A neeessary and suffi- 
cient eondition for a bounded eontinuous 
funetion on (—00 , 00 ) to belong to B is that for 
any given sequenee {/i„} of real numbers, there 
exist a subsequenee {h„ } such that the se- 
quenee of funetions { f(x -i- ) } is uniformly 

eonvergent in (— oo, oo); i.e., the set { f(x+ 
h) he{ —CO, oo)} is ttotally bounded with re- 
speet to the uniform norm || f || =sup /(x)| in 
the spaee of bounded eontinuous funetions in 
(-00, oo). 

If f(x) e B, then /( -x), /(x), af (x) (where 
a is a eomplex number), and f(x -|- h) (where 
h is a real n umber) eB. If f{x), ^(x)eB, then 
f{x)±g{x) and f{x)g{x)eB. ]ff„{x)eB and 
{f„{x)] eonverges uniformly to f{x), then f(x) 
eB. For any real number A, exp ux (where 
i is the timaginary unit) is eontinuous and 
periodie. Henee the polynomial funetion 
exp iA„x E B. Moreover, if the latter 
function converges uniformly to S a„exp j/l„x 
as m tends to oo, then the limit function 
is also an element of B. The polynomial 
1 exp i/l„x and the series ol„ exp j/^x 
are called a generalized trigonometric poly- 
nomial and a generalized trigonometric series, 
respectively. 

For any f eB, its mean exists: 






fa+r 



f{x) dx. 



The convergence of the right-hand formula 
is uniform in ae( — 00, oo), and the limit is 
independent of the choice of a. Thus M[/] 
is a tiinear functional defined on B. Since 
M[exp//ix]= 1 for A = 0 and =0 for 2.^0, the 
family {exp Ux -oo </t < oo} is an tortho- 
normal system with respect to the tinner prod- 
uct (f,g} = M[f(x)g(x)] defined on B. Let 
ot(/l) = M [ f (x)exp( Ux)] for any f e B; then 
there exist countably many values of 2. for 
which a(/i) differs from zero. Denote these 
values of 2. by • and write a(/i„) = 

a„. We call the numbers Xj, , a„, the 
Fourier coefficients of f(x). The formal series 
2^=1 oc„ exp U„x is called the Fourier series 
of f(x). Moreover, the Parseval equality 
M[|/(x)p] = |a„p is valid for any / eB. 
For every periodic function, these definitions 
coincide with the ordinary Fourier coefficients 
and the Fourier series (-^159 Fourier Series). 
Any almost periodic function in the sense of 
Bohr is uniquely determined by its Fourier 



coefficients; i.e., if two almost periodic func- 
tions have the same Fourier series, then they 
are identical. For any f e B, its Fourier series 
does not always converge uniformly, but 
f{x) e B can be approximated uniformly by a 
sequence of trigonometric polynomials. Hence 
the almost periodic functions in the sense of 
Bohr are also called uniformly almost periodic 
functions. 



C. Generalizations of Almost Periodic 
Functions 



Let C( —00, oo) be the space (- 168 Func- 
tion Spaces) of all bounded continuous func- 
tions on ( — 00, oo) with distance p( f, g) = 
SUP-oo<;c<® \f(x)-g{x)\. Then a uniformly 
almost periodic function is the limit of a se- 
quence of trigonometric polynomials with 
respect to this distance. Generally, let p be a 
tdistance function introduced in a function 
space (whose elements are not necessarily 
continuous in (—00, oo)). Then the limit of a 
sequence of generalized trigonometric poly- 
nomials with respect to the distance p is called 
an almost periodic function with respect to p. 
For example, for p ^ 1, we set 



Dsp\_f, SI = sup 

-co<a<oo 



■fl + 1 ■) 1/P 

\f(x)-gix)\Pdx\ 



DwpU,SI = 



lim sup 

l-»oo “co<a<oo U 



ra+l 



jf(x)-g(x)j^dx 



i/p 



These are distance functions. The properties of 
the corresponding almost periodic functions 
and their relations to other classes of almost 
periodic functions have been studied by Besi- 
kovich [1]. 



D. Analytic Almost Periodic Functions 

Let D be a strip domain, a < Re z < b, defined 
in a complex plane. For any tholomorphic 
function f(z) in D and 8 > 0, a real number t is 
called a translation number off belonging to 
e > 0 if sup^eo |/(z -I- n) -/(z)| If for any e > 
0 there exists a number 1(e) > 0 such that any 
interval of length 1(e) contains a translation 
number off belonging to e, then f(x) is called 
an analytic almost periodic function in D. We 
denote the set of all analytic almost periodic 
functions in B = {a < Rez < b} by A(a, b). If we 
fix an X in a < X < b, then g(y) = f (x + iy) for 
any /(z)eA(a, b) belongs to B. 

For any feA(a, b) there corresponds a 
tDirichlet series oc„ exp 2.„z such that two 
analytic almost periodic functions are identi- 
cal if the corresponding Dirichlet series are 
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identical. Here the coefficients 

a« = ^y[/(^ + '»exp(-ii, y)] 

are determined independently of x (a < jc < b), 
and Parseval’s identity 

00 

A^y[|/(x + iy)|^]= £ |a„|^exp2>a„x 

n-1 

holds (Bohr [3] ). If the series 
X C(„ exp Ji„x exp a^y 

at X = a and x = b represent the Fourier series 
of/,(y)and A(y)eB, respectively, then there 
exists /e A(a, b) such that f(z) is continuous on 
D and 

f{a + iy)=fa(y), f{b + iy)=ft,{y). 

The behavior of f e A(fl, b) at the boundary or 
exterior points of the domain D= {a <Ro z< 
b} has also been investigated by Besikovich 
[1]- 

E. Almost Periodic Functions on Groups 

Von Neumann defined almost periodic func- 
tions on any group, generalizing the charac- 
terization of uniformly almost periodic func- 
tions on ( — 00, oo). Let B(G) be the set of all 
complex-valued bounded functions on a group 
G. Then B(G) is a metric space with the tdis- 
tance p(/,^) = sup^^Gl/(x)-,^(x)|. If for any 
feB{G) the set A,= { f„ ,,(x)=f{axb)\a,beG] 
is totally bounded in the metric space B(G), we 
call / an almost periodic function on the group 
G. This condition is equivalent to the total 
boundedness of By = {/„(x) = f{xa) ae G} or 
{a/W=/(«^)keG}. We denote the set 
of almost periodic functions on G by j/(G). 

For f(x), g{x)e.^{G), the linear combina- 
tions a-/(x)-|- b-g{x) (a, beC) and the prod- 
uct f{x)g{x) are both contained in s/{G). If 
f„ E s/{G) and { /„} converges to f uniformly on 
G, then fEj^{G). If /e, (/(G), then fa,bJa,af^ 
d(G) also. Hence .(/(G) is a closed subalgebra 
invariant under two-sided translation in the 
■''Banach algebra i?(G). For any / e j/(G) there 
exists only one number M [/] in the closure 
(with respect to the distance p in B(G)) of 

= |.Z C; > 0, X Ci = 1 , a;, h; 6 g| 

(= the least closed Icon vex set including A,). 
We call M[/] the mean off on G. The map- 
ping /->M[/] is a linear functional on j/(G), 
and we have M [/] ^ 0 if 0. 

F. Relation to Bounded Representation 

Suppose that we are given a finite-dimensional 
matrix representation D (x) = (d,(x)) of a group 



G. Then the following three conditions are 
equivalent: (i) All the dy(x) are bounded on G. 
(ii) All the d,(x) are almost periodic in G. (iii) 
The representation jT) is tequivalent to a repre- 
sentation by unitary matrices. The inner prod- 
uct (/ g) = M[/(x)gf(x)] provides the algebra 
j^{G) with the structure of a tpre-Hilbert 
space. Let H(G) be the Hilbert space that is the 
completion of ,(/(G). If we select D”(x) = (d-)(.x)) 
from each L, where L is an equivalence class of 
bounded irreducible representations of G, and 
if rix is the order of D^, then { (l/y^)d,j(x) 1 1 ^ 
i, j ^ z 6 L} is a tcomplete orthonormal sys- 

tem in the Hilbert space H(G). Any f(x) e 
j/(G) can be approximated uniformly in G 
by a finite linear combination of the dy(x). 

G. Almost Periodic Functions on Topological 
Groups 

When G is a tseparated topological group, we 
denote the set of all continuous functions on G 
contained in j/(G) by s//^(G). The statements 
of the theorems in the previous section con- 
cerning j/(G) and the representation D remain 
valid if we replace j/(G) by d,(G) and replace 
B by a continuous representation of G. In 
particular, if G is the additive group of real 
numbers R, then d,(R) is exactly B. 

H. Relation to Compact Groups 

Every continuous function on a compact 
group G is almost periodic; i.e., j/,^(G) = C(G). 
The mean value M[/] of /€,(/,^(G) is identical 
to Jg/(x)dx, where the tHaar measure dx is 
normali z ed so that dx = 1 . In this case, the 
theory of bounded representations discussed 
above is the Peter- Weyl theory (- 69 Com- 
pact Groups). 

In general, let G be a separated topological 
group. There exists a continuous homomor- 
phism (p of G onto a compact group K = K(G) 
with the following two properties: (i) For any 
compact group K' and a continuous homo- 
morphism (p'\ G^K', there exists a continuous 
homomorphism i// : K K' such that (p'=ij/o 
(p. (ii) Such a pair K =(K, cp) is unique up to 
isomorphism. K is called the Bohr compactifi- 
cation of G, and (p is called the canonical map- 
ping. In particular, suppose that G is a locally 
compact Abelian group and G* is its ■'"charac- 
ter group. We denote by G' the group G* with 
discrete topology. Let K be the character 
group of G’, and let (p* be the identity map- 
ping G'^G* and (p be its conjugate mapping 
G^K, which is a continuous homomorphism. 
Then K is the Bohr compactification of G 
with the canonical mapping (p. A necessary 
and sufficient condition for / on G to be con- 
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tinuous almost periodic is that a continuous 
function f on K such that f = f'Qcp must 
exist. If this condition is satisfied, then the 
mean M [f] is identical to dx. For 

any finite-dimensional continuous unitary 
representation /)' of K, D = D’o <p is a finite- 
dimensional continuous unitary represen- 
tation of G, and vice versa. Hence there exists 
a canonical isomorphism (determined by D = 
D 'o (p) between the equivalence classes of 
finite-dimensional unitary representations 
of a separated topological group G and the 
equivalence classes of finite-dimensional uni- 
tary representations of its Bohr compactifica- 
tion K. The tkernel of the canonical mapping 
(P- G->K is identical to the intersection of all 
the kernels of finite-dimensional continuous 
unitary representations of G. 



1. Maximally Almost Periodic Croups 

Let G be a topological group. If for each pair 
a, b of distinct elements of G there exists a 
continuous almost periodic function f on G 
such that f(a) #f(b), then G is called a maxi- 
mally almost periodic group. This is the case 
if and only if G has sufficiently many finite- 
dimensional unitary representations. For a 
connected locally compact group G, the fol- 
lowing six conditions are equivalent: (1) G is a 
maximally almost periodic group. (2) There is 
a one-to-one continuous homomorphism from 
G into a compact group. (3) G is the direct 
product of a compact group and a vector 
group R”. (4) G is the tprojective limit of -i-Lie 
groups that are locally isomorphic to compact 
groups. (5) The quotient group G/Z is com- 
pact, where Z is the center of G. (6) The system 
of all neighborhoods that are invariant under 
the tinner automorphisms constitutes a basis 
for the neighborhood system of the unit [7]. 

Moreover, any discrete free group is maxi- 
mally almost periodic. If there is no continu- 
ous almost periodic function except constant 
functions, the topological group is called 
minimally almost periodic. Any noncompact 
connected tsimple Lie group is minimally 
almost periodic. 
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Analog Computation 

A. History 

The term analog computation is a generic term 
describing various techniques of computa- 
tion employing diagrams, or physical systems 
whose equations are similar to the mathemat- 
ical problems in question. The history of ana- 
log computation is probably as old as that of 
digital computation; for example, the ancient 
Greeks tried to solve cubic equations using 
diagrams, and the astrolabe widely used by 
astronomers through the medieval period is 
also a kind of analog computer. Soon after 
the discovery of logarithms, the slide rule was 
invented. In the 18th Century, the planimeter, 
used to measure plane areas, was introduced, 
and in the 19th Century, the nomogram ap- 
peared (- Section D). In the first half of 20th 
Century, a large electronic analog computer 
was developed, thus predating the first prac- 
tical digital computer. 

However, analog computation has an essen- 
tial defect, namely, the limitation of accuracy. 
Today it is useful for simple calculation, but is 
rapidly becoming less important as the devel- 
opment of digital computers, including pocket 
calculators, advances. 

B. Graphical Calculation 

Graphical calculation is a method of compu- 
tation by means of geometric constructions 
using common drawing tools. Some typical 
examples of practical graphical calculation are 
the following: evaluation of linear functions of 
several variables (J. Massau, 1887), of systems 
of linear equations (F. J. van den Berg, 1 888), 
of polynomials (J. A. Segner, 1761), of alge- 
braic equations (Lill, 1 867), graphical integra- 
tion, graphical differentiation, and the solu- 
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tion of ordinary differential equations of the 
first order (J. Massau, 1878), of linear differ- 
ential equations (Czuber), and of ordinary 
differential equations of the second order 
(Lord Kelvin, 1892). 

Using bisquare -root graph paper, sold 
under the names of binomial probability 
paper or stochastic paper; we can handle tf’- 
distributions or other probability distributions 
reducible to F-distributions, such as binomial 
or normal distributions, to a fairly good de- 
gree of approximation. This graphical method 
of statistical inference is, even now, a powerful 
method of statistical quality control. 



C. Graphical Mechanics 

Graphical mechanics is the graphical treatment 
of mechanical problems, especially problems of 
equilibrium. In this method, the fundamental 
constructions are the composition and reso- 
lution of forces by means of force polygons 
(Fig. 1). This method is also applicable to 
problems in dynamics when they can be re- 
duced to problems of equilibrium by means of 
d’Alembert’s principle. 




Fig. 1 

An example of the graphical method of constructing 
the composite and the line of action of three given 
forces F^, F 2 , and Fj. In the force polygon in (b), the 
vector^ represents the composite R, On the link 
polygon in (a), the point d is a point on the line of 
action of the force. If the fourth force F. has the 
same magnitude as R with opposite direction, both 
polygons are closed, and the four forces fj, Fj, F 3 , 
and are in equilibrium. 

Graphical mechanics is most convenient 
when applied to problems reducible to 2 - 
dimensional structural mechanics. But we can 
also use it for 3-dimensional problems if we 
work on projections, e.g., on the plan and the 
elevation, of the original body. In recent years, 
however, the amount of work involved in 
geometric construction has been deemed non- 
negligible, and the technique of computer 
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graphics is now considered to be much niore 
convenient. Here the computation is done by a 
digital computer, and the result is displayed in 
a convenient graphical form. 



D. Nomograms 



Nomograms are charts in which we can easily 
read off the corresponding value u„ from given 
values Mj, , when there is a relation 
f (m 1 , . . . , u„) = 0 among ]\ real variables u, , 

.... u„. The construction of nomograms has 
been thoroughly investigated by M. d’Ocagne 

[5]. 

A function of two variables F{u^,U 2 ) = 0 can 
be represented by a two-sided scale or by func- 
tional paper. The most useful nomograms are 
those for functions of three variables, F(ui, 

^ 2 ! W 3 ) = 0. Changing this into /j(x, y, u,) = 0 
(i = 1 , 2 , 3 ) and drawing the curves f.{x, y, Uj) = 0 
for fixed values of Uj, we have an intersection 
chart, where three curves for the correspond- 
ing values Mj, U 2 , W 3 meet at a point. In prac- 
tice, however, it is much more convenient to 
use the alignment chart, which is a dual of the 
intersection chart. It is especially useful when 
the relation F(u j , 1 / 2 » ^ 3 ) = ^ can be decom- 
posed into an equation of the form 



/,(wi) gi(w,) hi(ui) 

^2(^2) ^2(^2) 
93(^3) ^3(^3) 



= 0 . 



( 1 ) 



Putting X,- yi = gjhi (i = 1, 2,3) and 

drawing the CUrves (x,-, y,) with the parameter 
U; scaled on it (u, -scaled), the corresponding 
values Mj, and satisfying relation ( 1 ) lie 
on a straight line (Fig. 2). Using this property 
we can easily read off the corresponding values 
(by laying down a ruler). In the strict sense, the 
term “nomogram” usually refers specifically to 
an alignment chart. 




Fig. 2 



For four or more variables, we can apply 
similar techniques if the function is separable 
into functions of three variables. If it is not 
separable, we can sometimes apply such tech- 
niques such as two-functional scalcS with 
functional networks, cocircular charts, co- 
planar charts, or moving charts. Using func- 
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tional approximations, an approximate nom- 
ogram has recently been constructed with 
errors considered to be admissible in practical 
use. 

Historically, the origin of Hilbert’s 13th 
problem, which asks if it is possible to rep- 
resent a function of many variables as the 
composite of functions of two variables, 
comes from a study of nomograms with many 
variables. 

E. Analog Computers 

The analog computer is a special purpose 
machine designed for a specified analog com- 
putation. In a wider sense it includes special 
devices that perform ’’'analog simulation. The 
following mechanical devices are well known: 
the pantograph for copying plans, the harmonic 
analyzer for obtaining the Fourier expansion 
of a periodic function, and V. Bush’s differen- 
tial analyzer. Since about 1940 large electronic 
analog computers for solving differential equa- 
tions have been extensively developed. 

Analog computers have up to recently had 
the advantages of ease of construction, sim- 
plicity, and inexpensive operation, and they 
were also considered to be fast enough for use 
in real-time computation. To compensate for 
the limitations of analog computers in com- 
parison to digital ones, several hybrid com- 
puters have also been used. However, with 
the rapid progress of digital computers and 
digital-analog converters, the analog com- 
puters are now considered less important than 
in the past. 

F. Curve Fitting 

Curve fitting is a method of finding a sim- 
ple curve y —f{x) supplying the best possible 
approximation to the values y,, y,, for 
discrete values Xj , X 2 , of the independent 
variable. A polynomial passing through all the 
given points is constructed by means of tinter- 
polation. For experimental data we usually 
construct a curve by using methods, such as 
the tmethod of least squares, that take the 
errors due to observation into account. 

A function constructed to best fit the ob- 
served values of the function y=y(x) express- 
ing some physical law is called an empirical 
formula, in contrast to the theoretical formula. 
Usually, we first assume that the function 
contains several empirical constants, which are 
then determined to fit the experimental data. 

The most common case is a linear approx- 
imation (using a linear function) with a suit- 
able change of variables. Semilogarithmic 



paper, logarithmic paper, or probability paper 

are used to facilitate transformations in terms 
of logarithms or normal distributions. 

We also frequently use polynomials or trig- 
onometric polynomials of lower degree. To 
determine the empirical constants the follow- 
ing methods are used: the graphing method, 
the selection of certain points, the mean value 
method, or the method of least squares. 



G. Orthogonal Polynomials 



Let there be given the values » Tn-i foi" 

discrete values x = 0, 1, . , n — 1 with equal 
differences. A polynomial f(x) of degree k ( < n) 
that minimizes the sum of the squares G,(n) = 
2(Tm-/(w))^ is given by 

k 

f{x)= ^ a,q^{n,x), 

v = 0 

n-1 

«v= z 
m=0 

n-1 

m = 0 

and we have = 

Here the function q^{n, x), called a Chebyshev 
q-function, is defined by 



q.{n,x)= Z 

m = 0 






In practical applications, it is better to replace 
q^{n, x) by the function 



q*{n,x) = q,(n,x)l2 '’v\M,(n), 



where M,(n) is the greatest common divisor of 





(m = 0,l,..., v) . 



The values q*{n,m) (m = 0, 1, ...,n — 1) are mu- 
tually coprime integers. The function q*{n, m) 
is called the simplest Chebyshev (^-function or 
simplest orthogonal polynomial, and is some- 
times denoted by X,„(x), ^’„ „,{x), or „(x). 
When two functions satisfy the condition 

if^9)n= Z = 

m = 0 

they are called orthogonal for a finite sum. If 
from . . . , x'’ we construct a system 

orthogonal with respect to this definition, then 
we have the polynomial 



^=0 \mj\ m j\mJ \m 



which has the following connection with q^: 
Qy(n, x) 

= ((-!)>- l)!/2>-v-l)!)n,„-i(4 
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The polynomial is called a Chebyshev 

orthogonal polynomial or sometimes simply an 
orthogonal polynomial. The polynomials 
are orthogonal for the finite sum. If oo and 
0 ^ X ^ 1, the function tends to P^{ 1 — 

2x), where is the tLegendre polynomial. 
These functions may be conveniently used in 
least-squares curve fitting. 
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Analysis 

The origin of analysis can be traced back to 
the time when Eudoxus (4th century B.c.)and 
Archimedes (3rd century B.c.) devised the SO- 
called method of exhaustion for calculating 
the area of a plane figure and the volume of a 
solid. Their objects of investigation were re- 
stricted, however, to particular types of figures 
or solids. In the I6th and I7th centuries, E. 
Viete, J. Kepler, and B. Cavalier! again took 
up this problem. In the 1 7th Century, the prob- 
lem of drawing a tangent to a given curve 
was studied by R. Descartes, P. de Fermat, 

B. Pascal, and J. Wallis. Fermat, in particular, 
applied the result to find the maxima and 
minima of certain functions. It is worth noting 
that a certain type of mathematics powerful 
enough to produce similar results was inde- 
pendently developed in Japan around that 
time. In 1684, G. Eeibniz in Germany intro- 
duced the symbols dx and dy in treating the 
same problem. He proved that dy/dx repre- 
sents the slope of the tangent to the curve at 
a given point and discovered a new operation 
to calculate it. In 1686, he established the “in- 
verse tangent method,” which is what we now 
call integral calculus. He also introduced the 
notation J. On the other hand, I. Newton in 
England developed his “method of fluxions,” 
corresponding to our differential and integral 
calculus, from the viewpoint of mechanics. But 
neither Eeibniz nor Newton formulated the 
fundamental concepts rigorously, and there- 
fore they were criticized severely by many 
contemporary scholars. The new calculus 
gained ground in Great Britain rather slowly. 
B. Taylor in England and C. MacLaurin in 
Scotland demonstrated its usefulness in 1715 
and in 1745, respectively. On the continent, 
however, Eeibniz’s symbolic calculuSwas 
taken up by mathematicians of the Bernoulli 
family, G. E. A. de I’Hopital, G. Eagnano, and 
many others; with it, they solved many scien- 
tific problems which until then had remained 
intractable. This motivated subsequent re- 
searchers to pose new problems in the form of 
differential equations. 

One of these problems, treated by J. le 
Rond d’Alembert in relation to the vibration 
of a chord, concerns the tpartial differential 
equation 

d'^yldt^ — a^d^yldx^ ( 1 ) 

for y = y(tj x) with the boundary conditions 
y = 0 for X = 0 and x = /. He obtained the solu- 
tion y= f(^at + x)~ f{at-x), where / is an 
arbitrary function of period 21. In 1753, D. 
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Bernoulli showed that solutions of equation 
(1) are given by functions of the form 






Oo 

2 




( knx 

I + OfcSm 




These two kinds of solutions gave rise to the 
question of whether an arbitrary function can 
be expressed by a ttrigonometric series. This 
problem was studied by A. C. Clairaut, J. L. 
Lagrange, and L. Euler. In 1807, J. Fourier in 
France, in treating a problem on the conduc- 
tion of heat, claimed that an arbitrary function 
of period In can be expressed as 






Oo 

2 



OC 



+ y (a^coskx + b^sinkx), 



( 2 ) 



where the coefficients a,^ and h,^ are given by 






1 

n 



'it 

f{x) cos kxdx, 

J -n 




f{x) smkx dx. 



(3) 



This series is now called the tpourier series, 
but Fourier never verilied the fact that the 
series (2) with coefficients (3) converges and 
represents f(x). It was only as late as 1820 that 
A. F. Cauchy in France first noted that to 
treat a series properly, one must examine its 
convergence. 

In the 1 9th Century, the concept of tfunc- 
tions, which had been taken in the sense of 
“analytic expressions,” came to be detined by 
the correspondence relation. Cauchy clarified 
the ideas of '^limit and tcontinuity, tdifferentia- 
bility and '^integrability. He showed that a 
function that is continuous in a bounded 
closed interval is integrable in that interval. 

But his proof was not rigorous, as he lacked 
the notion of tuniform continuity. In his 1854 
paper on the trigonometric series, B. Riemann 
in Germany considered the integrability of 
functions that might be discontinuous and 
introduced the concept of what we now call 
the tRiemann integral. 

The theory of '•'sets, initiated by G. Cantor in 
Germany in his paper of 1874, revolutionized 
analysis. R. Baire, E. Borel, and H. Febesgue 
in France contributed to the establishment of 
analysis on the basis of set theory. Baire made 
a classification of discontinuous functions. 
Generalizing his results, Febesgue gave a 
definition of analytic expressions, thus clarify- 
ing the term that had been used vaguely since 
the time of Euler. Febesgue also tried to define 
the concepts of the integral of a function, the 
length of a curve, and the area of a surface 
from the most general viewpoint. In generaliz- 
ing the notion of Imeasure introduced by 
Borel, he established in 1902 the theory of 
iLebesgue measure with which he laid the 



foundations of the theory of ''‘Lebesgue inte- 
grals. The introduction of this theory gave 
to the theory of Fourier series a new turn in 
the direction of functional analysis. Measure 
theory was also employed by A. N. Kolmo- 
gorov of the Soviet Union to lay a solid foun- 
dation for tprobability theory in 1933. 

The study of functions of a complex variable 
was originated by Cauchy in the first half of 
the 19th Century. He began his research by 
introducing the notion of “monogenic func- 
tions”; a function which is monogenic at every 
point of a domain is what we now call a tholo- 
morphic function. He established tCauchy’s 
integral theorem and integral formulas for 
these functions, and deduced from these theo- 
rems the +residue theorem for functions with 
Ipoles. Making use of the integral formula, 
Cauchy proved that a function that is 
holomorphic at a point a can be expanded in 
a power series of the form a^(z — a)" in a 
neighborhood of this point. 

Riemann considered a complex variable w 
as a function of another complex variable z 
when dw/dz is independent of the value of 
the differential dz. This amounts to the same 
thing as a “monogenic function” of Cauchy. 
Riemann’ s mapping theorem became a model 
for subsequent developments. Riemann intro- 
duced the concept of tRiemann surfaces in 
order to tuniformize multivalued functions. 
This important idea was basic to the progress 
of analysis and ttopology in the 20th Century. 

K. Weierstrass, who was a contemporary of 
Riemann, developed the theory of functions 
of a complex variable from a purely analytic 
viewpoint. He delined an element of a function 
to be a power series ^ 

representing a holomorphic function in the 
interior of its tcircle of convergence, and de- 
fined an analytic function to be an aggregate 
of such elements that are derived from one of 
them by means of ^analytic continuations, 
along all curves having the point a as the 
initial point. 

Riemann and Weierstrass constructed their 
theory to complete the theory of telliptic func- 
tions and tAbelian functions that was initiated 
by N. H. Abel and C. G. J. Jacobi. Their re- 
sults were a high point of 19th-century class- 
ical mathematics. H. Poincare in France built 
upon their work. Another high point in the 
theory of functions of a complex variable was 
reached when J. Hadamard and C. de Fa Vallee 
Poussin in France made use of it to prove the 
■•^prime number theorem in 1896. 

Weierstrass also initiated the study of func- 
tions of several complex variables and was 
succeeded by Poincare, P. Cousin, and E. 
Picard in France. They tried to extend the 
theory of functions of one complex variable to 




99 



that of many variables. F. Hartogs, however, 
discovered a phenomenon quite different from 
the case of one variable. Cousin posed in 1895 
the problem of constructing a function of 
several complex variables that has assigned 
poles. This problem was pursued by A. Weil in 
France and solved by K. Oka in Japan in 
1936. Also, the problem of characterizing 
tdomains of holomorphy was investigated by 
E. E. Eevi in Italy, H. Cartan in Erance, and 
P- Thullen and K. Stein in Germany and was 
finally solved by Oka in 1953. H. Cartan and 
J.-P. Serre reformulated these results in terms 
of ^'cohomology with coefficients in 'isheaves. 
This considerably influenced the formulations 
of mathematics thereafter. These results were 
further generalized to tanalytic space by H. 
Grauert, R. Remmert, and Stein in Germany. 
The theory of 'I'complex manifolds, which are 
generalizations of Riemann surfaces to several 
variables, was initiated by W. V. D. Hodge in 
England and K. Kodaira in Japan and con- 
tinued by E. Hirzebruch in Germany, M. E. 
Atiyah in England, and I. M. Singer in the 
United States. 

Differential calculus gives a general method 
of finding extreme values of a given function. 
Eikewise, in order to find a function that pro- 
duces an extremal of the given tfunctional, the 
calculus of variations was created in the 18th 
Century. Eor example, Euler considered the 
problem of finding a particular function y(x) 
that renders the functional j* F{x, y, y’) dx 
an extreme value among all those functions 
y(X) for which the plane curve y = y(x) passes 
through two given points (a. A) and (b, B) of the 
plane; he showed that this y(x) must satisfy 
the differential equation dFy/dx — Fy = 0 (1744). 
Eagrange, W. R. Hamilton, and others devel- 
oped this result into a general t variational 
principle that governs not only classical tme- 
chanics but also 'Iquantum mechanics. 

Erom research on the continuity or differen- 
tiability of the functional with respect to y 
emerged the idea of considering a function as a 
“point” in a tfunction space. This gave rise to 
tfunctional analysis, a branch of analysis that 
treats functions as elements of certain spaces 
and utilizes the methods of algebra and topol- 
ogy. The first result in this regard was the 
theory of tintegral equations of V. Volterra in 
Italy and E. E Eredholm in Sweden in the 
beginning of the 20th Century. Eredholm’ s 
work was motivated by his desire to solve the 
■IDirichlet problem, the solution of which had 
been used by Riemann in the proof of his 
mapping theorem, etc. However, Riemann’s 
own proof of existence of the solution, called 
the ■^Dirichlet principle, was not rigorous, and 
attempts to save the proof provided one of the 
central problems of analysis for some time. 
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Eredholm’ s solutions were different from Rie- 
mann% but D. Hilbert in Germany was able 
to justify Riemann’s original proof. Hilbert’s 
proof was later simplitied and generalized b} 

R. Courant, H. Weyl, and others. Hilbert also 
introduced the '^function spaces and L 2 to 
study the '•'eigenvalue problem of integral 
equations with tsymmetric kernels. Later J- 
von Neumann in Germany established '•'spec- 
tral theory in abstract tHilbert spaces and 
applied it to the mathematical foundations 
of quantum mechanics (1929). S. Banach in 
Poland created the theory of tfinear operators 
in Banach spaces (1932). This theory was 
further generalized to that of '•'linear topolog- 
ical spaces and was applied to the theory of 
distributions. 

In their study of partial differential equa- 
tions and Eourier analysis, Hadamard, J. 
Leray, S. E. Sobolev, T. Carleman, and many 
others had to extend the notion of functions; 
they also enlarged the notion of derivatives. E. 
Schwartz in Erance introduced tdistributions 
and defined derivatives in the sense of distribu- 
tions to unify these generalizations (1945). M. 
Sato in Japan defined more general generalized 
functions, called thyperfunctions (1958). It has 
become evident that both distributions and 
hyperfunctions have provided the most power- 
ful tools in recent research in the general 
theory of tpartial differential equations, to 
which E. Hormander in Sweden has made 
outstanding contributions. 
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Analytic Functions of Several 
Complex Variables 

A. Holomorphic Functions 

As in the case of '•'holomorphic functions of 
one complex variable, the definition of holo- 
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morphic functions can be given in two ways: 
the first definition utihzes differentiability, 
following the approach of B. Riemann; and 
the second method utilizes the notion of power 
series expansion as developed by K. Weier- 
strass. In this article, N = {0, 1,2,.}. 

B. Power Series 

Eet z be an n-tuple of complex variables 
z 1 , . . . , z„, and c = (Cj , . . . , c„) a point of C". 

An infinite sum P of monomials u^(z cf = 

A„(zi -c^f' ...(z„-c„)'‘" ...,/c„)6 

N"), where e C, is called a power series 
with center c and coefficients o^. If, for a bi- 
jection (p of N onto N", the simple series 
ZpeN\%(p){z - convergent at z = 

we say that P is absolutely convergent at z^. 

Its sum at z®, denoted by Z «fc(z° — cf, is de- 
fined as the sum Z ~ which is 

independent of the choice of (p. If P is uni- 
formly bounded at z°, then P is absolutely 
convergent at every point of the open polydisk 
S= {z\\Zj — Cj\<\Zj —Cj\,j=\, ... ,n]. Further- 
more, in this case P converges absolutely and 
uniformly on every compact set in S (N. H. 

Abel). 

The convergence domain of a power series P 
is the set D of points SUch that P is ab- 
solutely convergent at every point in a neigh- 
borhood of z°. The interior of the set B of 
points at which the infinite sum P is uniformly 
bounded is equal to D. A (complete) Reinhardt 
domain with center c is a domain D in C" such 
that whenever D contains Zo> the domain D 
also contains the torus { z \zj — Cj\ = |z9 Cj\,j = 

1, ,n} (the closed polydisk {z| \Zj-Cj\ ^Iz^- 
Cj\,j= 1,. , n}). If the convergence domain D 
of the power series P is not empty, it is a com- 
plete Reinhardt domain and is also logarithmi- 
cally convex; that is, the setD — (Jj{z|z~Cj) is 
mapped onto a convex domain in R" by the 
mapping Zj^\og\zj — Cj\ (j= 1, . . .,«). The set D 
of points at which P is absolutely convergent 
is, in general, greater than D, and it is possible 
that D contains exterior points of D . A thorn of 
D is the set of exterior points of D contained in 
D that are located on the planes {z Zj = Cj} (j = 

1, . , n). An n-tuple r 6 R",. is called a set of 
associated convergence radii if P is absolutely 
convergent at every point of {z 1 1 — c^| < 

rpj=\,...,n} but not of {z 1 1 Zj - Cj\ > r^j = 

1, , «}. An n-tuple of associated conver- 
gence radii may not be uniquely determined, 
but it satisties 

limsup(|ajr'‘)'^l'‘'= 1, \k\ = ki + ... + k„ 

|k|-» + oo 

(E. Eemaire). 

Eet / be a complex-valued function defined 
in a neighborhood of z® G C”. If there exists a 
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convergent power series P with center SUch 
that at every point of a neighborhood of z” the 
value off and the sum of P coincide, then f is 
called analytic at in the sense of Weierstrass, 
and P is the Taylor expansion of f at z”. 



C. Differentiability 



Let / be a complex-valued function detined in 
a neighborhood of e C”. If in a neighbor- 
hood of we have 

/(z)-/(z'^) = ai(zi -z?) + ... + 0 f„(z„-z°) + £, 

with a,, . . . ,a„eC and 



lim£/(lzi-z?| + ...-)-lz„-z"l)-0, 



( 1 ) 



then we say that / is (totally) dilferentiable at 
z°. The function / is then continuous at z°, 
and the partial derivatives df/dzj (j = 1, , n) 
exist. Furthermore, the Cauchy-Riemann 
differential equations dfldzj = 0 (j= 1, , n) 
hold, where d//dz^=(l/2)(c)//0Xj-i0//dyj) and 
dfldzj = {\/2)(df/dxj+ idf/dyj) with Zj^ Xj + iyj 
We say that f is holomorphic at z^ in the sense 
of Riemann if f is differentiable at every point 
in a neighborhood of z^. Analyticity in the 
sense of Weierstrass is equivalent to holomor- 
phy in the sense of Riemann. Furthermore, if 

the partial derivatives df(dz^ (j = 1 ri) exist 

at every point in a neighborhood of z°, then f 
is, without assuming continuity, proved to be 
holomorphic. Thus the holomorphy off in 
each variable Zj implies the holomorphy off in 
z = (zj, . ..)z,) (Hartogs’s theorem of holomor- 
phy, 1906). 

A complex-valued function in a domain G c; 
C" is called holomorphic or analytic in G if it 
is holomorphic at every point of G. Let H(G) 
be the ring of holomorphic functions in G. For 
/ = u + if e N(G), u and p satisfy in G the dif- 
ferential equations d^u{z, z)ldzjd\ = 0; that is, 

d^u 

( 2 ) 

dxjdy^ dx,dyj 

A tdistribution TeD’(G) is called pluribar- 
monic in G if it satisfies (2) in G. Then T is 
harmonic and hence a real analytic function 
by ■*'Weyrs lemma. 

Let Gj be a domain in the z^-plane with 
piecewise smooth boundary Cj. If f e H(G) (G 
= n"=i Gj) is continuous on G, then 

no 




(270')" Jc, X ... XC„ (Cl - Zl) • • ■ (C„ -z„) 



f /(z),zeG, 
dCi A...AdC„ = f - (3) 

' 0,z^G 



(Cauchy’s integral representation). Thus if n ^ 2, 
then / is determined by its values only on the 
proper subset C = Ci x x C„ of dG, which is 
called the skeleton (or determining set) of G. 

For (pluriharmonic) functions of several com- 
plex variables, the boundary value problem, 
not necessarily solvable in its classical form, is 
not so effective as the Dirichlet problem in one 
complex variable. 

As in the case of one variable, the Laurent 
expansion is valid for every holomorphic 
function in a domain of the form G = ri"=iGj, 
where the Gj are circular annuli c: C. Suppose 
that we are given /, 6 f/(Gj) and (2 e //(G,), 
where Gj , G 2 are domains in C” such that Gj H 
G 2 is nonempty and connected. If /i =^2 
{z\\zj — zj\<rj,y = y'^,j~\, ...,n}, where z° = 

6 Gi n G 2 , then there exists a unique 
/ef/(Gj U G 2 ) such that /| G^ =/, and /| G 2 = /2 
(theorem of identity). Thus tanalytic continu- 
ation proceeds as in the case of one variable. 
Similarly, some fundamental theorems in one 
variable, such as tLiouville’s theorem on ten- 
tire functions and the f maximum principle, 
hold also for several variables. However, there 
are some properties that reveal the differences 
between the cases of one and several variables. 
For instance, the set of zeros of a holomorphic 
function (^ 23 Analytic Spaces B) has no 
isolated point for n ^ 2. The investigation of 
these remarkable differences is one of the 
purposes of the theory of analytic functions of 
several complex variables. 



D. Shilov Boundaries 

While the maximum principle holds for a 
holomorphic function in a domain G, the set of 
points where the maximum is attained may be 
a proper subset S of 6G. For instance, if G is 
the product of annuli as before, then the skele- 
ton of G can be taken as S. In connection with 
the theory of tnormed rings, G. E. Shilov 
proved that there exists a unique smallest 
member Sq (called the Shilov boundary of G) 
in the family of closed subsets S such that 
sup{j/(z)| |zeS} = sup{|/(z)| |zeG} for every 
f E H(G) continuous on G. The structure of 
is investigated in detail together with 
the pseudoconvexity of G connected with it. 
Applying tPerron’s method for the Dirichlet 
problem to ^'plurisubharmonic functions and 
the Shilov boundary, H. J. Bremermann solved 
one type of boundary problem. 



E. Local Theory 

Let / and g be two functions defined in neigh- 
borhoods of a set Sc C". If /= g in a neigh- 
borhood of S, then f and g are called eqiiiva- 
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lent with respect to S. The germ off on S, 
denoted by /j, is the equivalence class off: A 
germ of a bolomorpbic function on S is the 
germ on S of a holomorphic function defined 
in a neighborhood of S, and H(S) denotes the 
ring of germs of holomorphic functions on S. 
Given a point 0 in C”, H(0) = H{ {0}) is iso- 
morphic to the ring of convergent power 
series at 0, i.e., the power series that are ab- 
solutely convergent in some neighborhoods of 
0. For every nonzero function fe H (0 ), there 
exists a system of coordinates (zj, , z„) cen- 
tered at 0 such that /’(O, , 0, zj 0 for every 
z„ ^ 0 in a neighborhood of = 0. In a neigh- 
borhood of 0, / is then equal to the product of 
an invertible element of H„ and a distinguished 
pseudopolynomial 

= + z„_i)zr‘ + 

+ flp(Zi,...,Z„_i)6//„_i[zJ, 

with (0 , . . . , 0 ) = . . . = 0 ^( 0 , ... ,0) = 0, and P(z„) 
is uniquely determined by / and the coordi- 
nates zi, , z„ (Weierstrass’s preparation 
theorem). It follows from this that is an n- 
dimensional tregular local ring. Considering 
H (0) as the ^inductive limit of tlocally convex 
rings H (U ), where U ranges over a base for a 
neighborhood system of 0, H. Cartan proved 
the preparation theorem in a more precise 
form in which the association f-^dj is con- 
tinuous with respect to the supremum norm. 
Based on a deep consideration of this situ- 
ation, K. Oka proved a theorem of funda- 
mental importance: The tsheaf (9q« defined by 
&c\z= ^ (zeC") is tcoherent. 



F. Domains of Holomorphy 

Given a domain G c G" for n ^ 2, it may be 
that there exists a domain G’ strictly greater 
than G such that all the functions that are 
holomorphic in G extend to holomorphic 
functions in G’. For instance, let S = S’ x n, 
where S’ and o are open polydisks in (zj, . , 
z„_i)-space and z„-space, respectively, and 
let T c C” be an open set. If there exists an 
open set U {^0}cS' such that ((7 x (t)U(S' x 
do) c T and if S D T is connected, then all the 
functions that are holomorphic in T extend 
uniquely to holomorphic functions in SU T 
(Hartogs’s continuation theorem). In particular, 
if A is an tanalytic set in a domain G c C" with 
dim A ^ n 2, then all the functions that are 
holomorphic in G-A extend uniquely to 
holomorphic functions in G. Furthermore, if A 
is an analytic set in G with A ^G, then every 
f 6 H(G — A) that is locally bounded at the 
points of A extends uniquely to a holomorphic 
function in G (Riemann% continuation theorem 



for 2). The domain Gj- of holomorphy for f 
is defined to be the maximal domain to which 
/ may be continued analytically. A domain G 
is called a domain of holomorphy if G = Gj for 
some ye H (G ). However, Gf is, in general, not 
a subdomain of C". Gj- is, generally, a mani- 
fold spread over C”; i.e., Gf is a connected n- 
dimensional tcomplex analytic manifold with a 
holomorphic mapping cp: Gj-^C" of maximum 
Jacobian rank (<p is then an open mapping). 

The same is true for the common existence 
domain of functions in a subfamily of H (G). 

The common existence domain G of all the 
functions in H (G ) is called the envelope of 
holomorphy. A holomorphically complete 
domain is a domain G such that G = C. These 
notions carry over to the case where G is a 
manifold spread over C”. The (general) Levi 
problem of determining the conditions for a 
given domain to be holomorphically complete 
is fundamental to the theory of analytic func- 
tions of several complex variables (-^ Section 
I). In connection with this problem, various 
notions of pseudoconvexity of holomorphi- 
cally complete domains are defined. 

G. Pseudoconvexity 

An upper semicontinuous real-valued function 
u (— 00 ^M< -boo) in a domain G<=C" is said 
to be plurisubharmonic if for every e G and 
every a e C" the function u(z° -i- ta) of ( is tsub- 
harmonic (including the constant — oo) in all 
the connected components of {t z° + taG G} . A 
domain G is said to be pseudoconvex (or d- 
pseudoconvex) ifu=— logd, is plurisubhar- 
monic in G, where d,(z) is the distance from 
z e G to 5G with respect to any norm in C”. 
Every connected component of the interior of 
the intersection of a family of pseudoconvex 
domains is pseudoconvex, and the union of an 
increasing sequence of pseudoconvex domains 
is pseudoconvex. Suppose that we are given a 
domain G and a function u of class in a 
neighborhood of G such that G = {z| u(z) < 0} 
and, for some e> I,jk{d^uldzjdz,)aja^^F.\a\^ 
for every (2 eC". Then the domain G is said to 
be strongly pseudoconvex. Strong pseudo- 
convexity implies pseudoconvexity. Every 
pseudoconvex domain is exhausted by an 
increasing sequence of strongly pseudoconvex 
domains. An open set Pc C” is called an ana- 
lytic polyhedron if P= |z| \Xxi^)\ < 1,0£ = 
b » N}, fi{P) (a= 1, . . . N). Then every 
connected component of P is pseudoconvex. A 
Weil domain is a connected and bounded 
analytic polyhedron P defined by Xai'^ = 

1, , N) with such that for every k 

(1 ^ k ^ n) the intersection of the hypersurfaces 
|;(^(z)| = l (l^i^k)is of dimension <2n-k. 
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H. Holomorphic Convexity 

A domain G c C" is called holomorphically 
convex if for every compact set K c G, K = 

0 fsH(G){^ |/(m)|} (the holomor- 

phic hull of K) is a compact set contained in 
G. (For a domain G contained in an tanalytic 
set we can similarly define holomorphic con- 
vexity of G.) Every connected component of 
the intersection of a family of holomorphically 
convex domains is holomorphically convex, 
and a holomorphically convex domain is 
exhausted by an increasing sequence of Weil 
domains. Holomorphic completeness implies 
holomorphic convexity. The converse is true 
for domains in C". if G is holomorphically 
convex, then for every point ( of 5G there 
exists an / e H(G) such that / is not locally 
bounded at ( (H. Cartan ^^d P. Thullen, 1932). 
Hence a holomorphically convex domain is a 
domain of holomorphy. Thus a domain in C 
is holomorphically convex if and only if it is a 
domain of holomorphy. (The same is true for 
unramified covering domains over C" (Oka, 
1953}.) The union of an increasing sequence of 
domains of holomorphy is a domain of holo- 
morphy (H. Behnke and K. Stein [2], 1939). 
Suppose that we are given a domain G and 
domains S^,T^(a= 1,2,) such that U 7” c 
G and sup^^ |/| = sup^^y |/| for every / eH{G). 
Suppose also that = hm is bounded. We 
say that the continuity principle holds in G if 
Tq c G {Tq = lim TJ implies c G. The con- 
tinuity principle holds in a domain of holo- 
morphy (Hartogs’s theorem of continuity). 

This implies that if G is a bounded domain 
c C" (n ^ 2) with connected boundary dG, 
then every function holomorphic in a neigh- 
borhood of dG extends to a holomorphic 
function in G (Hartogs-Osgood theorem). In 
particular, for « ^ 2, the set of singular points 
of a holomorphic function has no isolated 
point. A domain is pseudoconvex if the con- 
tinuity principle holds there. Hence a domain 
of holomorphy is pseudoconvex. 

I. The Levi Problem 

Let G be a domain in C" and z^edG. If there 
exists an open neighborhood U of such that 
every connected component of G fl f/ is a 
domain of holomorphy, then G is called Car- 
tan pseudoconvex at z'’. On the other hand, if 
every 1 -dimensional analytic set that has as 
an ordinary point contains points not belong- 
ing to G U in the neighborhoods of z^, 
then G is called Levi pseudoconvex at Fur- 
thermore, G is called locally Cartan (Levi) 
pseudoconvex if G is Cartan (Levi) pseudo- 
convex at every point of dG. Every domain of 
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holomorphy is locally Cartan pseudoconvex. If 
G is pseudoconvex and there exists a neighbor- 
hood U of z° such that Gfl V — {z\(p{z)<0], 
where (pEC^{U), then G is Levi pseudoconvex 
at z°. 

The (proper) Levi problem of whether every 
pseudoconvex domain is a domain of holo- 
morphy was proposed by E. E. Levi (1911). 
After unsuccessful efforts by various mathe- 
maticians to solve the problem, it was affirma- 
tively solved by Oka (1942 for n = 2 and 1953 
for manifolds spread over C" for n ^ 2), H. 
Bremermann, and E. Norguet. The problem 
was solved also by H. Grauert [ 14] (1958) in a 
more general form (- Section L) using results 
on linear topological spaces and by L. Hor- 
mander [18] (1965) using methods of the 
theory of partial differential equations. A 
fundamental step in Oka’s solution is his 
gluing theorem: Let G be a bounded domain 
c C". If every connected component of Gj = 
{z|xi >a}nC and G 2 ^{zjxj <h) DG (a<fe) 
is a domain of holomorphy, then G is a do- 
main of holomorphy. Indeed, by virtue of 
the Behnke-Stein theorem and the fact that 
every pseudoconvex domain is the union of an 
increasing sequence of bounded locally Cartan 
pseudoconvex domains, it suffices to solve the 
Levi problem in the case of a bounded locally 
Cartan pseudoconvex domain. The Levi prob- 
lem in this case is solved by the gluing theo- 
rem. Various integral representations of holo- 
morphic functions are known besides the 
Cauchy representation. The Bergmann-Weil 
integral representation in a Weil domain was 
used as an important means of solving the 
Levi problem. 



J. Holomorphic Mappings 

Holomorphic functions with values in a tquasi- 
complete tiocally convex complex vector space 
E have also been investigated. The classical 
theory described above has been generalized, 
to some extent, to this case. In this way, many 
applications of the theory have been dis- 
covered. An E-valued function in a domain G 
c C” is holomorphic if and only if the map- 
ping uof: G^C is holomorphic for every con- 
tinuous linear form u on E. By this theorem, 
most problems concerning E-valued holo- 
morphic functions can be reduced to those of 
ordinary holomorphic functions. Note that the 
vector space H(G) of ordinary holomorphic 
functions in G is a tprechet space. The spaces 
and complex '•'Banach spaces belong to the 
above category of E. A. C^-valucd holomor- 
phic function in a domain G c C” is called a 
holomorphic (or analytic) mapping of G into 
O’. An isomorphism in the category of do- 
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mains G c C" and holomorphic mappings is 
called an analytic isomorphism (or biholomor- 
phic mapping). An automorphism in the cate- 
gory is called an analytic (or holomorphic) 
automorphism. With every domain G c C 
is associated the sheaf (9q of germs of holo- 
morphic functions over G. Thus we have the 
notion of a tringed space (G, 0,). A complex 
analytic manifold can be defined as a (Haus- 
dorff) ringed space that is locally isomorphic 
to some (G, 0,). 

A meromorphic function in G is a function 
that is locally the quotient of two holomor- 
phic functions with denominator ^0. It may 
be defined more rigorously as a meromof- 
phic mapping of G into P,(C) (- 23 Analytic 
Spaces D). 

K. The Cousin Problems 

The Cousin problems are those of constructing 
meromorphic functions with given zeros or 
poles. In terms of sheaves the problems are 
stated as follows: Let be the sheaf of germs 
of meromorphic functions over a domain G <z 
C”. The first Cousin problem asks whether 
the mapping F (G, F (G, ^g) induced 

by the exact sequence 0 ^ Gg JT g ^ 0 
surjective, where F(G, is 
the module of tsections of ^ over G (- 383 
Sheaves C). Let JTg* be the sheaf of multiplica- 
tive groups of germs of meromorphic functions 
not identically 0 and be the sub sheaf of 
Jfg* formed by germs of nonzero holomorphic 
functions. The second Cousin problem asks 
whether the mapping F(G, Xg*)-^P{G, 

{9)q = surjective. P. Cousin (1895) 

solved the first problem for G = C" or n”= iG, 
and the second problem for G =C". Oka (1935) 
proved that the first problem is solvable in 
every domain of holomorphy. In solving the 
second problem in a domain of holomorphy, 
Oka established the notion of tfiber bundles 
and proved that the problem for any domain 
is reduced to holomorphic triviality of a 
holomorphic principal fiber bundle over the 
domain and that holomorphic triviality is 
equivalent to topological triviality when the 
domain is of holomorphy (Oka’s principle). 
Using the solutions of the Cousin problems, 
Oka proved his gluing theorem, described in 
Section 1. 



L. Stein Manifolds 

Abstracting certain important properties of a 
domain of holomorphy, Stein [33] introduced 
the following category of complex analytic 
manifolds (X, (1) X is paracompact (i.e., 



each connected component of X has a count- 
able open base). (2) Functions in F(A", &x) 
separate the points of X. (3) For every point 
X 6 X there exists a system of local coordi- 
nates around X that is formed by functions in 
F(X, &x)- (4) X is holomorphically COnveX. 

(X, 0,) is then called a Stein manifold. It was 
later discovered by Grauert [12] that con- 
ditions (2) and (4) imply (1) and (3). 

Applying the theory of tcohomology with 
coefficients in sheaves, H. Cartan and J.-P. 
Serre obtained for an tanalytic coherent sheaf 
^ on a Stein manifold X, the following funda- 
mental theorems of Stein manifolds. Theorem 
A: H^{X, #”) generates the stalk (as an (9x- 
module) at every point x of X. Theorem B: 

=0 for al\q>0 [7, 30]. Conversely, 
for a complex analytic manifold X, if for every 
analytic coherent sheaf J of ideals defined by 
a 0-dimensional analytic set in X (i.e., a dis- 
crete subset of X), ^ (X, ./) = 0, then X is a 

Stein manifold. Furthermore, if F{X, &x) = 

F( Y, (9y) for a Stein manifold Y (as in the case 
where X c C"), then the fundamental theorem 
A for every coherent sheaf of ideals implies 
that X is a Stein manifold (I. Wakabayashi). 

Due to the fundamental theorems, most 
results on domains of holomorphy hold un- 
changed for Stein manifolds. For instance, the 
first Cousin problem is always solvable. The 
second Cousin problem is solvable if and only 
Z)=0. An n-dimensional Stein mani- 
fold can be realized as a (ramitied) covering 
domain of holomorphy over C”. Furthermore, 
some theorems on differentiable manifolds 
have analogs on Stein manifolds. For instance, 
the cohomology groups of the complex of 
holomorphic differential forms over a Stein 
manifold X are isomorphic to the cohomol- 
ogy groups H*{X, C) (analog of tde Rham’s 
theorem). Every n-dimensional Stein manifold 
X is realized as a closed complex analytic 
submanifold in that is, there exists an 

injective ^proper holomorphic mapping / : 
with df #0. Consider all the holo- 
morphic tprinciple tiber bundles over a Stein 
manifold X whose fibers are isomorphic to a 
complex Lie group G. The analytic isomor- 
phism classes of the bundles and the elements 
in hHx, G”) (where G“ is the sheaf of germs 
of holomorphic mappings of X into G) are in 
one-to-one correspondence. The same is true 
for the topological isomorphism classes of the 
bundles and the elements in ff^(X, G'^) (where 
G‘" is the sheaf of germs of continuous map- 
pings of X into G). The mapping (X, G‘')-+ 

H\X, G') induced by the canonical injection 
G" -> is bijective (Grauert [13]). Every rela- 
tively compact domain in a complex analytic 
manifold is holomorphically convex if it is 
strongly pseudoconvex (Grauert [14]). Hence 
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such a domain is a Stein manifold. It follows 
from this that every real analytic manifold 
with countable base for open sets is realized as 
a closed real analytic submanifold of some R”. 

The notion of Stein manifolds is generalized 
to that of weakly 1 -complete manifolds: a 
complex manifold X is called a weakly 1- 
complete manifold if there exists a plurisubhar- 
monic function u of class C“ on X, such that 
for any ce R, = {x 6 A' I m(x) < c} is relatively 
compact. The family of such manifolds in- 
cludes compact complex manifolds too. For a 
weakly 1 -complete manifold also, vanishing 
theorems of cohomology have been estab- 
lished by H. Hironaka, S. Nakano [26], and H. 
Kazama [2] (- 232 Kahler Manifolds D). 

The theory of entire functions of two vari- 
ables has been developed from a new view- 
point established by T. Nishino. An entire 
function f of two variables defines a family of 
Riemann surfaces { f= c c e C } on C^, and 
investigations of the structure of such a family 
play an important role. For instance the fol- 
lowing is proved in this way: If every irreduc- 
ible component off= c (ce C) is biholomor- 
phic to C*, then there exists an entire function 
g such that y): is a biholomorphic 

mapping (Nishino [28]). In this theorem, the 
analyticity of g is obtained from the fact that 

is a Stein manifold. 

M. Continuation of Analytic Sets 

The application of the theory of cohomology 
with coefficients in sheaves is not restricted 
to problems concerning Stein manifolds. 

Given Gq = {z\\zj\< 1, 1 ^j^n} (n>3), Gj = 
{z||zj|<£,|z^|<l,2^;':^n}, and G‘'"’ = GiU 
(Gq — {z|z 2 = ... =z^ = 0}) (3^m^n), we have 
= 0(1 ^p^m-2) (Scheja’s 
theorem [32]). Let be a coherent analytic 
sheaf over a domain G c C". If, for every point 
z of an analytic set A ^ G, = {0} or p ^ 
n ^ dim, A — 2 hd, (where hd, is the 
thomological dimension of the G^.j-module 
,^), then it follows from Scheja’s theorem that 
the mapping H^(G, — A, induced 

by the canonical injection G — v4-^ G is bijec- 
tive. This generalizes Hartogs’s continuation 
theorem for holomorphic functions, which 
corresponds to the case p = 0. 

Besides the continuation of holomorphic 
functions, we can consider the continuation of 
analytic sets. Let A be an analytic set in a 
domain G e C" and S an analytic set in G A. 

A point ze G is said to be regular (essentially 
singular) with respect to S if the closure 5 of S 
in G is (is not) analytic at z. If dim A < dim, S 
for every point z e S, then S is analytic in G. If 
S is purely d-dimensional and S is analytic at 
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a point of a d-dimensional irreducible com- 
ponent A’ of A, then S is analytic at every 
point of A’ that is not located in any other ir- 
reducible component of A. Furthermore, if 
dim A ^ dim S and S is purely d-dimensional, 
then the following hold: (1) The set E of essen- 
tial singularities of S is, if not empty, a purely 
d-dimensional analytic set in G, formed by 
irreducible components of A. (2) If every 
irreducible component of A contains points 
of E not located in any other irreducible com- 
ponent of A, then A <zE, and A is, if not 
empty, purely d-dimensional. (3) If every d- 
dimensional irreducible component of A con- 
tains points that are regular with respect to S, 
then S IS a purely d-dimensional analytic set 
in G (Thullen, Remmert, and Stein). By these 
results it is possible to give a proof for tChow’s 
theorem that every analytic set in P„(C) is 
algebraic. 

The continuation of holomorphic functions 
is related to the continuation of their graphs. 
W. Rothstein investigated the continuation of 
analytic sets to obtain the following analog 
to Hartogs’s theorem of continuity: If G = 

Gi U G 2 , Gi = {z I |z, I < 1/2, Ej =2 k/ <J }, G, = 

{z I |zi I < 1, 1/2 <S "=2 Uj\^ < 1 }, and G = 

{z I |z, I < 1, Z "=2 <1} (the envelope of 

holomorphy of G) with 3, then every purely 
[n l)-dimensional analytic set A in G extends 
to an analytic set in G; that is, there exists a 
purely (n — 1) -dimensional analytic set A in 
G such that A = A n G. K. Kasahara and H. 
Fujimoto generalized this theorem to the case 
of analytic spaces. 

N. Nevanlinna Theory for Several Complex 
Variables 

The Nevanlinna theory investigates holomor- 
phic mappings between complex manifolds. 

In function theory of one variable, for a holo- 
morphic mapping /:C^P*(C) from C into 
the 1 -dimensional tcomplex projective space, 
the famous -tPicard theorem states that if f 
omits three values, then f must be constant. 

R. Nevanlinna developed the quantitative 
theory of value distributions off: L. Ahlfors 
[1], introduced the geometric approach, and 
enunciated the principle that the negative 
curvature of the image manifold restricts a 
holomorphic mapping. In higher-dimensional 
situations, this principle has been realized in 
many cases. The Nevanlinna theory for a 
holomorphic mapping / : C-^P"(C) with re- 
spect to thyperplanes was established by Ahl- 
fors and H. Weyl and J. Weyl [34]. Holomor- 
phic mappings into projective spaces have 
been studied in detail by W. Stoll, H. Fujimoto, 
and M. Green. Following the work of S. S. 
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Chem [9] of around 1970, the equidimen- 
sional case was investigated (P. A. Grifliths, K. 
Kodaira, S. Kobayashi and T. Ochiai). J- A. 
Carlson and Grifliths [6] succeeded in obtain- 
ing a tdefect relation for equidimensional 
holomorphic mappings and thypersurfaces of 
the image manifold. This yields the following 
Picard-type theorem (after the formulation by 
F. Sakai, Inventiones Math., 16 (1974)): Let X 
be a tprojective algebraic manifold of dimen- 
sion n and f: a holomorphic mapping. 

Let D be a hypersurface of X. We take a 
tdesingularization u ; X*-i-X so that the in- 
verse image D* = (D) has at most normal 

crossings as singularities. We denote by /C* a 
tcanonical divisor of X*. We assume that 

dim 

lim sup ;; > 0. 

m~>oo 

If / omits D, then the Jacobian off vanishes 
everywhere. As a corollary, it follows that the 
tuniversal covering manifold of X -D cannot 
be C". When X = P”(C) and /) is a nonsingular 
hypersurface of degree d, the above assump- 
tion is satisfied if d ^ n -i- 2. In the nonequidi- 
mensional case, Ochiai [29] (with supple- 
mentary works by M. Green, Y. Kawamata, 
and P. Wong) verified the following asser- 
tion, which was first stated by A. Bloch with 
rough arguments: Let X be a projective alge- 
braic manifold of dimension n, and suppose 
dim H^{X, > n; then the image of a holo- 

morphic mapping f'.C^X is contained in a 
proper subvariety of X (-^ 124 Distributions 
of Values of Functions of a Complex Variable; 
272 Meromorphic Functions). 



0. Hyperbolic Manifolds 

Every complex analytic space X has two in- 
variant tpseudodistances: the Caratheodory 
pseudodistance Cx and the Kobayashi pseudo- 
distance dx, both of which generalize the Poin- 
care distance p of the unit disk D = { |z| < 1}. 
These pseudodistances can be defined by the 
property that dx is the largest pseudodistance 
among all pseudodistances <5^ on X for which 
all holomorphic mappings (X, dx)-^{D, p) are 
distance-decreasing, while Cx is the smallest 
among all pseudodistances dx for which all 
holomorphic mappings (D, p)^(X, 6,) are 
distance-decreasing. Then Cx^ d,. If Y is an- 
other complex analytic space, every holomor- 
phic mapping /: X Y is distance-decreasing 
with respect to either its Caratheodory or 
Kobayashi pseudodistance. This may be con- 
sidered to be a generalization of the Schwarz- 
Pick lemma. (The ISchwarz lemma is often 



referred to by this name, since G. Pick also 
investigated the distance-decreasing property.) 
The Kobayashi pseudodistance dx can also 
be obtained by integrating an infinitesimal 
differential pseudometric called the Koba- 
yashi pseudometric in the same manner as the 
Riemannian distance is obtained from the 
Riemannian metric (H. L. Royden). 

If dx is a (complete) distance on X, then X is 
said to be (complete) hyperbolic. A Riemann 
surface is hyperbolic in this sense if and only 
if its universal covering is biholomorphic to 
the unit disk. If X is open in Y and if, for 
every pair of sequences of points { p„} and 
in X converging to distinct points of X, 
hm dx{p„, q„) is positive, then X is said to be 
hyperbolically embedded in Y. The Riemann 
sphere CU { oo } minus three points is not 
only complete hyperbolic but also hyperboli- 
cally embedded in the Riemann sphere. Every 
holomorphic mapping of C into a hyperbolic 
complex analytic space is constant, while every 
holomorphic mapping of the punctured disk 
D* = {0<|z|< 1} into a hyperbolically embed- 
ded space X cz Y extends to a holomorphic 
mapping of D into Y. Thus the classical little 
Picard theorem reduces to the statement that 
C — {0, 1} is hyperbolic, while the great Picard 
theorem reduces to showing that C — {0, 1} is 
hyperbolically embedded in the Riemann 
sphere. These classical theorems can be gen- 
eralized in two ways. If M is a tsymmetric 
bounded domain and F is a discrete arithmetic 
group acting freely on M, then M/F is not 
only complete hyperbolic but also hyperboli- 
cally embedded in its Satake compactification 
(Kobayashi and Ochiai). If X is the comple- 
ment of 2n -|- I hyperplanes in general position 
in the complex projective space P”(C), then X 
is complete hyperbolic and hyperbolically 
embedded in P”(C) (a restatement of a re- 
sult going back to E. Borel, A. Bloch, and H. 
Cartan). 

Although there are some noncompact pon- 
hyperbolic complex manifolds X for which 
every holomorphic mapping y; C->X is con- 
stant, a compact complex manifold X is hyper- 
bolic if and only if every holomorphic map- 
ping f:C-^X is constant (R. Brody). 

Eor the tTeichmiiller space X = T^ of com- 
pact Riemann surfaces of genus g, the Koba- 
yashi distance dx agrees with the Teichmiiller 
distance which had been introduced before 
the complex structure of was defined (H. E. 
Royden). 

In the study of pseudoconvex domains, the 
IBergman metric, the Caratheodory distance, 
and the Kobayashi distance serve as useful 
tools. Their behavior at the boundary has been 
studied extensively. 
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Given an n-dimensional complex mani- 
fold X, an invariant measure 'Fx can be de- 
fined as the largest measure such that every 
holomorphic mapping / from the polydisk 
D” with invariant measure into X is measure- 
decreasing. For an algebraic manifold of gen- 
eral type, this measure is everywhere positive. 

P. Bounded Domains in C" 

For any bounded domain in C” there is natu- 
rally assigned a tKahler metric called the 
Bergman metric. Using the invariance of this 
metric, E. Cartan proved that all the Her- 
mitian symmetric spaces of noncompact type 
are realized as bounded domains. In view of 
the fact that some important period matrix 
domains (e.g., the 19-dimensional bounded 
symmetric domain of type IV, the Siegel upper 
space, etc.) are of this type, it is obviously of 
great significance in algebraic geometry to 
study discontinuous subgroups of the auto- 
morphism groups of such domains (Pyatetskii- 
Shapiro). 

On the other hand, it is almost impossible 
for general bounded domains to determine the 
explicit form of their Bergman metrics. But 
Cartan also investigated strongly pseudo- 
convex real hypersurfaces in C^, and he solved 
completely the (local) equivalence problem for 
them, introducing a definite type of tCartan 
connections over them in a functorial way. 
Thereafter N. Tanaka (and for hypersurfaces 
also Chem and J. Moser) generalized this 
result to all pseudoconvex real submanifolds 
in higher-dimensional complex manifolds. 

One can further apply this result to the equi- 
valence problem of bounded domains with 
smooth strongly pseudoconvex boundaries, 
for C. Fefferman [10] proved by analyzing the 
boundary behavior of the Bergman metric that 
every biholomorphic mapping between two 
such domains is extended smoothly up to their 
boundaries (I. Naruki [27] gave an alternative 
proof). 

Q . H istory 

In connection with tAbelian functions, ana- 
lytic functions of several complex variables 
have been studied sporadically since the time 
of Riemann and Weierstrass (H. Poincare, 
Cousin). A series of investigations by Hartogs 
([17] (1906), etc.) that revealed the distinctive 
properties of several complex variables ini- 
tiated a new epoch in complex analysis. Levi 
(1910- 1911) generalized Hartogs ’s results to 
the case of meromorphic functions, introduced 
the notion of pseudoconvexity, and proposed 
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the so-called Levi problem. After a lapse of 
time, many contributions to this new area of 
complex analysis have been made since 1920. 
The study by K. Reinhardt (1921) of analytic 
automorphisms was further developed by C. 
Caratheodory and Behnke. The tkernel func- 
tion introduced by S. Bochner and S. Berg- 
mann (1922) produced many remarkable 
results. In contrast with t Picard’s theorem 
in one variable, P. Fatou found a holomor- 
phic mapping /: with nonvanishing 

Jacobian such that the image /(C^) has an 
exterior point. 

The theory of analytic functions of several 
complex variables has flourished since 1926. 
Behnke and Thullen in Munster, together with 
G. Julia and H. Cartan in Paris, were the most 
active investigators. The results on Inormal 
families of analytic functions of several com- 
plex variables (Julia [20], 1926), the uniqueness 
theorem of holomorphic mappings (H. Cartan, 
1930), and a characterization of a domain of 
holomorphy by holomorphic convexity (Car- 
tan and Thullen, 1932) are their most remark- 
able achievements. Behnke and Thullen [3] 
systematized the results obtained since the 
discovery of the theory by providing a com- 
plete bibliography of articles up to 1934. 

The three major unsolved problems at that 
time-those of Cousin, Levi, and the approx- 
imation of holomorphic functions -were 
intensively studied by Oka from 1936, who has 
given complete solutions [30]. The investiga- 
tion of ideals of holomorphic functions by H. 
Cartan (1944), together with that of ideals with 
undetermined domains by Oka, has developed 
into the theory of coherent analytic sheaves. 
The notion of analytic spaces, first introduced 
by Behnke and Stein (1951), extended the field 
of investigation in the theory of analytic func- 
tions of several complex variables. The theory 
of cohomology with coefficients in sheaves has 
been effectively applied by H. Cartan and 
Serre (1951-1952). The introduction of the 
notion of Stein manifolds (195 1) came at the 
same time. Grauert’s deep investigations since 
1955, together with those of Stein and Rem- 
mert, have contributed greatly to the develop- 
ment of the theory of analytic spaces. In the 
1960s, active investigations took place also in 
the United States [16]. The theory of auto- 
morphic functions of several complex variables 
has been developed by C. L. Siegel, I. Satake, 
and others in connection with the theory of 
numbers. Entire functions of two variables 
have been investigated from a new point of 
view by Nishino and others since 1968 (-^ last 
paragraph of Section N). The notion of a 
hyperbolic manifold introduced by Kobayashi 
(1967) enables us to obtain many results on 
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complex manifolds by means of the methods of 
differential geometry. The theory of automor- 
phisms of bounded pseudoconvex domains 
was developed extensively in the 1970s by 
Fefferman, Naruki, and others. The theory of 
an envelope of holomorphy has also been 
successfully applied to the theory of elemen- 
tary particles in physics. 
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Analytic Sets 

A. General Remarks 

The notion of analytic sets was first defined by 
N. N. Luzin and M. Ya. Suslin in 1916, and 
it was extended to that of projective sets by 
operations such as complementation and pro- 
jection (Luzin, 1924). Most mathematicians, 
including Luzin and W. Sierpinski, who 
worked in this field, were in agreement with 
tprench empiricism (or tsemi-intuitionism), 
which defended the standpoint of R. Baire, 

E. Borel, H. Lebesgue, and others. An object is 
said to be effectively given if it can be uniquely, 
individually, and unambiguously determined 
in finite terms so that anyone can reach the 
same object by following the defining pro- 
cedure. Semi-intuitionists claim that only 
effectively given objects have mathematical 
existence, and they do not recognize as a 
mathematical object something that needs the 
axiom of choice for its definition. From this 
point of view, '•'Borel sets were “well-defined" 
sets to which classical analysis had to be re- 
stricted. Thus the question was raised whether 
it is possible to extend the class of Borel sets to 
a wider class of sets with the same certainty. 
Lebesgue {J. M ath. Pures Appi, 1 (1905)) de- 
fined a function not belonging to any class 
of Baire functions by using the totality of 
'lordinals of the second class. (Later, this 
method was systematically developed as the 
Theory of sieves by Luzin.) However, it did 
not satisfy Borel as being effective. Can we, 
then, extend the Borel sets without any use of 
ordinals of the second class? The discovery of 
analytic sets gave an affirmative answer. 

In this article (except in Section 1), we treat a 
Space (denoted by X, Y, . ) that is thomeo- 
morphic to a tcomplete ^separable ^metric 
space and its subspace. Denote by 91 the Space 
of irrational numbers (a metric space consis- 
ting of the irrational numbers e R with the 
metric \x — y| of x and y). The following prop- 
erties of a subset S of a space X are equivalent: 
(i) 5 is a continuous image of 91; (ii) S is a con- 
tinuous image of a Bore! set in X; (iii) S is the 
projection of a closed set in a product space 
X X 91; (iv) 5 is the projection of a Borel set in 
X X Y. We call a set satisfying one of these 
properties an analytic set, an A set, or a Z"! set 
(in X). The complement of an analytic set is 
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called a complementary analytic set (or simply 
coanalytic set), a CA set, or a /7j set. 

B. The Operation A and Sieves 

When to each (rij , , n,j) of finite sequences of 
natural numbers there corresponds a unique 
element E(n„ , of a family F of sets, this 
correspondence {E(n„ . . , n,)} is called a 
schema of Suslin (or system of Suslin) consist- 
ing of sets in F. Denoting an infinite sequence 
of natural numbers by {«,}, the set given by 
Pjfc£(n,, ,n^) is called the kernel of a 
system of Suslin, and the operation of taking 
the kernel is called the operation A (analytic 
operation). 

Eet Q be the set of all rational numbers 
between 0 and 1 and f be a family of sets. 

Take a family (Cj,,eQ of sets belonging to F 
with the index set Q (or more geometrically, a 
subset C= X (r) of X X Q when F is a 

family of subsets of a space X), and call it a 
sieve consisting of sets in F. Denoting by a 
(strictly) monotone decreasing sequence of 
elements of Q, we call the set j 
(namely, the set of all x such that 0^^ — 

{r (x, r)eC} is not well-ordered by the 
order ^ of rational numbers) the set obtained 
by a sieve C or the sieved set obtained by C. If 
the family F is closed with respect to countable 
intersection, then the family of all sets ob- 
tained by sieves consisting of sets in F is ident- 
ical to the family of all sets obtained by apply- 
ing the operation A to F. When f consists of 
all the closed sets in a given space, this is the 
family of all analytic sets. In particular, it is 
sufficient to take the family of closed intervals 
as F when X is the space of real numbers. 

Note that we can define sieves and sieved sets 
more generally by using the space of real num- 
bers R instead of the set Q of rationals. 

C. Properties of Analytic Sets 

ft is evident from the definition that every 
^Borel set is analytic. If a Borel set is uncount- 
able, then it is the union of a countable set 
and a one-to-one continuous image of 9i. The 
analyticity of sets is invariant under countable 
unions, intersections, and Cartesian products 
and the operation A and +Borel-measurable 
transformations. An uncountable analytic set 
contains a tperfect subset (Suslin). Therefore 
the possible cardinality of an analytic set is at 
most countable or is that of the continuum. 
Every analytic set enjoys the ^Baire property, 
and in Euclidean space every analytic set is 
■^Lebesgue measurable (Fuzin, Sierpinski). If a 
set E in the Euclidean plane is analytic (coana- 
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lytic), then 1(E) is also analytic (coanalytic), 
where 1(E) is the set of all x such that the 
section of E that is parallel to the y-axis 
has a positive measure (M. Kondo and T. 
Tugue). Concerning the Baire property, the 
similar result for a set E of X x Y is obtained 
by replacing “to have a positive measure” by 
“to be the tsecond category (nonmeager) set” 
(A. S. Kechris). The Eebesgue measure of an 
analytic set is effectively calculable (in the 
sense that the measure of a E,’ (or E"/)®)) set is 
a (resp. E’J(a)) real number (- Section G 
and 356 Recursive Functions) (H. Tanaka). 
Every analytic (coanalytic) set E can be de- 
composed into Borel sets. This decompo- 
sition is called a decomposition of E into 
constituents. An analytic (coanalytic) set is a 
Borel set if and only if it is decomposable into 
a countable number of constituents (Euzin, 
Sierpinski). In a space with the cardinality of 
the continuum, there exist analytic sets that 
are not Borelian. Eor example, in the space 
C( [0, 1]) of continuous functions on the inter- 
val [0, 1] 168 Function Spaces) the set of 

all differentiable functions is coanalytic but not 
Borelian (S. Mazurkiewicz). 

The following theorems are especially im- 
portant in analytic set theory. Luzin’s first 
principle (the first separation theorem): Eor 
every pair of disjoint analytic sets A„ A„ there 
exists a Borel set B such that A, a B and 
Bll A, = 0. An immediate corollary of 
Luzin’s first principle is Suslin’s theorem: If 
both A and X — A are analytic,’ then A is a 
Borel set. Luzin’s second principle (the second 
separation theorem): Eor every pair of analytic 
sets A and B, there exist complementary analy- 
tic sets C and D such that A — J5 cz C, B-Ac 
D, and Cf]D = 0.A one-to-one continu- 
ous image of a Borel set is Borelian (Suslin). 
More generally, for a given B -measurable 
function f defined on a Borel set B, the set A 
(c/(6)) of all points y whose inverse images 
(y) are singletons is a complementary 
analytic set (Luzin’s unicity theorem). In this 
theorem, we can replace “a singleton” by “a 
■'■(T-compact set” (V. Ya. Arsenin, K. Kunugui). 
Therefore, if a set is the image of a Borel set by 
a continuous function such that the inverse 
image of each point is a o-compact set, then it 
is a Borel set. 



D. Generalization to Projective Sets 

A projective set of class n is inductively defined 
as follows: (i) the Borel sets are the projective 
sets of class 0; (ii) the projective sets of class 
2n -H 1 are the continuous images of the sets of 
class 2n; (hi) the projective sets of class 2n are 
the complements of the sets of class 2n — 1 . 



The projective sets of class 1 are exactly the 
analytic sets, and those of class 2 are the com- 
plementary analytic sets. The following are 
fundamental properties of projective sets. 
Denote by L„ the family of the projective sets 
of class n. Then (1) L 2 „ c and l 2 „+i c 
^ 2 n+ 2 +k (^- F2, . ..). (2) the property of 
being a set of class fl is invariant under count- 
able unions, intersections, and Cartesian 
products and homeomorphisms; (3) a contin- 
uous image of a projective set of class 2n -\-l 
is of the same class; (4) the projection on X of 
a set of class 2n -I- 1 in X X Y is a set of class 
2n -I- 1 in X; (5) the family of the projective sets 
of class 2n -f 1 in a space X is the family of the 
projections of all sets of class 2n in X x X (or 
X X )ll); (6) the kernel of a system of Suslin 
consisting of sets of class d is a projective set 
of the same class, where n ^ 0 , 2. 

We frequently call a projective set of class 
2n 1 a set or a set, and that of class 
a C„ set or a /7„‘ set. A B„ set or a A„^ set is a set 
that is both and C„. The respective families 
of these sets are also denoted by P^ (i'n^), 

C„ (/7„*), and B„ (A^), respectively. In general, for 
a family of sets in a space X, we denote by 
C the family of the complements X-E of all 
sets E in 5 . We write Sepi(5) and Sepu(5) for 
the propositions obtained by substituting “set 
in “set in ^ and in C^,” and “set of C^” 
for “analytic set,” “Borel set,” and “coanalytic 
set,” respectively, in Euzin’ s first principle and 
Euzin’s second principle, respectively. Fur- 
thermore, we say that the reduction principle 
holds for and denote it by Red(j^), when for 
any sets A, Be there exist A„ such 

that Aj c A, c B, A, U = A U S, and 
A, n B] = 0. C. Kuratowski introduced the 
latter principle Red(iV) which implies Sep,(C^) 
and Sepji(Ci^), and proved Red(/7(). Classi- 
cally, reduction or separation principles for the 
projective sets of higher classes were not 
settled except for Sep,)//^), Sep„(/7j) (P. S. 
Novikov), and RediiT^) (Kuratowski). Now- 
adays, it is known that these principles for 
{ 2 '„',/ 70„^3 are undecidable from the axiom- 
atic set theory ZEC. If we assume t(he axiom 
of constructibility V — L, then Red(iT„^) holds 
for n^3 (J. W. Addison). On the other hand, 
under the assumption of ^projective deter- 
minacy PD Section H), it follows that 
Red(77„^) (Red(X„^)) holds when n is odd (even) 
(A. Martin, Addison and Y. N. Moschovakis). 

E. Universal Sets 

A set (7 in sji x X is called the universal set 
for the projective sets of class d in X if for any 
projective set P of class n in X, there exists 
such that P = {xl(zo,x)e(/}. Concern- 
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ing universal sets, we have the following result: 
for every n>0, there exists a universal set for 
the projective sets of class n in X that is of the 
same class in 91 x X. Hence in a space with 
the cardinality of the continuum, there exists a 
projective set of class n + 1 which is not of 
class n. In general, if a class g of sets is closed 
under taking a universal set and continuous 
preimages, then and cannot both have 
the reduction property. Therefore Red(/7„^) or 
Red(2'„^) fails for each n ^ 1. 

Any two universal sets of analytic sets are 
IBorel isomorphic. On the other hand, an 
analytic set E is not Borel isomorphic to any 
universal analytic set if the complement of E is 
an uncountable set without a perfect subset (A. 
Maitra and C. Ryll-Nardzewski). Hence, there 
are at least two equivalence classes of uncount- 
able analytic sets with respect to the Borel 
isomorphism = under the assumption of V= 

L, since the latter assumption implies the 
existence of such a set E (- Section H). 

F. Uniformization Principle 

The uniformization problem arose during 
investigations of implicit functions. For a set E 
in a space X x Y, uniformization of £ is the 
finding of a subset V of E such that 

'ixmx,y)GE)o3ly{{x,y)6V)), 

where 3!y is the tquantifier which means 
“there exists exactly one y.” A Borel set can be 
uniformized by choosing a suitable coanalytic 
set (Luzin). Rondo’s uniformization theorem 
(Japan. J. Math., 15( 1938)) is the most impor- 
tant result in descriptive set theory: every 
coanalytic set is uniformizable by a coanalytic 
set. As a corollary to this, any set is uni- 
formizable by a .^2 set, and every Z'l set is 
obtained as a one-to-one continuous image 
of a coanalytic set. 

Rondo’s original proof was very difficult to 
understand. Since ■''eflFective descriptive set 
theory was introduced by Kleene and Ad- 
dison, the proof has been simplified by Ad- 
dison, and a more elegant one has been given 
by J. R. Shoenfield [7, p. 1 88]. Nowadays, the 
theorem is also called the Novikov-Kondo- 
Addison theorem, and is one of the most 
powerful and fundamental theorems not only 
in descriptive set theory, but also in the tfoun- 
dations of mathematics. 

The uniformization of an analytic set is, in 
general, not to be found among analytic or 
coanalytic sets. There was a conjecture that 
any analytic set is uniformizable by specifying 
an A, set (difference of two analytic sets). 
Recently, this conjecture was negatively settled 
(J. Steel, Martin). Assuming that V= L, the 
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uniformization of a (n ^3) set is determined 
by specifying a set, and that of a /7„’ (n ^ 2) 
set by specifying a /7„p set. On the other hand, 
if an axiom system of set theory (e.g,, ZE; 33 
Axiomatic Set Theory) is consistent, then it is 
still consistent even if we add to it the follow- 
ing proposition: There exists a set whose 
uniformization is impossible by any choice of a 
definable set in the system (P. J. Cohen, A. 
Eevy). However, if we assume the existence of 
a ^measurable cardinal (MC), it is known that 
every Fl\ set is uniformizable by a /7] set 
(Martin and R. M. Solovay). On the other 
hand, PD implies “every /7] (iC]) set is uni- 
formizable by a /7] (Z]) set for each odd 
(resp. even) n' (Moschovakis). 

There are sufficient conditions on sections 
of a Borel set E in the Cartesian product 
X X Y for E to be uniformizable by the Borel 
set as follows: all sections are (i) countable 
(Euzin, Novikov), (ii) a-compact (Arsenin, 
Kunugui); or, as “large section property,” (iii) 
in the second category (S. K. Thomason, P. G. 
Hinman), or (iv) of positive measure (Tanaka, 
G. E. Sacks). Eor applications of descriptive 
set theory to analysis, an important uniformi- 
zation result is von Neumann’s selection 
theorem: for any A set E c X x Y, there exists 
a tBaire measurable and labsolutely measur- 
able function f: X-* Y (- 270 Measure 
Theory E (vi)) such that 

Vx(3y ((x, y) 6 £) ^ (x, f(x))e E). 

Concerning implicit functions, any Borel set 
E c X X Y such that all sections £0) are at 
most countable is expressed by a union of 
intersections of E with graphs of some Baire 
functions f„:X ->Y,neN (Euzin). Recently, it 
was shown known as a generalization of this 
that any Borel set £c: A x Y such that all 
sections are a-compact is a union of countably 
many Borel sets E^ for each of which all sec- 
tions are compact (J. Saint-Raymond). 



G. Kleene’s Hierarchy and Effectiveness 

Eirst, projective set theory in any space is 
reducible to the theory in the space of irra- 
tional numbers. Second, if we introduce a 
%eak topology in the set N”* = "cu of 1-number- 
theoretic functions (X with one argument, the 
resulting topological space N"” is homeomor- 
phic to the tBaire zero-dimensional space 
of irrational numbers. Third, any subset B of 
N’’” is open and closed in this topology if and 
only if there exist a function ^(gN'’') and a 
predicate /!%) that is '‘'general recursive in <; 
such that aeB<^ A^{Y). Eourth, logical opera- 
tions such as 1, V , A, 3x (where x is a vari- 
able ranging over the natural numbers), and 3a 
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exactly correspond to the operations (on sets) 
complementation, union, intersection, count- 
able union, and projection, respectively. On 
the basis of these facts, projective set theory is 
regarded as the theory of the tanalytic hier- 
archy of Kleene. Here, the following example 
is remarkable: We can construct a 2" J set which 
is universal for the analytic sets (namely, a 
2j [N”] set) in the space of irrational num- 
bers 356 Recursive Functions H). 

The connection between projective set 
theory and logic has been discussed by C. 
Kuratowski and A. Tarski. From their point of 
view, semi-intuitionists such as Borel regard 
the set of natural numbers to be precisely clear 
in itself and also the continuum to be immedi- 
ately recognizable by our geometric intuition. 

In their argument rational numbers do not 
play such an important role. They take, a 
priori, the set of irrational numbers as the 
fundamental domain, and intervals with ra- 
tional extremities as the simplest sets of points 
among the subsets of the fundamental domain 
as the starting point of their argument. Here, 
the fundamental domain or each interval is 
not conceived as a totality of its elements, but 
recognized as a “uniform extent.” In contrast 
to this, singletons and individual irrational 
numbers are not so simple. For this reason 
Borel introduced the notion of calculable 
numbers to study definable real numbers. 
Following Luzin, we say that a calculable 
number is a constructible real number in the 
sense that we can give it by an arithmetical 
approximation as precisely as we want. Now, 
this notion is nearly identical to the notion of 
an effectively calculable real number given by 
A. Church or A. M. Turing. 

In the mathematics of the semi-intuitionists, 
the word “effective” has played an especially 
important rolc. Although these mathema- 
ticians have always agreed not to accept the 
■•^axiom of choice, the exact meaning of “effec- 
tive” has differed slightly among different 
members of their group or in different stages 
of the development of the theory. Such dif- 
ferences mainly arose in connection with the 
question: How can we tell whether given en- 
tities are finitary or individual? One way to 
guess the original intention held by Borel and 
others when they used the term “effective” is to 
replace the term by “recursive.” Nowadays, the 
concept of “effectiveness” is used in this sense 
356 Recursive Functions C), and all clas- 
sical results in descriptive set theory essentially 
have effective versions (or refinements via 
relativization). For example, the Novikov- 
Kondo-Addison theorem is described as fol- 
lows; Any n\ [n\ (a) for an ae"m) set is uni- 
formizable by a U\ (resp. f7j(a)) set. Similarly, 
Suslin’s theorem is that every analytic and 



coanalytic (i.e., A { (a)) set A is a Borel set of 
class v(x) which is effective relative to the 
“definability a” of given A, where v(a) is the 
-i-Constructive ordinal relative to a (Kleene, 
Tugue and Tanaka, A. Louveau, etc.). 



H. Further Results in Axiomatic Set Theory 
and Strong Axioms 

The recent development of axiomatic set 
theory has yielded the following propositions. 
Under the assumption that V= L (K. Godel, 
Novikov, Addison): (1) there exists an uncount- 
able U\ set that does not contain any perfect 
subset; (2) there exists a nonmeasurable A\ 
set; (3) there exists a ^2 set that does not have 
the Baire property. On the other hand, if the 
axioms of set theory ZFC plus “there exists an 
inaccessible cardinal number” are consistent, 
then so is “every projective set is Lebesgue 
measurable, has the Baire property, and has a 
perfect subset when it is uncountable” with 
them (Solovay). Concerning these properties, 
tMartin’s axiom MA and Nj” (MA -f 
n CH is consistent with ZFC; -v 33 Axiom- 
atic Set Theory E) implies that every 2^^ is 
measurable, and has the Baire property 
(Martin and Solovay). The possible cardinality 
of a 2] set is at most N, or that of the continu- 
um, which is implied from “every 2] set is a 
union of Nj Borel sets” (Sierpinski). It is 
known that one cannot prove or refute from 
ZFC the converse of the latter statement. If 
one assumes MC (the existence of a measur- 
able cardinal), then every 2j set is a union of 
^<2 Borel sets. Therefore MC implies that the 
possible cardinality of a 2j set is at most N 2 
or that of the continuum. 

With each set A c “cu we associate the 
following infinite game of perfect information 
G(A), played by two players I and II. First 
player I chooses «o e then player II chooses 
e m, then I choOSes «2 e and so on. The 
game ends after co steps. Let r(k) = if xe A, 
then I wins G(A); otherwise II wins. A strategy 
(for I or II) is a rule that tells the player what 
move to make, depending on the previous 
moves of both players. A winning strategy is 
one such that the player who follows it always 
wins. The game G(A) is determined if one of 
the players has a winning strategy. The axiom 
of projective determlnacy PD is the assertion 
that for every projective set A( c “o>) the game 
G(A) is determined. PD is widely used to solve 
problems on projective sets that are not decid- 
able from ZFC, and is a plausible hypothesis 
that does not seem to contradict the axiom of 
choice at present; and it has pleasing conse- 
quences in descriptive set theory. For n 3 3, 
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reductions and uniformization results of 
(/7„‘) sets mentioned earlier are such examples. 
Furthermore, PD implies that every projective 
set is Lebesgue measurable, has the Baire 
property, and contains a perfect subset if it is 
uncountable. 

Now, write Determinacy (2’j) (Determinacy 
(A^)), or simply, Det{i7„^) (Det(A„^)), when G(A) 
is determined for every (A^) set A Using 
the fact that the open game is determined, 
namely Det(iT“) (D. Gale and F. M. Stewart), 
D. Blackwell has given a short proof of 
SepiiZ"!). The extremely difficult problem was 
to show Det(Aj) in ZF (indeed, one cannot 
prove Det{A{) only from the Zermelo axioms 
z; 33 Axiomatic Set Theory B). In 1975, 
Martin solved this problem. Concerning , 
Det(2’j) cannot be proved from only the ZF 
axioms, but MC implies Det(2^}) (Martin). 
However, it is known that Det(X'j) cannot be 
proved even if we assume MC. 

By a pre-well-ordering, we mean a relation 

having all the properties of a well-ordering 
except for tantisymmetry. Denote by 6^ the 
least ordinal that cannot be the length of a 
preewell-ordering on “co belonging to A^ . It 
is a classical result that 6 } =o>j (= K,). The 
following results are mainly due to Martin: 

^^ 2 , MC ^ ^ N 3 , and also, PD ^ 

Si ^^ 4 . Let K be an infinite cardinal. A set 
A c X is called /c-SusIin when there exists a 
closed set Cc Ax “k ("fc-{/j/:m^K}) such 
that 

X e A <=> 3/[/€ ‘V A (x, /) e C]. 

A is No-Suslin iff AeZ} If A is a Zj set, then 
A is Nj-Suslin (Shoenfield). Under the assump- 
tions of MC, if A is a Z\ set, then it is K 2 - 
Suslin (Martin). Furthermore, Det(Ai„) implies 
the following facts (Kechris, Moschovakis): 
every Z ^„+2 set is K-Suslin, where k is the 
cardinal of Si„+i ; every set in K-Suslin 
for some k < 8 i„+i 

The axiom of determinacy AD is the asser- 
tion that the game G(A) is determined for 
every set A. AD implies that every set is 
measurable (J. Mycielski and S. Swierczkow- 
ski), has the Baire property, and has a perfect 
subset if it is uncountable (Morton Davis). 
Though AD contradicts the axiom of choice, 
some consequences of AD in the area of pro- 
jective sets are more desirable than the conse- 
quences of the axiom of choice. The following 
are examples of consequences of AD. (1) A is a 
Z 2 set iff A is a union of K, Borel sets (Mos- 
chovakis). (2) For each n, 8 ’ is a cardinal 
(Moschovakis). (3) Generalization of Suslin’s 
theorem holds as follows: = Bgi (= the 

smallest ^Boolean algebra containing the open 
sets and closed under complementation and 
unions of length < 8 ^) (Martin, Moschovakis). 
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I. Polish Spaces, Luzin Spaces, and Suslin 
Spaces 

A topological space homeomorphic to a com- 
plete separable metric space is called a Polish 
space. A subspace £ of a Polish space X is 
Polish if and only if it is a G^-Subset of X, i.e., a 
countable intersection of open subsets of X 
(Aleksandrov and Uryson). A Hausdorff top- 
ological space X is called a Luzin space (resp. 
Suslin space) if we can find a Polish space S 
and a continuous bijective (resp. surjective) 
mapping j : S->X. Every Polish space is a 
Luzin space and every Luzin space is a Suslin 
space. 

Let X be a Hausdorff topological space. A 
subset £ of X is called a standard set (resp. 
analytic set) if the set E with the relative top- 
ology is a Luzin (resp. Suslin) space. The 
analytic subsets are closed under analytic 
operations. Every analytic set is obtained from 
closed sets by applying the analytic operation. 
Every analytic set is tuniversally measurable. 
Every standard set is a Borel set. Every count- 
able union or intersection of standard sets is 
standard. 

A subset £ of a Suslin space is a Borel set if 
and only if both E and E'^ are analytic. A 
subset £ of a Luzin space is a Borel set if and 
only if £ is standard. 

Let / be a Borel measurable mapping from 
a Suslin space X into another Suslin space Y. 
Then the image f{X) is an analytic subset of Y. 
Lurthermore, if f is injective, then f gives a 
^ Borel isomorphism between X and f{X). If 
both X and Y are Luzin spaces and if f is 
injective, then f(X) is a Borel subset of Y. 

Every Suslin space is ^Borel isomorphic to an 
analytic subset of R, and every Luzin space is 
Borel isomorphic to one of the following 
spaces:(l)N„ = {l,2,...,n},(2)N = {l,2,...}, 
(3) R. 

The selection theorem due to von Neumann 
(- Section L) holds when X and Y are Suslin 
spaces. This fact and its ramifications [ 15,161 
are useful in nonlinear functional analysis and 
control theory. 

Practically all useful spaces appearing in 
functional analysis and probability theory are 
Polish or Luzin Spaces. Examples: (i) Every 
locally compact Hausdorff space with a count- 
able Open base is Polish, (ii) Every separable 
Banach space is Polish, (iii) The set C of all 
continuous functions on [ 0 , i] with the ttop- 
ology of uniform convergence is a Polish 
space, (iv) The set D of all right continuous 
functions on [0, 1] with left limits is a Polish 
space when it is endowed with the Skorokhod 
topology (- 250 Limit Theorems in Proba- 
bility Theory E). (v) The space (5^' of distri- 
butions and the space ,9" of tempered distri- 
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butions (- 125 Distributions and Hyperfunc- 
tions) are Luzin spaces. 
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Analytic Spaces 

A. General Remarks 

An ^analytic function of a complex variable 
has as its natural domain of detinition a -i-Rie- 



mann surface, i.e., a 1 -dimensional complex 
lanalytic manifold. In the case of several com- 
plex variables, the set of zeros of an analytic 
function, the quotient space of a domain by 
a tproperly discontinuous group of analytic 
automorphisms, the existence domain of 
^algebroidal function, etc., are, strictly speak- 
ing, not necessarily complex analytic mani- 
folds. It is necessary to consider a more gen- 
eral category of complex analytic manifolds 
with singularities, and the notion of analytic 
spaces is drawn from these examples. Many of 
the properties of complex analytic manifolds 
are extended tO analytic spaces; on the other 
hand, theories specific to analytic spaces have 
also been developed. 

B. Analytic Sets 

We say that a subset A of a complex analytic 
manifold G is an analytic set in Q if it is a 
closed subset and each point of A has a neigh- 
borhood U such that C/ n A is the set of com- 
mon zeros of a finite number of holomorphic 
functions in U. Specifically, if ^ is locally the 
set of zeros of a single holomorphic function 
that does not vanish identically, then A is 
called principal. Two subsets Sj and S 2 of G 
are called equivalent at e G if there exists 
a neighborhood U of z® such that fl L = 

S 2 n [/. By this equivalence relation, every 
subsets S of G defines its germ S^o at z®. A 
germ of an analytic set at z° is the germ at z® 
of an analytic set in a neighborhood of 
Each germ A, of an analytic set at z® = Oe G is 
associated with an ideal I{Aq)= {f\f sH(0), 
f\ A, =0} in the ring H(0) of tgerms of holo- 
morphic functions at 0. We call A, reducible 
if A, is the union of two germs of analytic sets 
A,’ and A,” with Aq ^ A„ Aq A,; other- 
wise, A, is called irreducible. An analytic set A 
is called irreducible at 0 if the germ at 0 of A 
is irreducible. Properties of A, and /(Aq) cor- 
respond to each other. Thus A, is irreducible 
if and only if /( A,) is prime. 

As the ring H(0) is tNoetherian, in a neigh- 
borhood of every point z° an analytic set A 
is represented as a union of a finite number 
of analytic sets A,- that are irreducible at 
These A,- are essentially unique. If an analytic 
set A is irreducible at then there exists a 
system of local coordinates (z^, . , z,) centered 
at and a pair of natural numbers d and k 
such that, in a neighborhood of z°, A is a fc. 
sheeted ramifled covering space with covering 
mapping (p:(z,, . . . . z„)-^(zi, ,z^); i.e., for an 
analytic set R in a neighborhood of 0 e C”*, <?> : 
A — R is, in a neighborhood of 
z°, a k-sheeted covering mapping, where A 
9 ) (R) is a connected d-dimensional com- 
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plex analytic manifold in the neighborhood. 

The coordinates of the k points of A —(p~^{R) 
over each point (z, , , zj are holomorphic in 
these variables. The number d is the (local) 
dimension of A at and is denoted by dim^o A 
From this local representation, we obtain 
Riickert’s zero-point theorem: Every prime 

ideal in H(0) is equal to vvith A, an 

irreducible germ of an analytic set at 0. In this 
case the dimension at 0 of an analytic set A 
that defines A, is equal to the tKrull dimen- 
sion of the tlocaliing The theory of 

local rings is very important in the study of 
germs of analytic sets. The dimension of a 
general analytic set A at z^ is defined by 
dim^oA = sup, dim^oA,-, where A = (J. A,- in a 
neighborhood of z^, vvith the Aj irreducible 
at z°. If dim^o A; is equal to d for all i, then A 
is called purely d-dimensional at z^. The 
(global) dimension of A is defined by dim A = 
sup^e^ dim, A. A purely d-dimensional ana- 
lytic set is defined to be an analytic set that is 
purely d-dimensional at every one of its points. 

A point z° of an analytic set A is called 
ordinary (regular or simple) if A has the struc- 
ture of a complex analytic submanifold in a 
neighborhood of z^. The set A’ of ordinary 
points of A is dense and open in A. The set A* 
= A -A’ of singular (not ordinary) points is an 
analytic set in G. If A is purely d-dimensional, 
then A’ is a d-dimensional complex analytic 
manifold and A* is an analytic set of dimen- 
sion 1. 

Let A be an analytic set of dimension d in G, 
and B a purely r/'-dimensional analytic set in G 
-A with d' > d. Then the closure 5 of 6 in G is 
a purely d'-dimensional analytic set (Remmert- 
Stein continuation theorem [17]). 

For every analytic set A in G, the tanalytic 
sheaf ,/(A) of germs of holomorphic functions 
over G that vanish on A is tcoherent (H. Car- 
tan). We call J'iA) the sheaf of ideals defined 
by an analytic set A. Let be the sheaf of 
germs of holomorphic functions over G. Then 
(fA = (C(;/./(A))| A is a coherent sheaf of rings 
over A. (9^ is called the sheaf of germs of holo- 
morphic functions on an analytic set A. 



C. Analytic Spaces 

A fringed space (X, 0,) with Hausdorff base 
space X is called an analytic space if for every 
point xeX, there exists an open neighborhood 
L of X such that the ringed space (L/, &y,\U)iS 
isomorphic to a ringed space (A, &^), where A 
is an analytic set in an open set G of some C”. 
The structure sheaf (9^ is then called a sheaf of 
germs of holomorphic functions. The notion of 
tholomorphic mapping from one open set in 
C" into another is generalized to the case of 
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mappings from one analytic set into another. 
An analytic set Y in an analytic space X and 
the sheaf &y of germs of holomorphic functions 
on Y are defined as in the case where X is a 
complex manifold. The ringed space (Y, 0,) is 
an analytic space and is called an analytic 
suhspace of X. For an analytic space X, the 
notions of dim, X, dim X, irreducibility, and 
pure dimensionality are defined as for an 
analytic set A c; G c C”. Every analytic space X 
is the union of a locally finite family of irreduc- 
ible analytic subspaces X, called the irreduc- 
ible components of X. 

Let : X Y be a holomorphic mapping 
of an analytic space X into another, Y. Its 
rank at xg X is defined by r^{x) = dim, X — 
dimx(p~H(pix)). The number = sup^.ji: r^(x) 
is called the rank of (p. The set of degeneracy 
of (p is the set of points xgX such that 
r^\x'{^)<^(p\x' irreducible component 
X’ of X through x. The mapping cp is non- 
degenerate if = 0. For any k eN, (x^X 
r^{x) ^kj is an analytic set (R. Remmert[ 16]). 
In particular, E^ is analytic. For a holomor- 
phic mapping :X Y, the inverse image of 
an analytic set in Y is an analytic set in X. 
However, the image of an analytic set is not 
necessarily analytic. If (p is tproper, then the 
image (p{X') of an analytic set X’ in X is an 
analytic set in Y of dimension and is 
irreducible if X’ is irreducible (Remmert’s 
theorem [ 16]). 

D. Modifications and Resolution of 
Singularities 

Let M be a subset of an analytic space X. If, 
for every point x £ X, there exists an open 
neighborhood U of x and and an analytic set 
M* in U, containing U D M SUCh that U-M* 
is dense in V, then M is called analytically 
thin. Let : Y-> Y be a holomorphic map- 
ping. Suppose that there exist two analyti- 
cally thin sets A4 c X, N c Y such that cp 
induces an isomorphism between X — M and 
Y — N. Then X is called a holomorphic modifi- 
cation of Y. If furthermore cp is proper, then X 
is called a proper modification of Y. A mon- 
oidal transformation of an analytic space X 
with respect to a coherent sheaf of ideals / is 
defined as in the case of a complex manifold or 
an algebraic variety (^ 16 Algebraic Varieties 
L; 72 Complex Manifolds H). It is a proper 
modification / : X* -^X such that the inverse 
image ideal sheaf f~^J- &x* is inyertible, and 
is universal among all proper holomorphic 
mappings h\Z-^X with the property that 
h~'.f ■ is invertible, where / * ./ is the 
^inverse image of If J is the sheaf of ideals 
defined by an analytic set Y in X, the mon- 
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oidal transformation of X with respect to / Js 
often called the blowing-up of X with center Y. 

H. Hironaka [ 14] proved that, if X is an 
analytic space which is countable at infinity 
(i.e., a countable union of compact sets), then 
there is a proper modification n: X'->X with 
X’ smooth (i.e., free from singular points). Such 
a modification is called a desingularization (or 
resolution of singularities) of X. Moreover, 
over any relatively compact open set [J of X, ji 
is the product of a finite sequence of blowing- 
ups 71,-: XI~>X-^j (Xq ~X), with smooth centers 
along which XJ i is tnormaily flat. This 
deep result enables one to derive properties of 
analytic spaces from those of complex 
manifolds. 

Let X and Y be two analytic spaces and G 
an analytic set in X x Y. If the canonical pro- 
jection n:G-^X is a holomorphic (or proper) 
modification, then we say that a meromorphic 
mapping (a proper meromorphic mapping) jj, 
of X into Y is defined. The set G is then called 
the graph of /z. A holomorphic mapping (p ; X 

Y can be viewed as a proper meromorphic 
mapping. Let fr.X-^Y he a proper meromor- 
phic mapping. Then p(x) (the projection of 
7r"^(x) into Y) is a nonempty analytic set in Y 
for every point x 6 X. Moreover, there exists 
an analytic set N, with X-N dense in X, such 
that /r maps X-N into Y holomorphic ally. 
The smallest set N with this property is called 
the set of points of indeterminacy or the sin- 
gularity set of p. A meromorphic mapping f\ X 
^P'(C) is called a meromorphic function on X 
if none of the irreducible components of X is 
mapped to {oojby f. The set /“‘ (0) = n{{X x 
{0}) n G) is called the set of zero points of X, 
and the set /~‘(go) is called the set of poles. 
These are analytic sets in X. Let /j , • , be 
meromorphic functions on X. Then, by a 
suitable proper modification of X, one can 
eliminate the points of indeterminacy of the 
meromorphic mapping /: X-^(P^(C))*‘ defined 
by x^(/i(x), . ,/|j(x)), i.e., one can modify / to 
be holomorphic. The ring of meromorphic 
functions on X is invariant under proper 
modifications of X. If X is irreducible and 
compact, then the field of meromorphic func- 
tions on X is a simple algebraic extension of 
the tfield of rational functions of k (^dim Y) 
variables. 

Let (X, 0,) be an analytic space. A point 
X 6 X is called normal for X if 0x,x is a tnormal 
local ring. The set of nonnormal points for X 
is an analytically thin analytic set in X (K. 
Oka). Every ordinary point of X is normal. 

We call X normal if every one of its points is 
normal. Every nondegenerate holomorphic 
mapping of an irreducible X into an irreduc- 
ible and normal Y is an open mapping if its 
rank is equal to the dimension of Y (Remmert). 



Eor every analytic space X, there exists a 
proper modification v.X-^X with X normal 
such that V is nondegenerate and v Y — v~‘(S) 
is an isomorphism, where S is the set of non- 
normal points. Such a proper modification of 
X is unique up to isomorphisms. We call X 
a normalization of X with normalizing map- 
ping V. 

Eet : X ^ Y be a holomorphic modifi- 
cation. Suppose that Y is normal at (p(x°) 
(x°eY). If the set (p~^{(p{x^)) contains an iso- 
lated point, then (p~^{(p(x°)) — x^, and cp is an 
isomorphism in a neighborhood of (an 
analog of tZariski’s main theorem). In partic- 
ular, if cp : Y-> Y is a holomorphic modifica- 
tion and Y is normal, then cp maps X — 
isomorphically onto the dense open set (^(Y 
E^). Eurthermore, if a holomorphic mapping 
^ : X Y is injective and X and Y are irreduc- 
ible, normal, and n-dimensional, then (p{X) 
is an open set in Y and(p~^ : (p{X)-^X is 
holomorphic. 



E. Analytic Spaces in the Sense of Behnke and 
Stein 

Eet (p : G-*Ghe a proper continuous mapping 
of a connected tlocally compact space G ontO 
a domain G C C". The triple © = (G, <p, G) is an 
analytic covering space over G if the following 
conditions are satisfied: (i) is a finite set 

for every point e G. (ii) There exists an ana- 
lytic set A cz G of dimension ^n — 1 such that 
(p G — (p~^ ('4) is a local homeomorphism and 
every point of (p~^{A) has a fundamental sys- 
tem of neighborhoods U such that both U 
and U — (p~\A) ^ 0 are connected. As © is 
unramitied over G-A, the number of points in 
(p (z°) is constant for £ G-A and is called 
the number of sheets of ©. A point z£ G is 
called a ramification point of © if the restric- 
tion of q> to any neighborhood of z is not a 
homeomorphism. Denote by B the set of 
ramification points of ©. Then (p[B) c A is an 
analytic set of dimension « — 1. Eet / be a 
continuous complex-valued function in an 
open set D in G. We call / holomorphic in D if 
for every point z® eD B and for every open 
neighborhood L of z** = <p(z°) over which is 
a homeomorphism, fo is holomorphic in 
V. Denote by Gg the sheaf of germs of holo- 
morphic functions over G. Then (G, Gg) is a 
ringed space. An analytic space in the sense of 
Behnke and Stein is a Hausdorff ringed space 
(X, 0,) that is locally isomorphic to a ringed 
space of the form (G, Gg) [3]. tRiemann’s 
theorem on removable singularities holds for 
such spaces. Every normal analytic space is an 
analytic space in the sense of Behnke and 
Stein. An analytic covering space © =(G, <p, G) 




117 



is a C-covering space (covering space in the 
sense of Cartan) if for every point z® e G there 
exist an open neighborhood V of z” and a 
holomorphic function gin (J = (p~^(V) which 
can be defined by an irreducible polynomial 
of degree k such that its coefficients are holo- 
morphic functions on V and the coefficient of 
its highest term is 1 , where k is the number of 
sheets of (5. A C-analytic space is an analytic 
space in the sense of Behnke and Stein that is 
locally isomorphic to a C-covering space. The 
category of C-analytic spaces coincides with 
that of normal analytic spaces. According to 
H. Grauert and Remmert [10], every analytic 
covering space is a C-analytic covering space. 
Therefore every analytic space in the sense of 
Behnke and Stein is a normal analytic space. 

Let R be an equivalence relation in an ana- 
lytic space X. Given a subset A of X, denote 
by K[A] the set of points of X which are R- 
equivalent to points of A. We call R proper if 
is compact for every compact set K in 
X. Let (phe a proper holomorphic mapping of 
an analytic space X into another Y. For x, 
x’e X, let X =x’(R) be defined by cp{x) = (p{x'). 
The equivalence relation R is then proper. We 
consider the quotient space X /R and the 
canonical projection p.X^XjR. With each 
open set IJ in the quotient space X /R we can 
associate the ring of holomorphic functions in 
that are constant on p“’(x) for every x 
e U. This leads to a ringed space ( Y /R, 
which is proved to be an analytic space by 
Grauert’s theorem [9]: All the ^direct images of 
a coherent analytic sheaf over X by a proper 
holomorphic mapping (p\ X-^X are coherent. 
For every proper equivalence relation R in X, 
the ringed space (X/R, (9x!R) is an analytic 
space if and only if for every point xe XjR 
there exists an open neighborhood V of x such 
that functions in F( K, G^IR) separate the 
points of V (H. Cartan). 



F. Stein Spaces 

For an analytic space (X, (P^) let us consider 
the following conditions: (i) Functions in 
I(X, separate the points of X. (ii) X is K- 
complete; i.e., for every point xeX there exist a 
finite number of /|e I(X, (i = i, , k) such 
that the holomorphic mapping f ={f.):X 
is nondegenerate at x. (in) Every compact 
analytic set in X is a finite set. Condition (i) 
implies (ii), and (ii) implies (iii). If an irreducible 
analytic space X is K-complete, then X is a 
countable union of compact sets (Grauert 
[S]). In fact, if n= dim X, there exist functions 
fieT(X,&x) (/=!,.. .,n) such that the holo- 
morphic mapping / = {f.):X -^C" is nondegen- 
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erate. The notion of holomorphic convexity 
(--21 Analytic Functions of Several Com- 
plex Variables H) is carried over to analytic 
spaces. For a holomorphically convex analytic 
space, conditions (i), (ii), and (iii) are equivalent 
(Grauer [8]). A Stein space (or holomorphi- 
cally complete space) is a holomorphically 
convex analytic space that satisfies one of the 
conditions (i), (ii), or (iii). In a holomorphically 
convex analytic space {X, &x), let R be the 
equivalence relation defined by I(X, 6,); i.e., 
for X, x’ £ X, X = x’(R) if and only if f(x) = f(x') 
for every /e F(X, Then R is proper. The 
analytic space ( Y /R, &x/R) 1® ^ Stein space. A 
Stein space is a generalization of the notion of 
a Stein manifold. Fundamental theorems A 
and B on Stein manifolds hold Verbatim for 
Stein spaces (-^21 Analytic Functions of 
Several Complex Variables L). Therefore the 
main properties of Stein manifolds are in- 
herited by Stein spaces. Let (p : X -> Y be a 
holomorphic mapping of an analytic space X 
into another, Y. If for every xeX all the con- 
nected components of the fibers (p~'^{(p(x)) are 
compact, then the equivalence relation R’ 
defined by those components (i.e., for x and x' 
in X, X = x'{R ) if and only if x and x’ belong to 
the same component of (p“*((p(x))) is proper, 
and the ringed space {X/R', Cx/R') is an ana- 
lytic space. In particular, if X is a holomor- 
phically convex irreducible analytic space 
and R is the equivalence relation defined by 
r(X, (i'x), then all the fibers of the canonical 
projection p: X Y/R are connected. 



G. Further Topics 

The notion of analytic space can he gen- 
eralized as follows (Grauert [9]). A ringed 
space (X, Ox) is a general analytic space if it is 
locally isomorphic to a ringed space (A, 
where A is an analytic set in a domain G ^ C", 
and = (&(j//) I ^ for some coherent analytic 
subsheaf / of ./(A) such that Supp((('(;/ J) 

(= |z 6 G i&G/f)z ^ 0}) = T. An analytic sub- 
space of (X, &x) is a ringed space (Y, &y) where 
Y = Supp(C/x/7^) and 0\ = for some co- 
herent sheaf of ideals / of Ox- (Y, is also a 
general analytic space. A. Douady [6] showed 
that for any general analytic space (X, Ox) 
there exists a natural structure of a general 
analytic space on the totality of all the com- 
pact analytic subspaces of (X, O'x)- The result- 
ing analytic space is called the Douady space 
of (X, O'x). For the proof, the notion of a 
Banach analytic space is used, which is ob- 
tained by first defining analytic subspaces in 
an open subset of a complex ^Banach space 
and then patching them. A Douady space can 
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be used, for example, to show that the identity 
component of the analytic automorphism 
group of a compact Kahler manifold is a 
■''complex Lie group which is naturally an 
extension of a tcomplex torus by a tUnear 
algebraic group (A. Fujiki, D. Lieberman; 

232 Kahler Manifolds C). 
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Ancient Mathematics 

A. General Remarks 

To determine the beginning of the history of 
mathematics, one must detine the term “math- 
ematics.” Only speculation based on the ob- 
servation of primitive peoples today can be 
made regarding the development of the number 
concept among prehistoric peoples. The pre- 
historic period ended in Egypt and Meso- 
potamia c. 3000 B.C., and a little later in the 
valleys of the large rivers in India and China. 

Since the basis of the civilizations in the 
river valleys of the ancient world was agricul- 
ture, the administrators first had to control 
watering systems through irrigation, drainage, 
pumping, and canalization; second, they had 
to measure land and harvests for tax collec- 
tion; and third, they had to establish a calen- 
dar by observation of the heavenly bodies. 

All these tasks demanded some knowledge 
of mathematics. 

Additional knowledge of mathematics was 
certainly needed for construction of the pal- 
aces and tombs. We know something of the 
development of mathematical knowledge 
during these ages from some recovered arti- 
facts, but there remains the possibility of new 
finds that will bring about a basic change in 
our knowledge of the history of mathematics 
during this period. 

B. Mathematics in Egypt 

The main sources for our understanding of the 
history of mathematics in Egypt are the Mos- 
cow papyrus and the more important Rhind 
papyrus, both discovered in the 19th Century. 
The Greeks place the origin of their mathe- 
matics in Egypt, but it seems that Egyptian 
mathematics was limited to practical mathe- 
matics. The Egyptians had a decimal numera- 
tion system, but the place value was not clear; 
they used fractions, which they always decom- 
posed into the sums of unit fractions (i.e., frac- 
tions with 1 as numerator); they solved the 
problems of everyday arithmetic that were 
reducible to linear equations; they computed 
approximate areas and volumes of some fig- 
ures for the purpose of measurement of farm- 
land or granaries and for construction work; 
they had exact formulas for the computation 
of areas of triangles and of trapezoids; and 
they used (16/9)'^ = 3.1605 as the value of 
7 t; but no trace has been found to prove the 
existence of demonstrative mathematics in 
ancient Egypt. 
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C. Mathematics in Mesopotamia 

Sources abound for the study of Mesopota- 
mian mathematics, and these sources may 
very well increase in the future. The Meso- 
potamians kept exact records of astronomical 
observations for long periods of time. Their 
more advanced mathematics was not limited 
to practical use, as was that of the Egyptians. 
They used a sexagesimal system of numeration 
with place value, and also used sexagesimal 
fractions; however, they lacked a cipher to 
denote zero until the 4th Century b.c., and they 
did not have a symbol corresponding to our 
decimal point, so that exact place value had to 
be determined from the context of each ex- 
pression. They had a multiplication table and 
tables of inverses, squares, and cubes of num- 
bers, and they used these tables to solve equa- 
tions, even some simple equations of the third 
degree, as well as simultaneous equations of 
the second degree for two unknowns. They 
had accurate solutions for quadratic equations 
(expressed in words); they discarded negative 
roots, but they admitted both positive roots 
when two existed. They studied integral solu- 
tions of -\-b^ = (the largest of their solu- 
tions were 12,709, 13,500, and 18,541) and 
approximate computation of quadratic roots, 
which suggests some relation to Greek mathe- 
matics. We have evidence that some of the 
geometric algebra in Euclid’s Elements can be 
traced to Mesopotamian algebra. Some his- 
torians also affirm that the concept of dem- 
onstration in Greek mathematics originated 
with the Mesopotamians, but this theory lacks 
suffcient proof. 

By the 7th Century, the Mayas in Central 
America also possessed a numeration system, 
with the base 20. As far as we know, the Meso- 
potamians and the Mayas were the earliest 
people to possess numeration systems with 
place value (- 57 Chinese Mathematics, 187 
Greek Mathematics, 209 Indian Mathematics). 
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Approximation Methods in 
Physics 

A. Introduction 

It is often possible to solve a differential equa- 
tion of mathematical physics analytically by 
expanding the solution in a series with respect 
to a small (or large) parameter involved in the 
equation. In general, such an approach is 
called a perturbation method (- Section B). 

For an equation which is difficult to treat 
analytically, it is possible to get a numerical 
solution either by replacing the derivatives by 
difference quotients (^difference method), or by 
expanding the solution in terms of suitable 
functions and determining the expansion coef- 
ficients numerically from the equation re- 
written in a weak form ('’'variational method, 
’'method of weighted residuals, '’Galerkin’s 
method, '’'finite-element method, etc.). For such 
numerical methods 46 Calculus of Vari- 
ations, 301 Numerical Solution of Algebraic 
Equations, 302 Numerical Solution of Finear 
Equations, 303 Numerical Solution of Ordi- 
nary Differential Equations, 304 Numerical 
Solution of Partial Differential Equations, 441 
Variational Principles. 

In Section C we describe a method for deriv- 
ing an asymptotic expression for a function 
written as an integral in a complex plane. 

B. Perturbation Method 

(1) Regular Perturbation, (a) Initial-value prob- 
lem. Consider an initial-value problem of an 
ordinary differential equation involving a 
small parameter f.: 



u = 6(e) at t = 0. (2) 

We assume that / is sufficiently smooth as a 
function of t and is regular as a function of u 
and e, and that b is a regular function of s. 

Substituting the power series in e for f u, 
and b, 

fit, u; s) = f°\t, u) -(- ep ’ ’((, u) + , 
hie) = , 
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into equations (1) and (2) and equating terms 
of equal power, we get 

dt 

dt 

du^^^ 1 

^ + r/r(l, 

dt 2 

the solutions of which determine the perturba- 
tion series for u_ The equation for (the 
unperturbed equation) is nonlinear in general, 
but those for are inhomogeneous 

linear equations with identical principal parts. 
The system of differential equations 

dill 

flit, Uj , . . . , U„, fi) (i — 1, 2 j . . . jH); 

M, = /?;(e) at t = 0. 



or a differential equation of higher order. 



dt" 



t,u, 



d''-^u \ 






d"-'u 



= 6*" * *(g) at t = 0, 



can be treated in a similar way. 

(b) Boundary-value problem. Given a linear 
differential operator H = Hq +sV involving 
a small parameter £ linearly [Hq is a linear 
differential operator and Visa function of 
(Xi,X 2 , consider the boundary-value 

problem 

{Ho + cV)u=f[xi,...,x„) (xi,...,xJeQ, (3) 

Bu = 0 (xi,...,xJeoQ, ( 4 ) 

where B is a linear and homogeneous 
operator. 

Substitution of the Taylor expansion 

U = Uq + F,Ui + 

into equations (3) and (4) gives 
HqUq=I Bu,=0, 

HqUi + Vuq = 0, Bu, =0, 



X t)(^i , , 

where G(xi, x„|^j ,.••??„) is the Green’s 
function for the unperturbed problem. The 
perturbation series 

u = K/-F.K VKf + c^KVK VKf- . . . 

converges if V K is bounded and |£| < 1/|| VK\\. 

(c) Eigenvalue problem. The solution of an 
eigenvalue problem 

(Hq + £V)u^)m (5) 

with the boundary condition Bu = 0 for the 
operators considered in (b) can be obtained in 
power series in e; 

?. = -\-£^ -\- ..., 

where Uq and are the eigenfunction and 
nondegenerate eigenvalue, respectively, of the 
unperturbed operator Hq, which is assumed to 
be self-adjoint. Substitution of the series into 
(5) gives 

(Ho-AQ)ii^^>^-VuQ + ;f%, ( 6 ) 

(Hq-Aq)u<^>= + + 

(Hq - ™ 



If we normalize Uq and u by the conditions 
(uq, Uq) = 1 and (u, Uq)= 1, we get from equation 
( 6 ) 

a'^^ = (Vuq,Uq) = x. 

Let S be the inverse of Hq—AqI in the sub- 
space perpendicular to Uq (we put Su, = 0), 
then each term of the perturbation series for u 
and X is obtained in the following way from 
the system of the equations given above: 

-SVuq, 

= - (VS VU„ Mo), 

w'2) = S(|/-a)SKMo, 

f^^ = (VS(V-x)SVuQ,Uo), 

_s(y-a)S[V-x)SVuQ 

+(VSVuq,Uq)S^Vuq, 



of which the solutions are 
Uo^Kf Ut = -KyKf, 

U 2 = KVKVKf.... 

Here K = Hq^ is an integral operator such that, 
for any function v= t’jXj , X 2 , , x,), we have 



(2) Singular Perturbation. Formal application 
of the procedure described in (1) often fails 
when the term including the highest-order 
derivative is multiplied by a small parameter 
(or a lower-order term is multiplied by a large 
parameter), i.e., when we deal with singular 
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perturbation problems. Typical methods for 
treating such cases are described below. 

(a) WKB method. The method for getting an 
asymptotic solution of the second-order dif- 
ferential equation 

d^u 

-^ + FP(x)w = 0 (7) 

dx^ 

for large values of k was developed for prob- 
lems arising in classical wave motion (Jeffreys, 
1924 [1]) and for problems of quantum 
mechanics (Wentzel, Kramers, Brill ouin, 

1926), and is called the Jeffreys method, WKB 
method, or WKBJ method. 

If the function P in (7) does not vary rapidly, 
the solution is expected to have the form 

ucjzQxp{ik(p{x)}, 

with (p(x) not very different from a linear func- 
tion of X, since the exact solution for P = Pq = 
const is exp( ± x). 

Substitution of this expression for u in equa- 
tion (7) gives 

ik(p" — k^(p'^ + k^P — 0. 

Neglecting the first term, we get 

(p'^ = P, i.e., (p(x)= |* P^^^dx. 



This result suggests a transformation of the 
form 








If P is positive for the interval Zq^z^Zj in 
question and has continuous derivatives up to 
second order, the foregoing series converges to 
the solution of (8). 

On the other hand, when kz takes complex 
values, the series (9) does not converge in 
general. The series expansion appropriate for 
this case is 

ai 1 

w{z-,k}^exp{ip{z;k)}, \jj= +//cz + ^— -tA„(z), 

1 (j/c) 

and we get the asymptotic expansion for w: 
w-exp I ±ikz + ^^ QdC + ~Q{z) 

+^mz)-2S(zfl .} 

Now, in many cases, P takes both positive 
and negative values within the interval in 
question, e.g., in the problem of transonic flow 
of a gas, in the problem of determining the 
reflectivity and transmissivity of a wave propa- 
gating in a nonuniform medium, or in the 
quantum-mechanical problem of a material 
wave passing through a potential wall. The 
point at which P vanishes is called a turning 
point. In this case there arises the problem of 
analytic continuation of the solution obtained 
in one side into the other through the turning 
point. For the special case when Focx" (the 
turning point is chosen as x = 0), it can be 
shown that the exact solution of equation (7) is 
explicitly given by 

woc,yz Z„(/cz), 

where v = \/(n + 2), and is a cylindrical 
function of order v. Based on this fact, the 
continuation formula for the case when P 
is approximated by x" near x = 0 has been 
obtained. 

(b) Lighthill’s method. With nonlinear equa- 
tions, formal application of a perturbation 
procedure sometimes leads to the difficulty 
that the solution of the unperturbed equation 
has a singularity not exhibited by the exact 
solution of the original equation and whose 
order increases in higher approximations, so 
that the perturbation series ceases to have any 
meaning near this point. Such a difficulty can 
often be overcome by so-called coordinate 
straining. This method has been applied to a 
number of fluid-dynamical problems, such 
as those arising in theories of aircraft wings, 
boundary layers, and shock waves. 

Before describing the method, it is conve- 
nient to sketch Poincare % method of getting 
the perturbation series for the period of a 
nonlinear oscillation. In order to solve the 
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nonlinear equation 
du 

where Tq is the period of oscillation when the 
small parameter £ is taken to be zero, Poincare 
wrote series expansions for both u and the 
period T: 

M = Mo(t) + 6Mi(t)+ , T= 7^ + gTj + ... , 

and determined Tj , T 2 , • • - in such a manner as 
to avoid resonance at each step in the deter- 
mination of u,(t), u,(t), . . 

Based on this idea, Lighthill(1949) [2] 
developed coordinate straining in order to 
overcome the difficulty stated above. We illus- 
trate the method here by mcailS of the follow- 
ing simple example. 

Consider a boundary-value problem of a 
nonlinear differential equation 

(x-f£u)— -f M = 0 (0<x< 1): (10) 



u(l)= 1. 



( 11 ) 



Application of the usual perturbation method 
would give the series solution 



1 £/l 

W-- + -I - 

X 2\x 





As is seen, however, the order of an apparent 
singularity occurring at x = 0 increases as we 
proceed to higher approximations so that it is 
not possible to see the true behavior of the 
solution near x = 0. 

In Lighthill’s method, we assume that series 
expansion is valid not only for u but also for 
the independent variable x, i.e., 

U = Uo(^) + £Ml(^)+..., X = i^ + £X,(<^)-t-.... 

Substitution of these series into equations (10) 
and (11) gives 



^u'q + Uq = 0, Mo(1)=1, 

^U\+U^ = ~(UqX\ +u'qX^+UqU'q), 

w,(l) = 0, Xi(l) = 0, 



From the first equation we have Uq = \j^. 
Before solving the second equation for Uj , we 
determine Xj ((^) so as to avoid the increase 
of the order of singularity in «o(^) ^ =0. 

This requirement is satisfied by taking Xj = 
(1/2) (^ l/(^), and this gives Uj = 0; thus, at 

this stage of approximation, we have 



The guiding principle is the same for higher 
approximations. (The foregoing expression 
happens to be the exact solution, but this a 
fortuitous result of our choice of this particular 
example.) 

The convergence of the series obtained by 
Lighthill’s method has been proved by Wasow 
(1955) [3] for the more general problem 

d u 

(^ + £w)— +^(x)w = r(x). 



u(a) = b, 



where e, a, and b are positive constants and 
q(x) and r(x) are functions regular in |x| 

Lighthill’s method can also be applied to 
partial differential equations. Sometimes it is 
called the PLK method (Poincare-Lighthill-Kuo 
method), after its successful application to the 
problem of the boundary layer of a thin flat 
plate by Kuo. 

(c) Method of matched asymptotic expan- 
sions. In some cases it is possible to get a uni- 
formly valid series solution if we divide the 
domain into two or more subdomains, solve 
the equation by use of a suitable independent 
variable for each subdomain, and then deter- 
mine the coefficients in each solution by the 
process of matching neighboring solutions 
on their common boundary. Such a perturba- 
tion technique was developed in treating the 
boundary-layer equation in fluid dynamics 
and has been systematized into the method 
of matched asymptotic expansions. 

The idea of the method can be shown by 
the following simple example. Consider the 
boundary-value problem 



d^u d u 
dx^ ^ dx 



(0<x< 1), 



u(0) = 0, m(1)=1, (12) 

where £ is a small positive parameter and a is a 
constant such that 0 <a< 1. In order to solve 
this problem, we first take x itself to be an 
independent variable (the outer variable) in the 
domain x » £ and expand the solution as 

U= Uq{x) + £U^{x) + 

Because the highest derivative is multiplied by 
a small parameter g in equation (12), the per- 
turbation equations to be solved are all of first 
order, of which the first one is 

d^o n n\ 1 

— -a, C/o(l)-l. 



Solving this, we get an approximation for the 

outer solution: 

C/o(x)= 1 -a(l-x). 

On the other hand, we introduce a new 
independent variable ^ (the inner variable) by 
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putting X = 6(^ for X < g. This transforms equa- 
tion (12) into 



d^v dv 



= viO^uix). 



If we assume the expansion 

v=V,iO + eVd^)+..., 

we have the equation for Vq. 



d^Vp dK 

de d^ 



W)=o. 



This gives an approximation for the inner 
solution: 

V,{S)=c(\-e~% 



Finally, in order to match the inner and the 
outer solutions, we equate their values at some 
point within the interval g « x « 1 ((^ -> oo for 
the inner solution, x->0 for the outer solution): 



Fo((X))=[/o(0)= 1 -a, 

from which the coefficient in Vq is determined 
to be c = 1 -a. Therefore we have an approxi- 
mate solution valid in the whole interval 

u^l/o(x)+Fo(0-0 -a) 

= ax + (l — a)(l 



C. Method of Steepest Descent 

If an analytic function f(z) of a complex vari- 
able z is expressed in terms of an analytic 
function g(t) by an integral 

/(z) = |^exp{z6^(^)}^i^, (13) 

where C is a curve on the complex t-plane, 
then an asymptotic expression for f(z) for 
large values of jz) can be derived by the 
method of steepest descent (the saddle-point 
method). The idea of the method may be 
traced back to Riemann, and various asymp- 
totic expressions for cylindrical functions were 
obtained by Debye (1909) [5], 

The point at which the first derivative of g(t) 
in the integral (13) vanishes is called a saddle 
point. The function g(t) is expanded in a power 
series near this point as 

giO = gitp) +^g"(h) + • • 

We have an inequality 
z/(fo)(t-fo)^<0 
along the line 

7T 1 

L: arg(g-to)----arg{z£^"(ro)}. (14) 



Therefore the absolute value of the integrand 
exp{zg(t)} reaches a maximum at to on the 
line L and decreases along it more rapidly 
than along any other direction. Hence if we 
deform the integration path in such a way 
that it passes through the saddle point Iq in 
the direction of steepest descent, the value of 
the integrand is practically zero on the new 
path except very near to when |z| is large, 
whereas the value of the integral remains the 
same. Therefore we can get an asymptotic 
expression of f(z) for large |z| by truncating 
the Taylor series for g{t) up to the second term 
and taking the line L as the integration path: 



/(z) xexp{zg(to)} exp <-zg''(to)(t -tof}dt 



^exp{z3(to)}. 

y -zg (to) 

As an example, Stirling’s formula 



n\iii^/znn n e 



for a large positive integer is derived if we start 
with the formula 





exp(u(logs-s)}ds 



and make use of the method of steepest descent. 
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26 (Xx1.4) 

Arab Mathematics 

The role of the Arabs in cultural history has 
been partly that of cultural transmitter. Between 
the 7th and 13th centuries, they established a 
religious empire that extended from India to 
Spain; later it was divided into the eastern and 
western empires. The caliphs of these empires 
encouraged research in the sciences, so the 
capitals Baghdad and Cordova became centers 
of culture where scholars from different coun- 
tries gathered. 

Arabic scholarship is sometimes called the 
stepfather of European culture. During the 
13th Century, Alphonso X (1252- 1284) invited 
Islamic and Hebrew scholars to the Spanish 
court to translate their writings on algebra, 
medicine, and astronomy into Spanish. This 
accomplishment earned him the title of Al- 
phonso the Wise. 

The first contact between Greek and Indian 
mathematics took place in Baghdad under 
Caliph Al-Mansur (754-775); Euclid’s Ele- 
ments was introduced by way of the Byzantine 
Empire, while Brahmagupta’s Brahmasphuta- 
siddhdnta came directly from India. Many 
mathematical texts found in the Eastern Ro- 
man Empire and Syria, including some Greek 
works, were translated into Arabic. Though it 
is difficult to discern essential scientific ad- 
vances in Arabian works, the diffusion of 
these translations was instrumental in the 
development of European mathematics. 

The Arabs did not use written numerals 
until Mohammed’s time (570-632). Signs 
representing numbers had been introduced 
into Arabia when its influence encompassed 
Egypt and Greece. Indian numerals were 
imported with Brahmagupta’s book and 
became our present Arabic numerals after a 
series of modifications. 

Among all the branches of Arab mathe- 
matics, algebra was the most advanced. It 
started with Alkwarizmi’s (820) A1 Gebr W’al 
Muquabala, the origin of the word “algebra.” 

It was the first mathematical book written in 
Arabic. Its content was essentially a variety of 
methods of solving algebraic equations. A1 
gebr means “transposition of negative terms 
on one side of the equation to the other side 
and changing their signs.” and al muqua- 
bala means “simplification of the equation by 



gathering similar terms.” Eor example, qua- 
dratic equations can be brought by these 
methods into one of the following three types: 

—px + q; -\-q — px; x^ + px~ q, where p, q 
are positive numbers. The Arabs expressed the 
rule of solving these equations verbally. They 
apparently knew that quadratic equations 
have two roots, but they adopted only posi- 
tive roots; when the equation had two posi- 
tive roots they adopted the smaller root. The 
proof was given geometrically. It is possible 
that they learned geometric proofs from the 
Greeks. 

Eor the Arabs, geometry was secondary to 
algebra. They did not appreciate proof as seen 
in Euclid’s Elements. The book on conic sec- 
tions by Apollonius was also translated into 
Arabic, but no essential progress was made in 
this area. The only remarkable contribution 
was that of Omar Khayyam, author of The 
Rubaiyat, who applied conic sections to the 
solution of the cubic equation x^+ bx = a. 

In trigonometry, Al Battani (c. 858-929) left 
a notable contribution. He studied The Almag- 
est, the Arabic translation of Ptolemy’s astro- 
nomical work. He added nothing outstand- 
ing to plane trigonometry, but obtained such 
formulas as CDS a = CDS cos c + sin sin c oos a 
for spherical triangles, which were not men- 
tioned in The Almagest. 

References 

[1] M. Cantor, Vorlesungen liber Geschichte 
der Mathematik I, Teubner, third edition, 

1907. 

[2] G. Sarton, Introduction to the history of 
science I. Erom Homcr to Omar Khayyam, 
Carnegie Institution of Washington, 1927. 



27 (V.18) 

Arithmetic of Associative 
Algebras 

A. General Theory 

Eet g be a tDedekind domain (i.e., an integral 
domain in which every ideal is uniquely de- 
composed into a product of prime ideals), let F 
be the field of quotients of g, and let A be a 
^separable algebra of finite degree over E. A g- 
iattice a of A is a g-submodule of A that is 
linitely generated over g and satisfies A = Fa. If 
a subring o of A is a g-lattice containing g, 
then o is called an order. A maximal order is 
an order that is not contained in any other 
order. A maximal order always exists although 
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it may not be unique; in partieular, if A is 
eommutative, it has only one maximal order. 
Ifweput o, = {xe/l|xac:a} (o, = {x6/4|qxc 
a}) for a g-lattiee a of A, then o, (o^) is an 
order of A and is ealled the left (right) order of 
a. If the left order of a is maximal, then so is 
the right order; the eonverse is also true. If O; 
and Oy are maximal, we call a a normal g- 
lattice. To describe the same situation we say 
that a is a left o, -ideal or a right Or-ideal. If 
0 , = 0 ^ = o, then we say that a is a two-sided o- 
ideal. A lattiee a with a c o, (or equivalently 
with a c 0^) is ealled an integral g-lattice or 
integral left (right) ideal. The product ab of 
two normal g-lattiees a, b is ealled a proper 
product if the right order of a coincides with 
the left order of b. The proper produet defines 
the strueture of a tgroupoid on the set of all 
normal g-lattiees of A. In partieular, the in- 
verse a of a normal g-lattiee a satisfying 
aa“^ = 0 (, a“^a = o,. is given by = {xeA |xa 
c oJ= {x eA ax c or}. For a fixed maximal 
order o, a maximal integral two-sided o-ideal p 
different from o is called a prime ideal of o. If p 
is prime, then o/p is the matrix algebra of 
degree k over a division algebra, and k is 
ealled the capacity of the prime ideal p. The set 
of all two-sided o-ideals forms a multiplieative 
group, of whieh prime ideals are independent 
generators. 

B. Maximal Orders of a Simple Ring 

In the rest of this artiele, A is a tsimple ring, F 
is the tcenter of A, and o is a maximal order of 
A. The prime ideals q of o and the prime ideals 
p of g are in one-to-one correspondence by 
the relation qf)g = p; o/qis a simple algebra 
over g/p, and q'^^ po for some natural num- 
ber e. The different b of o is defined by = 
{xeA I T’r(xo)cg}. (Tj- is the tredueed traee 
from A to f ; b is an integral two-sided o-ideal, 
and is divisible by q^~i if q® = po.) For q to 
divide b, it is neeessary and sufficient that 
either e > 1 or o/q not be separable over g/p. 

In partieular, if A is a ttotal matrix algebra 
over F, then b = o, q = po. The ideal of q gener- 
ated by the 'f'reduced norms (to F) of the ele- 
ments in a normal g-lattiee a of A is denoted by 
N,„(a). If ab is a proper produet, then we have 
iV^/f(ab) = iV, 4 /f(o)Af^/f(b), where iV^/f(b)does 
not depend on the choice of o; this is ealled 
the discriminant of A. If [A : F] = and q*^ = 
po, then iV^/p(q) = p^, ef=n. 

C. Simple Rings over a Local Field 

Let f be a field that is complete with respeet 
to a tdiscrete valuation whose field of residue 
elasses is finite. Let q be the tyaluation ring of 



F, p be the maximal ideal of g, and A be a 
simple algebra with F as its center. If A is a 
division algebra and [A : F] = rF, then A has 
only one maximal order o, and o has only one 
prime ideal q such that q"= po; o/q is an exten- 
sion of degree n of g/p. If A is not neeessarily a 
division algebra, the relation = o’ with 

an element ^ of A holds between two maximal 
orders o and o’ of A. Furthermore, for any left 
(right) o-ideal a, there exists an element 0( such 
that a = oa (a = ao). By using the notation of 
tcyclic algebra, we can express A in the form 
{K„, o, n’’). Here is the unramified extension 
of degree n over F, (j is the tFrobenius substi- 
tution of K„/F, 7T is a prime element of F, and 
0 Y < n. The element of Q/Z determined by 
rjn (mod Z) is denoted by {A} and is ealled the 
tHasse invariant of the algebra class contain- 
ing A. The mapping A -{A} gives an isomor- 
phism of the ■tBrauer group of F onto the 
additive group Q/Z. If M is an extension of F 
of finite degree, then {A”‘} = [M : F] {A} holds 
for the algebra obtained from A by sealar 
extension (- 29 Assoeiative Algebras). 

D. Simple Rings over an Algebraic Number 
Field 

Let F be an algebraie number field of finite 
degree, and let A be a simple algebra with 
center F. Then A is a eyelie algebra and is 
isomorphie to a total matrix algebra Over a 
division algebra D. The order n of the algebra 
class of A over F is determined by rF = [/) •' F] 
(H. Hasse, R. Brauer, and E. Noether). 

Denote by F^ the eompletion of F with 
respeet to a tprime divisor p of F, and let A, be 
the algebra obtained from A by the sealar 
extension Fp over F. For a finite prime divisor 
p, the meaning of {A,,} is as before; for an 
infinite prime divisor p, put { A, } = 0 or 1/2 
(mod Z) aeeording as A, is a total matrix alge- 
bra over Fp or not. Furthermore, define the 
subgroup Jp of Q/Z by 

dp = Q/Z, p a finite prime 

divisor, 

= {0, 1/2 (mod Z)| p a real infinite 
prime divisor, 

= {0}, p a complex infinite 

prime divisor. 

Now let J be the subgroup of the direet prod- 
uct rip dp consisting of all elements of the 
form (a,) («p eJ,) such that oCp = 0 except for a 
finite number of prime divisors and Xp i^p = 0- 
Then A-\-( | A,}) gives rise to an isomorphism 
of the Brauer group over F onto J (Hasse). 
Each {A,} is called the tp-invariant of A. In 
particular, A is a total matrix algebra over F if 
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and only if A, is a total matrix algebra over 
fp for all p (- 29 Associative Algebras). These 
theorems are closely related to tclaSS field 
theory. 

If o is a maximal order of A and a, b are left 
o-ideals, then = b with an element of A 
defines an equivalence relation between a and 
b. The number of equivalence classes of left 
ideals with respect to this equivalence is called 
the class number of A; it is independent of the 
choice of o and is equal to the class number 
defined by using right ideals. The product of 
all real infinite prime divisors p with {Ap} = 1/2 
is denoted by P^. If is the product of all 
inlinite prime divisors and [A : F] = 4, then A is 
called a totally definite quaternion algebra. 

If o is a maximal order of A and A is not a 
totally detinite quaternion algebra, then 
N,„(a) gives a one-to-one correspondence 
between the classes of left o-ideals and the 
congruence classes of ideals of F modulo P^ 

(- 14 Algebraic Number Fields H) (Eichler’s 
theorem). 

In particular, if A is a total matrix algebra 
over F, then the class number of A is equal to 
the class number of F. The class number of a 
totally detinite quaternion algebra was deter- 
mined by M. Eichler by using the zeta function 
(- Section E) of A [4]. 

Eet o be a maximal order of A, a be an 
integral two-sided o-ideal, b be an integer of F, 
and be an element of o. Eurthermore, assume 
b = l (mod P^), = b (mod a D E) (tmulti- 

plicative congruence). Then there exists an 
element of o such that = b, 

(mod a) (Naif is the reduced norm), provided 
that A is not a totally definite quaternion 
algebra [5]. This theorem, which is called 
Eichler’s approximation theorem, is widely 
applicable; e.g., it yields the previous theorem 
on the class number and can be generalized to 
the case of semisimple talgebraic groups (-^13 
Algebraic Groups). 

E. Algebras over a Function Field 

The Hasse-Brauer-Noether and Hasse theo- 
rems also hold for tnormal simple algebras 
over a +field of algebraic functions of one vari- 
able over a finite field. On the other hand, a 
normal simple algebra over a tield K of alge- 
braic functions of one variable over an alge- 
braically closed field is a total matrix algebra 
over K (Tsen’s theorem). 

F. Other Notions 

Adeles and ideles for a simple algebra A over 
an algebraic number field of finite degree can 



be introduced as in the case of number fields 
(^ 6 Adeles and Ideles). If N(a) stands for the 
number of elements in o/o, where o is a maxi- 
mal order of A and a is an integral left o-ideal, 
then the zeta function of the simple algebra A, 
called the Hey zeta function, is detined by (^(s) 
= Z N(a)~^ (the sum over all integral left o- 
ideals). This function has properties similar to 
those of Dedekind zeta functions (- 450 Zeta 
Functions L). Eet p be an infinite prime divisor 
of the center F of A, and let be the group of 
elements in A, with the reduced norm 1. Put 
G = Hp Gp (the product over all infinite prime 
divisors of F). Then the group F of units with 
the reduced norm 1 in o is naturally regarded 
as a subgroup of G. To be more precise, F is a 
discrete subgroup of G, the volume of G/F is 
finite with respect to an invariant measure, 
and G/F is compact if and only if A is a divi- 
sion algebra 122 Discontinuous Groups). 
This result can be viewed as a special case of 
more general facts about semisimple algebraic 
groups (- 13 Algebraic Groups). If K is a 
tmaximal compact subgroup of G, then E gives 
rise to a tdiscontinuous group operating on 
the homogeneous space G/K, and we obtain 
^automorphic forms with respect to F. The 
case where A is a tquaternion algebra has 
been studied extensively (- 32 Automorphic 
Functions). 
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28 (XXI.1 3) 

Artin, Emil 

Emil Artin (March 3, 1898-December 20, 

1962) was bom in Vienna. After he studied at 
the Universities of Vienna, Leipzig, and Got- 
tingen, he taught at the University of Ham- 
burg from 1923 to 1937. In 1937 he left Ger- 
many under the Nazi regime for America, 
where he taught at Notre Dame, Indiana, and 
Princeton universities. He returned to Ger- 
many in 1948 and taught again at the Univer- 
sity of Hamburg until his fatal heart attack. 

In his thesis (1923), he proved affirmatively 
the +Riemann hypothesis in function fields 
over finite constant fields in the hyperelliptic 
case, and conjectured that this should be valid 
in general. This was eventually verified by A. 
Weil in 1941. Also in 1923, he introduced the 
■•■L-function for Galois extension and was led 
to the tgeneral law of reciprocity, which he 
stated as a conjecture and proved four years 
later, thus bringing tclass field theory to com- 
pletion. Around the same period, he estab- 
lished in collaboration with 0. Schreier the 
theory of tformally real fields and solved the 
17th tproblem of Hilbert. He also initiated the 
theory of ’’’braids, which he later developed 
with F. Bohnenblust in the 1940s. In view of 
these and other ingenious works as well as 
his inspiring teaching in all areas of mathe- 
matics, Artin is considered to have been one of 
the most influential mathematicians of this 
Century. 
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Associative Algebras 

A. Fundamental Concepts 

Let .A be a commutative ring with unity ele- 
ment 1 (- 368 Rings A), and let A be a ring 
which is a tunitary K-module (- 277 Mod- 
ules). Such a ring A is called an associative 
algebra over K (or simply algebra over K) if 
it satisfies the condition X{ah) = {Aa)h = u(//>) 
(2e K; a, he A). An (associative) algebra A over 
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K is often written A/K, and K is called the 
coefficient ring (or ground ring) of the algebra 
A= A/K. In particular, if .A is a tfield, then it is 
called the coefficient field (or ground field) of A. 
Algebras over fields have been studied in de- 
tail. Notions such as zero algebra, unitary 
algebra, commutative algebra, (semi) simple 
algebra, and division algebra are replicas of the 
respective ones for rings (- 368 Rings). Con- 
sidering both structures as rings and as K- 
modules, homomorphisms and isomorphisms 
are defined in a natural manner, and are called 
algebra homomorphism and algebra isomor- 
phism, respectively. In this connection, sub- 
algebra, quotient algebra (or residue class 
algebra), and direct product of algebras are 
also defined as in the case of rings. 

An '''ideal of an algebra A is defined as an 
ideal of the ring A, which is at the same time a 
submodule of the module A over the coeffi- 
cient ring. The tradical of an algebra A con- 
sidered as a ring is then an ideal of A in this 
sense. In fact, the existence of a unity in an 
algebra A implies that any ideal of A consid- 
ered as a ring is necessarily an ideal of the 
algebra A. 

In the rest of this article, we assume that all 
rings have a unity element and that all homo- 
morphisms are unitary. Hence, when we con- 
sider subalgebras of an algebra A, we require 
that they share the unity element with A. If e is 
the unity element of an algebra A over K, then 
the mapping = X' (XeK) is a homomor- 
phism K A whose image Ke is contained in 
the '''center of A, and the scalar multiplication 
Xa is equal to the ring multiplication Xa {X e K, 
ae A). Conversely, given a homomorphism of 
K into a ring A whose image is contained in 
the center of A, we can regard A as an algebra 
over K in an obvious way. Hence we are given 
an algebra A over K if and only if there exists 
a pair (A, p) of a ring A and a homomorphism 
p : K-^ A whose image is contained in the 
center of A. There exists a uniquely determined 
(unitary) homomorphism of the ring Z of 
rational integers into any ring; hence any ring 
can be regarded as an algebra over Z. If the 
coefficient ring A of a nonzero algebra A is a 
field, then A can be regarded as a subfield 
contained in the center of A, and the unity 
element 1 of A coincides with the unity ele- 
ment of A. 

Let A and B be algebras over A. Then the 
ttensor product A 5 of K-modules is an 
algebra over A under the multiplication 
(a ®b)(a'®b’) ~ad ®bb’ {a,a'eA, b, h'eB). 

This algebra is called the tensor product of 
algebras A and B. Moreover, the mapping 
a^a® 1 (resp. h-^\®b) {aeA, beB) gives an 
algebra homomorphism A-1 A ®f^ B (resp. 6^ 
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A B, which is called the canonical homo- 
morphism. In particular, if A is a commutative 
algebra, then A B can be regarded as an 
algebra over A by this canonical homomor- 
phism; and in this case A B is called the 
algebra obtained by extension of the coeffi- 
cient ring of 5 to A (or a scalar extension of B 
by A), and is often denoted by The algebra 
K B is canonically isomorphic to B. Fur- 
thermore, (A(^f,B)®i^C and A (x)^(B 0^ Cj 
are canonically isomorphic and are written 
A 0 B 0 C. Let A and B be commutative 
algebras over K. Then, for any commutative 
algebra C and homomorphisms a : A+ C, [j : B 
->C, there exists one and only one homomor- 
phism y:.4 0jjB->C such that a(a) = y(fl0 1), 
li{b) = y{i ® b){a6 A b e B). This property char- 
acterizes the tensor product A ®>^B of com- 
mutative algebras A and B. In this sense, 

A 0^ B is sometimes called the coproduct of A 
and B (- 52 Categories and Functors E). 



B. Examples of Associative Algebras 

As we mentioned, any ring can be regarded as 
an algebra over the ring Z of rational integers. 
But it is often useful to deal with algebras over 
“large? or more “efficient” coefficient rings. 
For instance, we have many rings which are 
algebras over a commutative ring K, such as 
the tring of polynomials, the ‘''ring of formal 
power series in n variables with coefficients in 
K, the tendomorphism ring of a K-module, 
and the tfull matrix ring of degree n over K 
(- 368 Rings C). There are other important 
classes of algebras, such as (semi) group alge- 
bras, Hecke algebras, and crossed-product 
algebras, which will be explained later. These 
algebras are defined by a canonical basis con- 
nected directly with a (semi) group structure. 
On the other hand, the ttensor algebra and 
the texterior algebra of linear spaces and the 
■•■Cliffbrd algebra associated with a given 
quadratic form are also important (-61 
Clifford Algebras, 256 Linear Spaces). 

The most frequently used example of a 
division algebra is the quaternion field H (often 
called Hamilton’s quaternion algebra, W. R. 
Hamilton, 1858). This is a 4-dimensional 
linear space over the real number field R with 
basis { 1, Uj, k}, with the following laws of 
multiplication: 1 is a unity element, p —p = 

= -!,//= -ji = k, jk = -kj = i, and ki - -ik = j. 
An element of H is called a quaternion. The 
only finite-dimensional division algebras over 
the real number field R are the real number 
field R, the complex number field C, and the 
quaternion field H. 



C. Group Algebras and Hecke Algebras 

Let K be a commutative ring and be the 
direct sum T.sec niodules K^, where each 
Kj is isomorphic to K with a group G as the 
index set (- 277 Modules F). The elements of 
are the families (/ij)sgg of elements of K 
whose components are all zero except for a 
finite number of them. Let be the 

canonical basis of namely, is an ele- 
ment of of which the sth component is 1 
and the others are 0. The module has the 
structure of an algebra over K, where the law 
of multiplication is determined by u,u, = 

Mst(s, teC). This algebra is called the group 
algebra of G over K. The product 
elements a = (XJ and ^ = (^J of is then 
given by 

E seG. ( 1 ) 

s — rl 

Each basis element is often identified with 
the group element s, and in this manner, the 
group G is regarded as a basis of which is 
usually written KG or K[G]. 

In this definition, the group Q Can be re- 
placed by a semigroup G, and then the algebra 
X*^^ is called a semigroup algebra. As an 
example, let N be the additive semigroup of all 
nonnegative rational integers and N" be the 
direct product of n copies of N. If we denote 
the elements (ij, , i,) of N” by X\' X)p and 
use multiplication instead of addition, then the 
semigroup algebra of N" over X is exactly the 
■•■ring of polynomials X [Xj, • , XJ. On the 
other hand, if G is a semigroup, then even 
when it is infinite, it may occur that for any 
SE G, there exists only a finite number of pairs 
(y, /) of elements of G such that s = rl. In this 
case, formula (1) also defines the law of multi- 
plication on the tCartesian product K^. This 
algebra is called a large semigroup algebra 
and contains as a subalgebra. In par- 
ticular, the large semigroup algebra of N" 
is exactly the tring of formal power series 
K[[A-, 

Let H he a subgroup of a group G, and 
assume that the index of // fl sHs~^ in H is 
fmite for any s e G. This assumption is equiva- 
lent to the condition that any double coset of 
G by // is a union of a finite number of left 
as well as right cosets. Let H\G, G/H, and 
H\G/H be the set of all tright cosets, +left 
cosets, and ■''double cosets, respectively. Then 
each element of the direct sum 
or can be regarded as a function 

defined on G taking a constant value on each 
right, left, or double coset, respectively. Con- 
versely, any function defined on G can be 
regarded as an element of or 
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G/H] jj- takes a constant value on each 
left, right, or double coset, respectively, and if 
it vanishes everywhere except on a finite num- 
ber of cosets of respective type. Under such 
identification, let and denote the 

values of a function a:G-*K on seG, /e H\G, 
r e G/H, and t G H \G/H, respectively. For any 
/■, n E we define a function A * /x by 

seG, ( 2 ) 

serl 

where the right-hand side is the sum taken 
over all pairs (r, /) such that serl, re G/H, 
IeH\G . It can be shown that this sum is a 
tinite sum and that 1 * pe Hence the 

module has the structure of an alge- 

bra over K, which is called the Hecke alge- 
bra of (G, H ) over K and often denoted by 

H ). If H = {b}, the H ) is exactly 

the group algebra of G. In general, ,^f;^(G, H ) 
can be regarded as an algebra obtained by 
extending the coefficient ring of Jfi(G, H) to 
K. Furthermore, can be re- 

garded as a right (left) module over the group 
algebra and the endomorphism ring 

(resp. is canonically 

isomorphic (anti-isomorphic) to the Hecke 
algebm 

D. General Crossed Product 

Let G be a group that operates on a commuta- 
tive ring L, and denote the operation by (s, a)^ 
s(x) {seG,XeL); thus for any seG the map- 
ping a->s(a) (/6 L) is an automorphism of i, 
satisfying s(t(/l.)) = st(A) (s, t E G). For any I, 

HE we define the product/!, if fxe L'^'by 

(A*/t), = X V(Mi)/(lO, seG, (3) 

s=rl 

where {f{r,l)}r,i^G is a given family of elements 
of L . If this family satisfies the equations 

/(s, r)f{sr, 1) = s{f{r, /))/(s, rl], s,r,le G, 

then forms a ring. In terms of the canon- 
ical basis {wJseG, this ring structure is defined 
by the formulas f(r, u,a = s(a)Us 

(Ag L). If K is the subring of L consisting of all 
Ae L such that s(A) = A(se G), then the ring 
is an algebra over K, called the crossed product 
of L and G with respect to the given operation 
and the given factor set / of G. In a narrower 
sense of the term, we consider only the case 
when L is a field, G is a finite group, f{r, 1) + 0, 
and G operates on L faithfully. In this case, G 
can be identified with the ^Galois group of a 
tinite ■''Galois extension L/X, and so the crossed 
product is written {L/K, f). This is a tcentral 
simple algebra over (- Sections E, F). 

If the operation of G in the above- 
mentioned general crossed product is trivial. 
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namely, \- = K, the crossed product of K and G 
is called an algebra extension of G over K with 
respect to f. Usually, we assume that K is a 
field and f {r, /) ^^0. If f(r, 1) = I, the algebra 
extension is the group algebra. If G is a finite 
group whose order is prime to the tcharacter- 
istic of K, then any algebra extension of G over 
K (in particular, the group algebra) is always 
^semisimple and +separable. If G is a finite 
Abelian group and fis a factor set of G, then a 
bihomomorphism (p:G x G-^K* (a mapping 
which is a homomorphism in each variable) 
is detined by (p{s,t)= f{s, t)-/(t,s)“h An alge- 
bra extension of G over K. with respect to f 
is a central simple algebra if and only if (p is 
nondegenerate. In particular, if G is a direct 
product of n copies of a group of order 2, 
then by choosing n elements s,, S 2 , . of G 
suitably, any element of G can be uniquely ex- 
pressed as Sfc . Sj,, where {a, b, . , z} is a 
subset of { 1,. , n} in the natural order. Hence, 
if we take as a basis of an 

algebra extension, then any factor set f is 
determined by f{Si,Sj) = Aj^-, where Ay = 1 (/ <j), 
A;y= ± 1 (/ >j), and A;; aTc arbitrary. When A;^- 
= -1 (i>J), the corresponding algebra exten- 
sion is a tClifford algebra. Furthermore, if A,; 

= 0, then it is a tGrassmann algebra. If A,-,- 0 
and n is even, then it is a central simple alge- 
bra whenever the characteristic of K is not 2 
(-61 Clifford Algebras). 

Now let K be the real number field R and 
A, be the Clifford algebra with A,-,- = 1 . Then 

A, is the quaternion field. Elements of A, 
are called sedenions, and are important in 
■''spinor theory and '•'Dirac’s equation. In gen- 
eral, let K be an arbitrary field whose charac- 
teristic is not 2 and put n = 2, Aj 2 = 1, A 2 ] = 

—1, Ai , —Ay>^0, and A 22 = in the pre- 
vious notation. Then the corresponding cen- 
tral simple algebra Q is called a (general- 
ized) quaternion algebra. Thus Q has a basis 
{ 1, u, r, w} satisfying the following laws: 1 is 
the unity element, w = uv= ~vu, = A, and 
= fj- {A, HE K). Any central simple algebra of 
dimension 4 is isomorphic to a certain quater- 
nion algebra. (In particular, if K = R and A = H 
= -1, then Q coincides with the quaternion 
field H.) Eor any element x = 0 H + fiu + yv+ dw 
of Q, the element x = 01 — — yv — Sw is said to 

be conjugate to x, and N(x) = xxeKis called 
the norm of x. An element x of Q is invertible 
in Q if and only if N(x) 0. 

E. Finite-Dimensional Associative Algebras 
over a Field 

Eor the rest of this article, we assume that the 
algebras considered are unitary and finite 
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dimensional over a field K. Then by general 
properties of left (right) tArtinian rings, any 
associative algebra A has the following struc- 
ture: the tradical N of A is the greatest tnilpo- 
tent ideal, and the quotient algebra A/N = /4 is 
tsemisimple and decomposed into a direct sum 
of ideals which are simple algebras: 

^ = A j -f- . . . -4 „ . 

Each simple component A As a full matrix ring 
of degree r- over a certain division algebra D,-, 
and A ; is decomposed into a direct sum of r,- 
minimal left ideals which are mutually A . 
isomorphic: 

A^ = Ael^^ + ... + AeA\ 

where are orthogonal +idempotent 

elements of A; whose sum is equal to the unity 
element of A ■ On the other hand, 

Ai = ej^*A + ... +e}''‘^A 

gives a decomposition of A ,• into the direct 
sum of minimal right ideals that are mutually 
A-isomorphic. Moreover, we can choOSe an 
idempotent element ef^ of A from each residue 
class eO such that {eO} forms a system of 
orthogonal idempotent elements, whose sum is 
equal to the unity element 1 of A, and 

A=tiAer=ttef'‘* 

i=l s=l 1=1 s=l 

gives a decomposition of A into a direct sum 
of tdirect indecomposable left (right) ideals. 
Here, Aef^ and Ae^p {e^A and efA) are A- 
isomorphic if and only if i —j. Conversely, 
every decomposition of A into indecom- 
posable ideals is obtained as above. The A- 
submodule NeP of AeP is the unique maxi- 
mal proper submodule, and AeP jNeP and 
AePiNeP are A-isomorphic if and only if i = j , 
Any simple A-module is A-isomorphic to a 
certain AePlNeP (- Sections H, I). 

Any simple algebra A over K is isomorphic 
to a full matrix ring M,(D) over a certain divi- 
sion algebra D. This is called Wedderburn’s 
tbeorem. Here n is determined uniquely by A, 
and D is also determined uniquely by A up to 
isomorphism. Moreover, the center of A is 
isomorphic to the center of D. If the center of 
A coincides with K, then A is called a central 
simple algebra (or normal simple algebra) over 
K. In this case, any isomorphism of two simple 
subalgebras of A can be extended to an tinner 
automorphism of A. Let V(B) denote the tcom- 
mutor of a simple subalgebra B of A. Then 
V(B) is also a simple subalgebra and V(T(B)) 

= B, dim A = dim B ■ dim V(B). In particular, if 
B is central over K, then there is a canonical 
isomorphism ^ ^ B (g) ^ K(B). If D is a central 
division algebra, then any maximal commuta- 
tive subalgebra L of D is a field and satisfies 



(dim Lp = dim D. Moreover, there are '•'sepa- 
rable extensions over K among such L. In 
general, the dimension of a central simple 
algebra A is a square number r^, where r is 
called the degree of A. 

Two central simple algebras are said to be 
similar if they are isomorphic to full matrix 
rings over the same division algebra. This is an 
equivalence relation, and each equivalence 
class is called an algebra class. On the other 
hand, if A is a central simple algebra and 5 is a 
simple algebra, then A B is a simple alge- 
bra. Moreover, if B is central, then A 0^ B is 
also a central simple algebra. If A and B are 
similar to A’ and B', respectively, then A 0^^ B 
is similar to A ’ 0j^ B’ . Hence, the tensor prod- 
uct 0 defines multiplication in the set ^[K) 
of the algebra classes over K. If A is a central 
simple algebra and A ° is an algebra anti- 
isomorphic to A, then A° is also central simple 
and A 0^^ A° is isomorphic to a full matrix 
ring over K. This shows that ^(K) forms a 
group, which is called the Brauer group, after 
R. Brauer, who introduced this concept, or the 
algebra class group over K. For a central 
simple algebra A, the degree of a division 
algebra similar to it is called the Schur index of 
A (or of the algebra class of A), and the order 
of the algebra class of A in the Brauer group is 
called the exponent of A. The Schur index is 
divisible by the exponent; and conversely, the 
exponent is divisible by every prime divisor of 
the Schur index. 



F. Extensions of Coefficient Fields 

Let L be an textension tield over a field K. 

Then for any algebra A over K,L 0j^ A can be 
regarded as an algebra over L. This algebra is 
denoted by A^ and is called an algebra ob- 
tained by extending the coefficient field to L 
Let us denote the radical of a ring A by 91(A). 
An algebra A over K is called a separable 
algebra if it satisfies 9I(A^) = {0} for any exten- 
sion field L over K. In the special case when A 
is an talgebraic extension field over A, A is a 
separable algebra if and only if every element 
of A (or every element of a subset which gener- 
ates A) is tseparable over K (- 149 Fields). A 
(finite-dimensional) algebra A over K is sepa- 
rable if and only if A is semisimple and the 
center of every simple component of A is a 
separable extension over K. If the quotient 
algebra A/9?(A) of an algebra A is separable, 
then there exists a subalgebra S such that 
A = S + 9?(A) and B n tR(A) - {0}, and S is 
uniquely determined up to inner automor- 
phisms (Wedderburn-Mal’tsev tbeorem). 

In order that an algebra A over K be central 
simple, it is necessary and sufficient that A^ be 
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simple for any extension tield L over K. This 
latter statement holds if and only if is iso- 
morphic to a full matrix algebra over L for a 
certain extension tield L over K. Such an exten- 
sion field is a tsplitting field of A (- 362 Re- 
presentations F). For a central simple algebra 
A, a (finite) extension field L of degree r over K 
is a splitting field of A if and only if there exists 
a central simple algebra B of degree r which is 
similar to A having a subfield which is 
isomorphic to L. In this case, r is divisible by 
the Schur index of A. Furthermore, A has a 
separable splitting field whose extension de- 
gree is equal to the Schur index of A. This 
shows that A has a splitting field which is a 
finite Galois extension tield over K. 

Let L be a finite tGalois extension over ^ 
tield K and G be its tGalois group. If / is a 
factor set with respect to the operation of G on 
L*, then the crossed product (L/K,/) is a cen- 
tral simple algebra over K (- Section D). The 
product of two factor sets f and g is also a 
factor set, and the set of all factor sets forms an 
tAbelian group. Thus {L/K,fg) is similar to 
{LjK,f)®f^{LlK,g). On the other hand, factor 
sets f and g are said to be associated with each 
other if there exists a family of elements 

of L such that 

f{r,l) = g(r,l)r{Ai)?.~,^^^, r, leG. 

Hence, / and g are associated with each other 
if and only if (L/K, f) and (L//C, g) are similar. 
Therefore, the mapping f-^{L/K,f) gives a 
monomorphism of the group fi^(G, L*) of all 
associated classes of factor sets (which can be 
identified with the 2-dimensional tcohomology 
group of G with coefficients in L*) into the 
Brauer group over K. Its image coin- 
cides with the subgroup of all algebra classes 
which have L as a splitting field. In particular, 
any algebra class is similar to the crossed 
product of a certain finite Galois extension L 
and its Galois group G (R. Brauer, E. Noether, 
A. Albert, K. Shoda, and others). 



G. Cyclic Algebras 

Let Z be a '^'cyclic extension field of degree n 
over a field K. Then the crossed product of 
Z and its Galois group G is called a cyclic 
algebra over K. For a fixed generator s of 
G and any element (x^O of K,^ factor set 
/(.s', .sh (0 ^ i, j < n) can be defined by /(.s', s-') 

= 1 (i + j < n) and f{s s^) = ix{i +j ^ n). Let 
(Z, s, a) denote the corresponding crossed 
product. Then (Z, s, a) and (Z, s, j?) are similar if 
and only if a//I is a norm of a certain element 
of Z into K. On the other hand, any crossed 
product of Z and G is similar to a certain 
(Z, s, cc), and the correspondence a^(Z, s, a) 
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gives an isomorphism of K*/Nz/k{Z*) (the 
norm class group of Z/K) to the group con- 
sisting of the algebra classes over K which 
have Z as a splitting field. If K is a tp-adic field 
or a finite talgebraic number tield, then any 
central simple algebra over K is isomorphic to 
a certain cyclic algebra. In this section, we 
describe this situation in detail. 

Let K be a p-adic field and q the number of 
residue classes modulo p. If A is a central 
simple algebra over K, then its Schur index 
coincides with the exponent, which is called 
simply the index. A finite extension tield L over 
K is a splitting field for A if and only if the 
degree of L is a multiple of the index of A. If ri 
is the degree of A, then the field W= K(w) ob- 
tained by adjoining to K a tprimitive (q” l)st 
root of unity w is a cyclic (and tunramified) 
extension of degree n over K, and its Galois 
group is generated by the automorphism 
a of Wdetermined by K = iden- 

tity. Moreover, we have A ^ (W, a, a) for a 
certain a 6 K*. Let v be the texponential p- 
adic valuation i)(a) of y.. Then v/n (mod Z) is 
uniquely determined by the algebra class of A, 
which is called the Hasse invariant of A (or of 
the algebra class of A). By assigning to each of 
the algebra classes its Hasse invariant, we get 
an isomorphism of the Brauer group B(K) 
over K to the group Q/Z, the additive group 
of the rational numbers mod Z (H. Hasse, 

1931). 

Let R be a finite algebraic number tield and 
A be a central simple algebra over K. Let p be 
a (finite or infinite) tprime divisor of K and 
the tp-adic extension field over K. The algebra 
A, which is obtained from A by extending the 
coefficient tield to is a central simple alge- 
bra over K,. Except for a finite number of p, 

A, is isomorphic to a full matrix ring over K^, 
and A itself is isomorphic to a full matrix ring 
over K if and only if A, is isomorphic to a full 
matrix ring over Rp for all p. The index of 
A, is called the p-index of A, and the Hasse 
invariant of A, is called the p-invariant of A, 
which is denoted by (A/p). If p is an tinfinite 
prime divisor, then is equal to 1 or 2, and in 
each case, we define the p-invariant by setting 
(A/p) = 0 or 1/2 (mod Z) correspondingly. The 
Schur index of A is the L.C.M. of the p-indices 
Wp for all p and coincides with the exponent of 
A. This is called simply the index of A. On the 
other hand, the p-invariants satisfy (A/p) = 0 
(mod Z) except for a finite number of p, and 

X(A/p) = 0(modZ). 

p 

Conversely, given a rational number for 
each p such that (i) Pp = o (mod Z) except for a 
finite number of p; (ii)pp = 0 (mod Z) if p is 
infinite and imaginary, Pp = 0 or 1/2 (mod Z) if 
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P is infinite and real; (iii) SpPp =0 (mod Z), 
then there is a uniquely determined algebra 
class of central simple algebras A over K such 
that (/4/p) = Pp (mod Z) for each p. In this way 
the structure of the Brauer group over a finite 
algebraic number lield is completely deter- 
mined (Hasse, 1933). 

H. Frobenius Algebras 

Let A be an algebra over a field K. A is called 
a Frobenius algebra if its tregular representa- 
tion and tcoregular representation 362 
Representations E) are similar. Thus A is a 
Frobenius algebra if the left A-module A and 
the dual module A* of the right A-module A 
are isomorphic as left A-modules. Let 



n ri n ri 




be direct decompositions of A into indecom- 
posable left (right) ideals (-^ Section E). We 
denote by C;. Then A is a Frobenius alge- 
bra if and only if there exists a permutation n 
on 1, . , n such that (i) Ae, ^ A)*; (ii) r,- = 

When there exists a permutation satisfy- 
ing only condition (i), A is called a quasi- 
Frobenius algebra. 

For a subset S of A, the ideals I(S) = 
{fl6A|flS = 0} and r(S)-{a6 A I Sa = 0} are 
called the left annihilator and right annihilator 
of S, respectively. Then A is a quasi-Frobenius 
algebra if and only if (ii) /(r(I)) = I and r(/(r)) = r 
for any left ideal I and any right ideal r. In 
general, if we are given a left (right) A-module 
M, we denote the right (left) A-module 
Hom^(M, A) by fi. Then if A is a quasi- 
Frobenius algebra, there is a canonical iso- 
morphism ]Q ^ M, and the annihilator relation 
gives a one-tO-one and dual correspondence 
between the set of the submodules of M and 
the set of submodules of M (M. Hall). If a 
quasi-Frobenius algebra A satisfies (iv) dim r 3- 
dim I(r) = dim i + dim r(l) = dim A for any left 
ideal 1 and any right ideal r, then A is a Fro- 
benius algebra; the converse is also true. 

A criterion for an algebra A to be a Frobe- 
nius algebra is that there is a linear form 
2(x) on A such that if /l(xa) = 0 for all xe A, 
then a = 0. Moreover, if ^ satishes /l(xy) = 

2( yx) (x, ye A), then A is called a symmetric 
algebra. For example, semisimple algebras 
and group algebras are symmetric algebras. 

If A is a symmetric algebra, then for any left 
(right) A-module M the right (left) A-modules 
M* = Homj.(M, K) and M = Hom^(M, A) are 
canonically A-isomorphic. 

If A is a Frobenius algebra, the radical N of 
A satisfies 1{N) = r(N), and the annihilator of 
N is a principal left and principal right ideal; 



the converse is also true. For a two-sided ideal 
z of a Frobenius algebra A, the quotient alge- 
bra A/z is a Frobenius algebra if and only if 
l(z) and r(z) are a principal left and a principal 
right ideal, respectively. If A is a symmetric 
algebra, then any two-sided ideal z satisfies 
l(z) = r(z), and A/z is also a symmetric algebra 
if and only if l(z) = r(z) is a principal ideal 
generated by an element in the center. 

Furthermore, for any extension L of the 
coefficient lield K, A^ is a quasi-Frobenius 
algebra (resp. Frobenius algebra, symmetric 
algebra) if and only if A is (T. Nakayama, 

1939, 1941). The concept of Frobenius algebras 
has been extended to algebras B over a ring A 
(F. Kasch, 1954). 



I. Uniserial Algebras 

In the notation of the preceding section, if each 
indecomposable left ideal Ae^ (right ideal e^A) 
of an algebra A over K has a unique compo- 
sition series, then A is called a generalized 
uniserial algebra. If an algebra A is decom- 
posed into a direct sum of ideals which are 
primary rings, then it is called a uniserial alge- 
bra. Any left module over a generalized uni- 
serial algebra A is decomposed into a direct 
sum of submodules which are A-homomorphic 
images of Ac,-; An algebra whose radical N is 
a principal left and principal right ideal is a 
generalized uniserial algebra. For an algebra A 
to be uniserial, it is necessary and sufficient 
that every two-sided ideal of A be a principal 
left and principal right ideal. Hence A is uni- 
serial if and only if every quotient algebra of A 
is a Frobenius algebra. If A^ is uniserial for an 
extension field L of K, then A itself is uniserial. 
The converse, however, is not always true. If 
A^ is uniserial for any extension lield L of K, 
then A is called an absolutely uniserial algebra. 
For A to be absolutely uniserial it is necessary 
and sufficient that its radical N be a principal 
ideal generated by an element in the center Z 
and Z be decomposed into a direct sum of 
simple extensions of K (i.e., ideals of the form 
K [a]) (K. Asano, G. Kothe, Nakayama, G. 
Azumaya). 



J. Algebraic Algebras 

Here, we consider general (not necessarily 
finite-dimensional) algebras A over a field K. 
We say that A is an algebraic algebra if every 
element of A is algebraic over K, i.e., every 
element of A is a root of a certain polynomial 
with coefficients in K. We say that A satisties a 
polynomial identity p{X^, ■ ■ , X„) = 0 or that A 
is a Pl-algebra with an identity p(Aj , , X,) = 




133 



0 if there exists a nonzero (noncommutative) 
polynomial p{X^, ,X„) in Xi, , X„ with 
coefficients in K such that p(aj , , a,) = 0 for 
all a, e A. A Pl-algebra satisfies an identity 
which is homogeneous linear in each variable, 
and also an identity of the form [xj , , x„]"' = 
0 (where [ ] is the sum E + x,-^ . . .X; taken 

over all permutations of 1,2, , n and + is the 
sign of the permutation). An algebra is said to 
be locally finite if any finite number of ele- 
ments of A generate a finite-dimensional 
subalgebra. For Pl-algebras, an affirmative 
answer was found for Kurosh’s problem, 
which asks whether an algebraic algebra is 
locally finite if the degree of any element gj of 
A (i.e., dim K [a]) is bounded. 



K. Brauer Group of a Ring 

Let be a commutative ring. An R-algebra is 
called a separable algebra if A is tprojective as 
a two-sided A-module (- 200 Homological 
Algebra F). When the base ring is a field, this 
agrees with the classical notion of separability; 
and A is separable over R if and only if AjmA 
is separable over the residue field R/m for 
every maximal ideal m of R. A central separa- 
ble algebra is also called an Azumaya algebra. 
If R is a finitely generated faithful projective 
R-module (briefly: R-progenerator), the endo- 
morphism ring End^(P) is an Azumaya R- 
algebra. Azumaya algebras A, and A, are 
said to be in the same class (similar) if there 
exist R-progenerators and such that 
A, ® End^j(Pi) ^ A, ® Endu(P 2 ). The set of 
similarity classes forms an Abelian group 
with respect to ®. This is called the Brauer 
group B(R) of R (Auslander and Goldman 
[10]). Every element of B(R) is of finite order 
[12,13]. 

B(R) is a covariant functor from commuta- 
tive rings to Abelian groups. If R is a field, 

B(R) coincides with the classically defined one 
(- Section E). If R is a tHenselian local ring 
with the residue field k, the mapping B(R)-+ 
B(k) is an isomorphism [9], If R is a tregular 
ring with the quotient field K, B{R)~^B{K) is 
injective. If further dim R ^2, we have B(R) = 
P)pR(Rp), p running over all primes of height 
1 of R, where Rp is the localization of R at p 
and B(R) and B(Rp) are considered as em- 
bedded in B(K) [10]. As an example, put- 
ting these facts together with the structure of 
B(K) of the algebraic number field K (->. Sec- 
tion G), we have the structure of B(R) of the 
ring of integers of K: B(R) = 0 if K is totally 
imaginary, and ^ (Z/2Z)''^^ if K has r( > 0) real 
infinite places. 

A commutative R-algebra S is called a split- 
ting ring of A if the S-algebra S ® A is isomor- 
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phic to Ends(P) for some S-progenerator P. 

We denote by B{S/R) the subgroup of B(R) 
consisting of all algebra classes split by S. 

Since an Azumaya algebra over a ring need 
not have a Galois extension as splitting ring, 
the description of the Brauer groups by means 
of Galois cohomology ceases to have full 
generality. Instead we have the following exact 
sequence of tAmitSUr cohomology, assuming 
S is an R-progenerator (Chase and Rosenberg 
[ 1 1 ] ): 0 ^ ' (S/R, U)^Pic{R)^H “(R/R, Pic) 
-^H\S/R, U)^B(S/R)^H\S/R, Pic)-> 

H ^{S/R, [/)-> , where we denote by [/( 7’) 
and Pic(T) of a commutative ring T the unit 
group and the -i-Picard group of rank 1 projec- 
tive modules of T, respectively. The full Brauer 
group B(R) is mapped monomorphically into 
H^{R, U) = \imH^{S/R, V), the limit over the 
tfaithfully fl^ R-algebras S. 

Grothendieck and others studied the Brauer 
groups in a more general geometrical context 
[ 12 ]. 
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Asymptotic Series 



A. Asymptotic Series and Asymptotic 
Expansions 



asymptotic expansions in power series for x-*- 
00. Furthermore, if f’(x) has an asymptotic 
expansion for x->-cc, its expansion is obtained 
by termwise differentiation of the /(x)’s. 

For any asymptotic power series 
there always exists a smooth function f(x) 
defined in such that 



Let (p„(x), n = 0, 1, 2, be functions delined in 
R + . The sequence of functions {<?„(x)}*=o 
called an asymptotic sequence for oo if 

(p„+i (x) =o((p„(x)) as x^co (1) 

for each n. When is an asymptotic 

sequence, the formal series ^ a„(p„{x) is called 
an asymptotic series. In most cases, the func- 
tions (p„(x) have the form [j/{x)(p{x)”. When 
(p„(x) = x~” an asymptotic series is called an 
asymptotic power series. A function f(x) de- 
fined on R+ is said to have the asymptotic 
expansion (or be asymptotically developable in 
the form) 



f{x)'^ao(poix) + ai(pi{x) + ...+a„(p„ix) + ... ( 2 ) 

as 00 if f(x) satisfies 

f{x)-ao(po{x)-a^(p^{x)-...-a„(p„ix) 

= 0{(p„+i{x)) (3) 

for any integer as oo. The coefficients 
a, (n = 0, 1, ) appearing in (2) are uniquely 
determined. This fact immediately follows 
from the formulas 

«o= lim/(x)/(Po( 4 .-M 

X~>oO 

(/(x)-ao<poW---- 

X-^OD 






/(x)~flo-|-aiX ^+U2X ^ + ... 

for 00 , and all the derivatives of f(x) are 
asymptotically developable in power series. 
Indeed, a function delined by 

CO 

Z a„0{xlt„)x~'' 

n = 0 



satisfies the required properties if 6{t) is a 
smooth function satisfying 




and the sequence satisfies 1 ^ ^ 

... and 



|n{n-(-l)...(2n — l)2nu„f„ ^1^2 " 
for all n. 

Until now only asymptotic series for x-> 

00 have been considered, but the notion of 
asymptotic series has been adapted in vari- 
ous forms to the spaces of functions under 
consideration. 

(a) Asymptotic series in a complex domain. 
Let a be a boundary point of an open con- 
nected domain D in the complex z-plane. When 
the asymptotic behavior for z->a of holomor- 
phic functions in D is considered, we adopt as 
an asymptotic sequence {(p„(z)}“=o of holo- 
morphic functions in D satisfying for each 



For example, the '*'Hankel function of the first 
kind H*'’(x) has an asymptotic expansion 







nv n\ 
X 

. 2 4) 



y (v, m) 

A(-2/x)'"’ 



(4) 



(Pn + l{z) = 0{(p„{z)) 

as z tends to a through D . Let D be an angular 
domain with vertex at ct. For any asymptotic 
power series 11*= q always exists 

a function f(z) that is holomorphic in D and 
that admits an asymptotic expansion 






3^). . . [4v^-(2m-l)2] 



If V 4- ^ is not a positive integer, 2m=o 
(— 2ix)~"' is not convergent, and (4) means that 
for each n, 



H<i>(x) 




V7T 71 \ 

Y~4) 



" ‘ (v, m) 



= 0(x " 



for X-^OO. 

Assume that f(x) and g(x) admit asymptotic 
expansions in power series for x-t oo. Then 
all functions delined by /(x)-|-^(x), /(x) — 
six), f{x)g{x), f{x)lg{x) (lim^^^ ^/(x) ^0) admit 



+ ■■■ + a„{z-aY + ... (5) 

for z^a. Concerning the uniform convergence 
of asymptotic expansions, Carleman’s theorem 
is well known [2]: If the asymptotic expansion 
(5) is valid as z tends to a through an arbi- 
trary angular domain in the Riemann sur- 
face of log(z — a) having a as its vertex, then 
the asymptotic expansion (5) is uniformly 
convergent. 

(b) Asymptotic expansions by a large para- 
meter. In the theory of differential equations, 
functions with a large parameter are often 
considered. Let Q be a domain in R” and k a 
parameter ^ 1, and let (p^(x, k), n = 0, 1, 2,. , 
be functions defined for (x, k) e Q X [ 1, oo). If 
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they satisfy for each xgQ and n 

(p„+i(x,k) = o{(p„{x,k)) (6) 

for k-^ 00, the sequence {(p„{x, /c)}„*=o called 
an asymptotic sequence, and the formal series 
S^=o a„(x)(p„{x, k) is called an asymptotic series. 
A function f(x, k) is said tO be asymptotically 
developable in the form 

fix, k) - aQ{x)(poix, k) + ai{x)(piix,k)^... 

+ a„{x)(p„{x, k) + . 

for /c-» 00 if for every X6 Q and every n 
fix, /c) - ao(x)(po(x, k) - ... - a„_i (x)<p„_i (x, k) 

= 0i(p„ix,k)) (7) 

holds as /c-> 00 . Especially, if the order rela- 
tions (6) and (7) hold for each n uniformly in 
xeD, we say that /(x, k) is uniformly asymp- 
totically developable. In most cases, the 
(p„ix, k) are of the form i/^(x, k)k~”. 



B. Methods of Asymptotic Expansion 



There are several general methods of obtaining 
asymptotic expansions of functions represented 
by integrals with a large parameter. These 
methods are useful in applications because 
most tspecial functions are represented by 
integrals (- 39 Bessel Functions, 389 Special 
Functions). 

(a) Laplace’s method. Fet h(s) be a smooth 
real-valued function in [a — a -i- 5] (5 > 0) 
satisfying 

lt(a) < li(s) for all s e [fl - a + 6] - {o} 
and 



h"(a) >0, 

and let ^(s)6C®([fl — ^]). Then 
Ca+d 



I(x) = 



e ^''^^^gis)ds 



J<o-s 



has an asymptotic expansion 








-t-CjX , 



-|- c„x " -t- . . . 

for x-^ 00, where the c„ are determined by 
derivatives of h(s) at 5 = a of order ^ 2n -I- 2 
and those of g(s) at s = a of order ^ 2n. 

As an application of this method, con- 
sider the r -function r(x) = Jo ® dt. By a 
change of variable [ = xs, we have F(x) = 
x^Jg e ~***^’°®*^s ds. Since for any ^ > 0 



ri-d 1 




■* 00 








J'O 




J l+(5 
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the asymptotic expansion by the Faplace 
method 



H-d 



,2n 

g-x(s-|og.,)^-l / (1+CjX * + ... 



l-<5 



implies tStirling’s formula 



F(x) = ,y^x* ^{1 -hO(x ^)). 

(b) Stationary phase method. Let h(s) be a 
real-valued smooth function detined in [a 
(5, a -f such that 

h’(u) = 0, h"(u) andh’(s) ^^0 fors#a. 



and let g(s) be a function in Cq (a a + 6). 
Then 






I(X) 






has an asymptotic expansion 



/{x)'ve 



ixh(a) ^ianjA- _ 



1 



gia) 



■c,x ^ -f .. 



+ c„x "-b 



where 0 = h"ia)/\ h"(u)) and the c„ are deter- 
mined by derivatives of /i(s) at s = a of order 
^ 2n -h 2 and those of g(s) at s = a of order 
< 2n. 

Further, tintegration by parts and the 
tmethod of steepest decent are often used to 
derive asymptotic expansions of integrals with 
a large parameter. 



C. Application to Differential Equations 

For a system of linear ordinary differential 
equations with an 'firregular singular point at 
2=0, even when there exists a formal power 
series solution, the power series is generally 
divergent. Taking the hint given by Stirling’s 
formula for the f-function, Poincare intro- 
duced the notion of asymptotic expansions 
and succeeded in giving an analytic meaning 
to formal solutions of divergent type. Actual 
solutions for tdifference equations, tdifference- 
differential equations, and tordinary differen- 
tial equations of canonical form (including 
second-order linear ^differential equations 
of confluent type) always have asymptotic 
expansions. 

In the study of certain ordinary differential 
equations containing a large parameter k, an 
asymptotic solution is often obtained in the 
form 

u(x, k) + 

+ a„(x)k '" + ...) 

(■'^WKB method). Also, in the study of linear 
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partial differential equations, an asymptotic 
solution of the above form plays an important 
role. For example, consider the Schrodinger 
equation 

1 du 1 

= — — --7^Aw-i;(x)m = 0. 

u dt (jA) 

An asymptotic series 

u{x, t; A) 'V ( a^(x, t) + aj(x,t)^+ ... 

y lA 

-ha„(x,t)(^^^ +.j (8) 

satisfies L;^[u] -0 forA->ooif 
d(p 

— {'^(pf v{x)(p = 0 (eikonal equation), 
da. 

— 2V(p-Va„ — A(pa„=^Aa„.,, n = 0, 1, 2, 

dt 

(transport equation), 

where we set a _j = 0. For each asymptotic 
solution of the form (8) there exists an actual 
solution of L^[/{x, t; A)] = 0 which is asymp- 
totically developable in the form 

/(x,t;A)~ f 

n = 0 

Then asymptotic solutions of the form (8) can 
describe the fundamental properties of the 
phenomenon governed by the equation L;^\_u] 

= 0 . 

tGeometric optics can also be illustrated by 
asymptotic solutions of the form (8) (-^ 325 
Partial Differential Equations of Hyperbolic 
Type L). The ideas of asymptotic series, asymp- 
totic expansions, and asymptotic solutions 
are essential to the theory of tpseudodifferen- 
tial operators and tpourier integral operators 
345 Pseudodifferential Operators). 
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Automata 

A. General Remarks 

Automaton is a generic term for the math- 
ematical models for automatic machines hav- 
ing memory devices and capable of perform- 
ing definite acts within each unit of time. 

There are several types of automata, namely, 
Turing machines for the theory of computa- 
bility [6], linear bounded automata, push- 
down automata, and finite automata for math- 
ematical language theory [10-1 3]. These 
models are also used in the theory of tcomplex- 
ity of computation of algorithms [14]. 

B. Turing Machines 

Since the tarithmetization of metamathematics 
was used in the proof of tGodefs incomplete- 
ness theorem, mathematical formulations of 
the notion of teffectively computable functions 
have been attempted by many mathematicians, 
including S. C. Kleene, A. Church, and others, 
as well as by Godel himself (-^ 356 Recursive 
Functions). With this end in mind, A. M. Tur- 
ing and E. E. Post independently introduced a 
sort of ideal computer [1,2], called a Turing 
machine. 

Such a machine is capable of being in one 
of a finite number’of internal states (possible 
contents of a memory device) . j Q„ 

fixed for the machine; and the machine is 
supplied with a tape divided into squares. The 
tape is infinite in both directions. Each square 
can be blank or can have printed on it any one 
of the previously specified symbols Sq, si, . , 
s^, where Sq stands for the blank. In any situa- 
tion the number of nonblank squares is finite. 

Let Q = {qo,qi,---,cin], M = (so,Si, ...,s„,}, and 
consider the mapping t : Q x M-Q x M x 
{R, L, 0}. A Turing machine T is specified by 
{Q, t, F)> wherc F is a subset of Q called 
a set of final slates and Qq is the specified ini- 
tial state. If t(q, s) = (q\ s’, x), then the 5-tuplc 
(q, S, q', s', x) is called an instruction for T. The 
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machine acts as follows: (1) At the start, the 
machine is in the state qo- Some sequence of 
symbols in M are written on the tape as an 
input, and the machine is on the square next 
to the leftmost nonblank symbol. (2) With 
respect to the present state q and the scanned 
symbol, the machine selects an instruction 
{q, s, q\ s’, x); then the next state becomes q\ $ 
is replaced by s', and the machine moves one 
square right or left or stands still according 
as X is R , L, or 0. (3) The machine stops when 
the state becomes an element of F, and then 
the sequence of symbols on the tape is the 
result of the computation and considered to 
be the output. A Turing machine is called 
deterministic if there is always at most one 
instruction associated with a given pair [q^ s); 
otherwise it is called nondeterministic. 

We represent a natural number y on the 
tape by y + 1 squares, each of which has the 
symbol Sj printed on it. A pair is repre- 

sented on the tape by the following sequence 
of symbols of length y, + }^2 + 5: one Sq, the 
representation of y,; one .Sq, the representation 
of y 2 , and one Sq. Similarly, in general we can 
represent a k-tuple (y„ . , y^) of natural num- 
bers on the tape. Let (p be a partial function 
of k variables, k ^ 0. We say that a Turing ma- 
chine T computes ^ if T starts with the tape 
in which the representation of (Xj , , x^j) is 
printed and stops with the tape having the 
representation of (Xj, , x^,x) when 0(X[, 

. ..> xj is defined and equal to x. A partial 
function is computable (in the sense of Turing) 
if there exists a Turing machine that com- 
putes the function. It is known that a partial 
function is computable if and only if it is par- 
tial recursive. The equivalence of comput- 
ability (in the sense of Turing) and recursive- 
ness for number-theoretic functions is strong 
evidence for the validity of Church’s thesis 
(- 356 Recursive Functions). 

We can apply a Turing machine not only to 
the computation of number-theoretic functions 
but also to the transformation of a finite se- 
quence of letters in any language with a finite 
list of letters. A finite nonempty list of distinct 
symbols is called an alphabet, and a finite se- 
quence of symbols in the alphabet is called 
a word in that alphabet. All the words in an 
alphabet constitute a tfree monoid if we add 
the empty sequence and think of the concate- 
nation of two words as its multiplication. We 
call a finite nonempty list (A BJ, , (A„ BJ 
of pairs of words a set of production rules. By 
an application of a rule BJ to a word W, 
we mean that an occurrence of A, exists in W, 
and by the application, is replaced by B,-. 
We say that two words R and S are equivalent 
if they are transformable into each other by a 
finite number of applications of the production 
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rules. The following problem is called Tbue’s 
(general) problem: Is there an algorithm for 
deciding, for any given alphabet and a set of 
production rules, whether or not any given 
two words are equivalent? E. L. Post and A. A. 
Markov proved by applying Turing machines 
that this decision problem is unsolvable [4, 3] 
(a negative solution for the so-called word 
problem for semigroups). 

C. Universal Turing Machine and 
Generalizations 

Turing showed that it is possible to construct 
a machine U with the following property [1]: 

If we supply U with a tape on which a suitably 
coded list of instructions of any given machine 
T is printed then U successively prints on it 
the behavior of T which results from the exe- 
cution of the instructions of T. In other words, 
V simulates the operations performed by any 
machine. Such a machine U is called a univer- 
sal Turing machine. We can construct a univer- 
sal Turing machine with only two internal 
states and also one having only two symbols, 
including the blank Sq (C. E. Shannon and J. 
McCarthy [7]). 

Eet i/^, , . , be given functions that are all 
defined everywhere on the domain considered. 
Then we can relativize (-^ 356 Recursive Func- 
tions) the notion of computability of functions 
(in the sense of Turing) to Indeed, 

we can consider a machine acting as follows: 
When the machine is in a state q,- ( 1, . , , , k 
and we assume that these states are specified 
correspondingly to i//j , . , and are not in 
E), then the machine prints the representation 
of 'Pjiyi, , Y„) to the right of (pi, , y„) 
and scans the resulting {nj + l)-tuple (y,, • , 
y„j, ijjjiyi, , y,,)). In this process, the symbol 
a that is written to the right of (y,, . . , y,,) is 
moved one step to the right so that a is pre- 
served on the right of the resulting (n -|- 1 )- 
tuple. Extending the preceding, Kleene defined 
a machine that computes a functional of finite 
types [S]. A functional of variables of any 
finite types is computable by such a general- 
ized Turing machine if and only if it is partial 
recursive. 



D. Language and Automata 

Eet M and N be mutually disjoint alphabets 
and V — MU N. Then G =(iV, M, P, S) is called 
a Cbomsky grammar if P is a set of production 
rules consisting of finite number of pairs of 
elements of F* and S is specified element of N, 
where V* means the free monoid generated by 
V. Eor a word UE V*, if the result of a finite 
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number of applications of rules in P is v, then 
we write and v is said to be derived from 
U by G. The language generated by G is L(G) 

= {u S^U and ugM*}. N. Chomsky classi- 
fied the grammars into three fa mi lies with 
respect to the form of production rules: (1) 
Context-sensitive grammar: For each (u, v)eP, 
the length of u is less than or equal to the 
length of r. (2) Context-free grammar: For 
each (u, v)eP,u is an element of N and v is 
not a empty word. (3) Regular grammar: For 
each (u, \) eP, w e N and v is either an element 
of M or of the form ax, where aE M and XE N. 
Let Fi(M), FjiM), and F,(M) be the families 
of languages generated by the grammars in 
the corresponding families classified as above. 
Then F^{M)^F 2 {M)^FJ{M) holds and this 
is called a Chomsky hierarchy [ 10]. Formal 
language theory studies various structures of 
these families of languages. 

Corresponding to the three families of lan- 
guages, we have the following families of auto- 
mata: (1) A deterministic (nondeterministic) 

linear bounded automaton is a Turing machine 
with input tape, namely, T= (g, JWL7 N, r, F) , 
where Q is the set of states, M an input alpha- 
bet, N a tape alphabet, t is a mapping from 
Q x{MU{e}) x{NU{e}) into QxN x {L,R] 
(the power set of Q x N x {L, R}), QqE Q Is the 
initial state, and f is a subset of Q. A word 
we M* is placed on the input tape, and T 
starts its computation from the initial state 
with blank tape and by reading one symbol 
from the input tape, and with respect to the 
mapping t, we decide the configuration of the 
next step. If at a certain stage, T reaches a 
state in F when T reads up w and if the length 
of the tape used is a constant multiple of the 
length of the input tape, then w is said to be 
accepted by T. (The special symbol e makes 
it possible to change the configuration without 
moving the input tape.) (2) A push-down auto- 
maton is a Turing machine with an input 
tape. The working tape is of infinite length 
and moves in only one direction, say to the 
right. The computation with input word we 
M* starts with the position at the left end of 
the work tape. It uses the tape according to a 
first-in-first-out principle; namely, the ma- 
chine may write a symbol to the blank square, 
but if it is on the rightmost nonblank square, 
it may not go left without erasing the non- 
blank symbol. If it reaches a state in F, then 
w is said to be accepted by T. (3) A finite auto- 
maton is a machine without its infinite tape; 
a word w is accepted if T reaches a state in F 
(without using the symbol e). 

Let L(T) be the set of words over M ac- 
cepted by T; then L(T) is called the language 
accepted by T. Corresponding to the three 
families of automata, DF;(M) and NF,(M), i — 



1, 2, 3, denote the family of languages ac- 
cepted by deterministic or nondeterministic 
automata. It is known that DF;(M) = NF,(M) = 
F;(M) for i = 2 and 3, and NF^ (M) = F^ (M); 
however, whether OF^ (M) = NF,(M) holds or 
not is still open, and this problem is called the 
LBA problem. These automata are used to 
study the closure properties under various 
operations within the families F-{M). In par- 
ticular, the LBA problem is equivalent to the 
problem of whether F,(M) is closed under 
complementation. 

E. Applications of Automata Theory 

Finite automata are formulated as models for 
digital circuits with memory. Therefore there 
exist structural studies of finite automata; for 
example, methods to decompose a given finite 
automaton to smaller automata correspond- 
ing to a circuit and its subcircuits are studied 
using tsemigroup theory [14,17]. 

Context-free grammars are useful in describ- 
ing the syntax of programming languages. For 
a context-free grammar G, a derivation tree of 
X e L(G) is a tree whose terminal nodes are 
labeled by each symbol of x, its root is labeled 
by S, and other nodes are labeled by some 
elements of N in such a way that if for a node 
with A 6 N the nodes connected by an out- 
going edge from the former have labels Xj, 

... ,x^, then (A, Xj, , xj is a production rule 
of G. Parsing is an algorithm to construct 
a derivation tree of each xe L(G). Various 
parsing algorithms have been studied within 
subfa mi lies of context-free grammars [16] and 
they are used in the construction of compilers. 
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Automorphic Functions 

A. General Remarks [1] 

Let X be a (complex) tanalytic manifold and E 
a ^discontinuous group of (complex) analytic 
automorphisms of X. A set of tholomorphic 
functions (without zero) on X, (j,(z)) (ye E), is 
called a factor of automorphy if it satisfies the 
condition j„.(z) =j^{Y{z))jy{z) for all y, y'e E, 
zeX.A tmeromorphic function f on X is 
called a (multiplicative) automorphic func- 
tion with respect to the factor of automorphy 
{j,(z)l if it satisfies the condition /(y(z)) = 
f{z)jy{z) for all ye r, zeX. When all j,(z) 
are identically equal to 1, i.e., when f(z) is E- 
invariant, / is simply called an automorphic 
function with respect to E. When all jy are con- 
stant (and hence y-j, is a “quasicharacter” 
of E), / is called a multiplicative function. If 
we denote by J,(z) the functional determinant 
of the transformation y, then for an integer m, 
an automorphic function / with respect to 



a factor of automorphy of the form j„(z) = 

J,(z))” is called an automorphic form of weight 
m. (In this case, it is customary to assume 
that fis holomorphic. Also, in this and most 
other cases, we add a suitable condition on 
the behavior off at the “points at infinity,” 
when E\X is not compact (- 234 Kleinian 
Groups).) 

B. The Case of One Variable [2-6] 

Except for the cases where E is a tinite group 
or its extension by a free Abelian group of 
rank ^ 2 (^ 134 Elliptic Functions E), it is 
essentially enough to consider the case where 
X is the upper half-plane § = {z£C lmz>0}. 
In this case, E may be obtained as a tdiscrete 
subgroup of the tspecial linear group G = 

SL(2, R). In the following, we restrict our- 
selves to the case where E is a +Fuchsian group 
of the first kind (i.e., the case where the tHaar 
measure p(E\f>) of the quotient space E\§> is 
finite; 122 Discontinuous Groups). An 
automorphic function (or Fuchsian function) / 
with respect to E is, by definition, a meromor- 
phic function / on § satisfying the following 
conditions: (i) / is E-invariant, i.e.,/(yz)=/(z) 
for all ye E; (ii) / is also meromorphic around 
any tcusp Xq of E, i.e., if (Po is a real linear 
fractional transformation mapping Xq to oo 
(e.g., (p^{z) = -l/(z -Xq)) and if the trans- 
formation z^Z + h(h>0)isa generator of 
(where E^^ is the '’'stabilizer of Xq), 
then /((/)q“‘(z)) can be expanded into a '•'Lau- 
rent series of qi, = exp((27zi/h)z) (which has only 
finitely many terms with negative exponent) in 
a neighborhood Im z > y, of oo (i^f;, is called the 
local parameter around the cusp Xq). If we 
denote by 9?^ the compact tRiemann surface 
obtained from the quotient space E\§ by 
adjoining a certain (tinite) number of “points 
at infinity” corresponding to the equivalence 
classes of cusps of E, then conditions (i) and 
(ii) amount to saying that / gives rise in a 
natural manner to a meromorphic function on 
iKp. Thus the field of all automorphic func- 
tions with respect to E can be identified with 
the talgebraic function field belonging to the 
Riemann surface 9?p (-^ 11 Algebraic Func- 
tions A). (While any (nonconstant) automor- 
phic function with respect to a Fuchsian group 
of the first kind E has the real axis as its '•'nat- 
ural boundary, an automorphic function with 
respect to a Fuchsian group of the second 
kind can always be analytically extended to 
the lower half-plane through a neighborhood 
of any “ordinary point” on the real axis (-^ 

234 Kleinian Groups).) 

Let k be an even integer. A (holomorphic) 
automorphic form (or Fuchsian form) f of 
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weight k or of dimension -k with respect to r 
is, by definition, a holomorphic function f on 
§ satisfying the following conditions: (i) For 

every = ^ 

other words, = {da{z)/dz)~'^^^ = (cz + df 

is the factor of automorphy.) (ii) f is also 
holomorphic around any cusp Xq, i.e., in the 
above notation, we have an integral “Fou- 
rier expansion” {z)){d(p^'^{z)/dzf^^ = 

Note that this definition of weight 
is different from that given in Section A (- 
Sections F, G). If, moreover, the constant term 
a, in this Fourier expansion vanishes at all 
cusps of r, / is called acuspform. In partic- 
ular, fis a cusp form of weight 2 if and only 
if f[z)dz gives rise in a natural manner to a 
tdifferential form of the first kind on the 
Riemann surface iRp. (By a slight modifica- 
tion of condition (ii), we can also define an 
automorphic form of odd weight k. When 

^eF, condition (i) (for k odd) im- 
plies that f is identically equal to zero. So as- 
f-\ 0\ 

sume that I ^ \p’ cusps Xq are 

classified into two categories, according as 

case we should replace the power series in in 
condition (ii) by x (power series in 
(Note that the above definition of weight is 
slightly different from the general one given in 
Section A. See also the Siegel and Hilbert 
modular cases (- Sections F, G).) 

We denote by 2Rjt(r) (resp. ©^(r)) the linear 
space of all automorphic forms (cusp forms) of 
weight k with respect to F. Since clearly the 
relations c 8^+^- hold, 

the direct sum 5R(F) = is a (commuta- 

tive) tgraded algebra, of which S(F) = 
is an ideal. It is known that is of finite 

dimension, and = dim 93J^(F), = dim S^(F) 

can be determined (from the tRiemann-Roch 
theorem) as follows: 

dj = 0 for k< 0; 






classes of cusps of F, and g is the tgenus of the 



Riemann surface (When 




^F, 



we can obtain a similar formula for k odd 
except for d, , d°, which, in general, cannot be 
determined from the Riemann-Roch theorem.) 

One method of constructing automorphic 
forms is provided by Poincare series as fol- 
lows; Let Gq be a tfinear fractional transfor- 
mation that maps the unit disk to the upper 
half-plane (e.g., the tCayley transformation), 
and put F' = ffo^^Fo-o; let ^ be a function 
holomorphic on the unit disk including its 
boundary (e.g., a polynomial). Then for k ^4, 
the series 



P , ( z ) = Y. <p(«7'(2)) {da'{z)/dzf'^ 
ff'er 

is '^uniformly convergent in the wide sense (i.e., 
uniformly convergent on every compact set) in 
the unit disk and expresses an automorphic 
cusp form of weight k with respect to F', i.e., 
we have P^(oQ\z))idaoHz)/dzf/^e(Bkir). 
Conversely, every element in Sjij(F) (/c^4) can 
be expressed in this form. A series of this type 
is called a theta-Fuchsian series of Poincare (or 
simply Poincare series). 

For k ^ 3, we can define a (positive definite 
Hermitian) inner product on S;j(F) as follows: 



if,9) = 



f{z)g{z)y'^ ^dxdy, 



where z = x + iy and F is a tfundamental region 
of F in §. This is called the P etersson metric. 
Since this integral converges also for f, ge 
iOi^(F) if one of them belongs to 6^(F), we 
can define the orthogonal complement l£;i(F) 
of 8^(F) in 9)J^(F). As we shall see, (£j^(F) is 
generated by lEisenstein series. 

Any automorphic function / with respect to 
F can be expressed as a quotient of two auto- 
morphic forms /i ,/2 e 9?tfc(F) for a sufficiently 
large k. If we put w =/(z), the inverse function 
Z =f ~^(w) can be expressed as the quotient of 
two linearly independent solutions of a tfinear 
differential equation of the form d^zjdw^ = 
(p{w)z, where (p{w) is an talgebraic function 
belonging to the Riemann surface 9{p. 



C. Modular Functions and Modular Forms 

[4,7,8] 



= dfc — t for k even, ^ 4, 

where s is the number of the equivalence 
classes of telliptic fixed points of F, C; (1 < i s) 
is the order of the stabilizers of these elliptic 
points, t is the number of the equivalence 



The (elliptic) tmodular group 
F = F(1) 

= 5L(2, Z) 

<3, h, c, deZ, ad — hc = ij^ 
is a Fuchsian group of the first kind acting on 
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the upper half-plane The quotient space 
r\§ can be compactified to get a compact 
Riemann surface 91^ = F\§ U { oo } of genus 
zero by joining one point at infinity corre- 
sponding to the cusp 00 , around which we 
take q = as a local parameter. The dimen- 
sion dk = dim 50tfc(r) (k even, ^ 2) is given as 
follows: 

, _ |[/c/12] for k = 2 (mod 12), 
[k/12]+ 1 for k^2 (mod 12), 

where [ ] is the tGauSS symbol. 

More generally, an automorphic function 
(automorphic form) with respect to the prin- 
cipal congruence subgroup 

3Ko 



is called a modular function (modular form) of 
level N. (For the number of cusps of F(A^), 
denoted by t(N), and the genus of -^122 
Discontinuous Groups.) At any cusp of T(N), 
the local parameter Qf, in (ii) is given by = 
e\p{2nizlN). For N, k ^ 3, the dimensions d^, 
d^ are given by the general formula of the 
preceding paragraph (including odd k). For 
k ^ 3, we defme the (extended) Eisenstein series 
by 



Gfc(z,Ci,C2,(V) +^22)*’ 



where c, , C2 are integers such that (Cj, C2, N) = 

1 and the symbol denotes the summa- 
tion excepting the pair (m^, m2) = (0,0). Then 
G^{z; Cj,C 2 ,N) depends only on Cj,c2 (mod N), 
and ifwe take a set {(c 1,02)} suchthat {(cj,C2), 

( — Cj, — C 2 )} forms a complete set of repre- 
sentatives of the (primitive) residue classes 
(mod N), then the corresponding set of Eisen- 
stein series forms a basis of the space (£j^(F(A)) 
(k ^ 3), whence we obtain dim = 

t(N) (k^ 3) (for details, including the case k= 

1, 2, see E. Hecke, Abh. Math. Sem. Univ. Ham- 
burg, 5 (1927)). 

The Fourier coefficient a, of the Eisenstein 
series can be calculated easily, and we get an 
estimate a, = 0(n '^'^) for k ^ 2, where £ is an 

arbitrary positive number. On the other hand, 
for cusp forms, Hecke (Abh. Math. Sem. Univ. 
Hamburg, 5 (1927)) gaveu„= 0(n*^^^). In an 
attempt to improve this estimate, H. D. Kloos- 
terman was led to consider the sum 



K(u,m)= Z + 

x(modq) \ ^ \ ^ / / 

(m,A4GZ), 

called the Kloostermau sum, which is also 
related to the arithmetic of quadratic forms. 
Using A. Weil’s estimate |K(u, l>, p)| ^ 



(where p is an odd prime, (u, p) = (v, p) = 1), 
based on the analog of the Riemann hypoth- 
esis, we obtain a, = The estimate 

a„~ is a weak form of what is 

known as the Ramanujan-Petersson conjec- 
ture (-^ Section D). In the case N = 1, we ob- 
tain the classical Eisenstein series G^{z) = 

G^j(z; 0,0, 1) (k even, ^ 4), from which we 
defme the modular forms g 2 {z)= 60G^{z), 
gj(z) = 140 G^(z), and A(z) = g\ ^ 21g\ of weight 
4, 6, and 12, respectively. If we denote hy p = 
p{u; I, z) the Weierstrass +p-function with 
the fundamental period (1, z) we have the 
relation = 4 — P — ^3, and A(z) is the 
Miscriminant of the cubic polynomial appear- 
ing in this relation. It is known that every 
modular form can be expressed uniquely as a 
polynomial in 02 and g^, or in other words, we 
have 9Ji(F(l))^C[g^2>i73l- polynomial in 
02 , expressing a modular form is, moreover, 
isobaric, i.e., consists of terms of the form 
€ 02^*02 , ceC, where 2 g+ 3v is a constant 
called the weight of the isobaric polynomial. 
Also, A(z) is a cusp form of the smallest weight, 
and the ideal of all cusp forms, S(F( 1)), is a 
principal ideal in 9[R(F( 1)) generated by A(z). 
The Eourier expansion of the Eisenstein series 
G^{z) is given as follows: 

where q = and is the ^Bernoulli num- 
ber. The discriminant A(z) is expressed as an 
infinite product as follows: 

OD 

A(z) = (27t)‘2^f](l -q'’f*. 

v^l 

If we put J{z) = gl/A, the function J{z) is a 
modular function and gives an tanalytic iso- 
morphism of the Riemann surface 9Ij’(j) = 
r( i)\§ u { co} onto the Riemann sphere 
CU{oo} (which maps (3, C4, 00 to 0, 1, 00 , 
respectively). Hence the field of modular func- 
tions ^r(i) is a rational function field C(J) 
generated by the function J. The analytic 
isomorphism class of tcomplex tori of dimen- 
sion 1 (= telliptic curves) £(«,, = C/(Zw,-i- 

Z(02) with the fundamental period (mj, 0)2) 
is uniquely determined by the F( l)-equivalence 
class of the modulus r = 0 ) 2 l(o^{e §), and hence 
by the value J(t) of the function J. This is the 
historical origin of the name of tmodular 
function [4]. 

As an example of a modular function of 
level 2, we have the A-function: 

p{{l+z)/2)-p{z/2) 

P(1/2)-^(z/2) ■’ 

which gives an analytic isomorphism of the 
Riemann surface 9?i-(2) = T (2)\§ U (three 
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points) to the Riemann sphere CU { 00 } (map- 
ping 0, 1, 00 to 1, 00 , 0, respectively; this prop- 
erty is used in a proof of the tPicard theo- 
rem). Hence we again have = C(A). The 
following relation holds between J and X: 

4(l-2 + 2^ 

^ ^ 2 7 

In general, the field 5Ir(V) modular func- 
tions of level N is generated by ( 02 / 03 )' 

+ a 2 z)lN; l,z) {a^,a 2 eZ). 



D. The H ecke Ring and Its Representation 
[S-1 13 



In general, let T be a group, F a subgroup of 
r, and suppose that, for every geT^oYg ^ 
and r are tcommensurable, which amounts to 
saying that every double cOSet Fcrr is a union 
of a finite number of left or right cosets of 
r. Then in the free module generated by all 
double cosets FaF (tre F), we can define a 
bilinear associative product as follows: 

rGrrzr=Ym{G,7;p)rpr, 

r>r 

where, writing FaF = [J,- F(7,-,FtF = \Jj Txj, we 
denote by m(ff, p) the number of pairs {ij) 
such that FffjTj = Yp. The associative ring thus 
obtained is called the H ecke ring (or Heck e 
aigehra) and is denoted by Jf{T, F). (If T is a 
ttopological group and F is an open compact 
subgroup, the Hecke ring jT(r, F) can also be 
interpreted as the ring of all Z-valued continu- 
ous functions on F with compact support 
provided with a tconvolution product; 29 
Associative Algebras C.) 

As an example, setting f = GL^{2, Q), F = 
SL(2, Z ), we obtain a Hecke ring Jf(Y, F). In 
virtue of the theory of telementary divisors, a 
complete set of representatives of F\F /F is 
given by 



a^,a2GQ,al,a2>0,al\a2 



fa, 0 \ 

We write T(ai,a 2 ) = r( )F. From the 
\ 0 _ ^ 2 / 

relation T{a^a 2 ,a^a 2 )T{a^ ^,^2 ') = ^(^ 15 ^ 2 ) 
follows the commutativity of jf{T, F). Fur- 
thermore, if we put, for a positive integer n, 

T,= T(n) Y. T(a„a,}, 

then the following multiplication formula 
holds: 



T(n)- T(m) = X dT(d, d) . T{nm/d^). 

d\n, m 

We obtain a representation of dF(r, F) in 
5[R^(F) as follows. For T= YgY = U Fa,G 



jT(r, F) and / g 3J1^(F) we put 

(/I T){z)^Y^f{G,{z)){c,z + dr'‘det{G,)\ 

i 



where C; = ( ) and / is a fixed integer. This 

Vi dj 

representation leaves S/;(F) and ®)t(F) invar- 
iant. In particular, the representation of T„ 
(with l=k 1), called the H ecke operator 
(Hecke, Math. A nn., 114 (1937)), is tHermitian 
with respect to the Petersson metric. 

Following Hecke, we associate with every 
modular form f(z) = (9 “ a 

^Dirichlet series (^( 5 ) = 2„®=i a„n~\ Since a, = 
0 (n*'~* for / g9[R^ {k ^ 2 ), 1 ^( 5 ) is absolutely 
convergent in the half-plane Res > k. The 
conditions for f to be a modular form of 
weight k are equivalent to the following con- 
ditions for (p: (s-k) (p (s) can be extended to an 
Tentire function of fmite tgenus (actually, of 
genus 1) and, if we put R(s) = (2 ti)“*F(s)0)(s), 
R(s) satisfies a functional equation of the 
form R(k- s) = ( -\f>^R{s). A correspondence 
between f and (p that satislies these con- 
ditions is one-to-one; in fact we have a, = 

( — l)'‘/^Res^=k R(s), and the function g(x) = f{ix) 
— ao(x > 0 ) and R(s) are related by the tMellin 
transform as follows: 

R(s)= 0 (x)x*"‘ dx, 

Jo 



gix)^ 



27ii 



R(s)x-"ds. 



This correspondence between Dirichlet series 
and automorphic forms can be generalized to 
the case of a certain discrete subgroup of 
SL{2, R ), which is not necessarily of the first 
kind (Hecke, Math. Ann., 112 (1936); [ 10]). It is 
also known that if the functional equations for 
hold for sufficiently many char- 
acters X, then / = a„q" is a cusp form with 

a character for some congruence subgroup 
(Weil, Math. Ann., 168 (1967); [12]). 

Now, suppose that a linear subspace 931 of 
931, (F) is invariant under all T„ ( n = 1, 2, . . . ), 
and let (/i , . . . ,/J (fc = dim 931) be a basis of 931. 

If we denote by X{n) the /c X k matrix represent- 
ing T„ in this basis, it can be shown that there 
exist KXK (complex) matrices (I ^k) 
and 2(0) such that we have 

00 K 

i=i 

Similarly we associate with F(z) a matrix- 
valued Dirichlet series 

00 

0)(s)= 1 : n-^X(n). 

n=l 

Then from the multiplicative property of the 
X{n), we obtain the following tguler product 
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expression of 3>(s): 

p 

where denotes the unit matrix of degree k. 

In particular, when dimSR = 1, i.e., when 
= Qf where f is an teigenfunction of all Hecke 
operators 7^, the corresponding Dirichlet 
series (p(s) has an Euler product of the follow- 
ing form: 

p 

For instance, the Dirichlet series correspond- 
ing to the Eisenstein series is {2{2ni}'‘ {k — 
1)!)C(s)C(5— k+ 1) (where ( is the tRiemann 
zeta function), which has a well-known Euler 
product. Since Sj 2 = CA, the Dirichlet series 
corresponding to A(z) also has an Euler prod- 
uct. (This is part of the Ramanujan conjecture. 
Ramanujan also,conjectured that the quadra- 
tic polynomial 1 —ApX+ appearing in 

this Euler product has imaginary roots, i.e., 
\Xp\< 2p* Generalizing the Ramanujan 
conjecture, Petersson conjectured that all 
eigenvalues of the Hecke operator Tp in 
have absolute value (Ramanujan- 

Petersson conjecture). P. Deligne proved this 
conjecture for k ^ 2, and his method extends to 
the case when E is a congruence subgroup 
[Publ M ath. Imt. HES, 53 (1974)). For k= 1, 
see Deligne and J.-P. Serre (Ann. Sci. Ecole 
N orm. Sup., 7 (1974)). It may be noted that this 
conjecture was proved earlier for k = 2 (and for 
almost all p) by M. Eichler (1954) and by G. 
Shimura (1958) 450 Zeta Functions M). 



E. The Case of Many Variables [1, 15,161 

Eet X be a bounded domain in and E a 
tdiscontinuous group of analytic automor- 
phisms of X. The Poincare series of weight (~^ 
Section A) m ^ 2, defined similarly to the case 
of the unit disk, converges normally in X and 
expresses an automorphic form of weight m. If 
T\X is compact, let be a basis of the 

space of Poincare series of weight m. Then for 
a sufficiently large m (which is a multiple of 
the order of E^ for all x e X), the map X 9 z-^ 
(f,(z), ,/k(z)) defines in a natural manner a 
projective embedding of the quotient space 
r\^ into (C), which actually gives an 
analytic isomorphism of E\X onto a tnormal 
projective algebraic variety. It follows that the 
field of automorphic functions with respect to 
r is an talgebraic function field of dimension 
N, and that every automorphic function can 
be written as the quotient of two Poincare 
series of the same weight. In the case where 
r\X is not compact, we first discuss some 
examples. 
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F. Siegel Modular Functions [15,17,18] 



The Siegel upper half-space of degree fl (or 
Siegel space of degree n) is, by definition, the 
space of all n X n complex symmetric matrices 
Z—X + iY with the imaginary part Y > 0 (posi- 
tive definite). (^„ is analytically equivalent to a 
tsymmetric bounded domain.) The group of all 
(complex) analytic automorphisms of §„ is 
given by the real (projective) tsymplectic group 
Sp{n, R)/{ ± which acts transitively on 
by Z^(7(Z) = (AZ-hB)/(CZ + D)for (T = 




Sp{n, R). The subgroup E„ = Sp{n, Z) 



consisting of integral matrices, or the corre- 
sponding group of linear fractional transfor- 
mations, is called the Siegel modular group 
of degree h.r„is a tdiscontinuous group of the 
first kind acting on 

A Siegel modular form / of weight k or of 
dimension -k is, by definition, a holomorphic 
function on satisfying the following con- 
ditions: (i) for (j = Q r„, f((T{Z)) = 

f(Z)det(CZ -f- (ii) f has an integral Fourier 
expansion of the form f(Z) = 
where T runs over all half-integral tpositive 
semidefinite symmetric matrices of degree n. 
(Note that the Jacobian determinant J,(Z) of 



^ ~ ( c d) ^ det(CZ + 

) pqj. fj^2, condition (ii) is superfluous 
(M. Koecher, Math. Z., 59 (1954)). We denote 
by = the space of all Siegel modu- 
lar forms of degree n and weight k. When 

we write §„ 3 Z = with Zi jg 

C"“\ ze$i, and 



aRl”>3/(Z)= X 5)^^’'■"^ 

it turns out that <3o(Zi, 3) depends only on Zj 
and, writing it as /j(Zj), we have /j 
The mapping thus defined is a linear 

mapping from into which is SUf- 

jective if k is even and > 2n (H. Maass, Math. 
Ann., 123 (1951)). We denote the kernel of 
by and call / e a cusp form (viewing 
as a cusp of For / the fol- 

lowing three conditions are equivalent: (a) 
fe SI"’; (b) f(Z) = Sr>0 "re"'""”'; (c) 
|/(Z)det( is bounded. We have = 
{0} if k < 0 or if both k and n are odd, and 
= C. In general, dim is finite and 
= 0(k”^"'*'^^^^) (k-t oo). The graded algebra 
®l(rfi) = 2*=o finitely generated, and 

its tKrull dimension is n{n + l)/2 -t 1. In par- 
ticular, for n= 2 the structure of the graded 
algebra 9(R(r2) is determined explicitly. (The 
even part of this algebra is isomorphic to the 
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polynomial ring of four variables.) Also, 
dim and the singularities of the quotient 
variety r 2\§2 known explicitly (J. 1. Igusa, 
Amer. J. Math., 84 (1962), 86 (1964); U. 
Christian). 

As an example of Siegel modular forms, we 
have the Eisenstein-Poincare series, delined as 
follows: 

ki^)= I e2’^--tr(s<.(z)) + D) 

(T6r,as)\r„ 



where S is an fl rational symmetric matrix 
^ 0, and we put 



r„(S)- 



u vu 

0 ‘L/~‘ 



er„ 



vsu=s, 



This series is convergent for k> n + rank S + 1 
if rank S n and for k, 2w if S 0, and the 
totality of Eg (those series with S > 0) spans 
(0j,n)) (Maass, Math. Ann., 123 (1951)). 
tTheta series delined by integral quadratic 
forms are examples of Siegel modular forms 
(with a certain level) and are of great signifi- 
cance in the arithmetic of quadratic forms 
(C. L. Siegel, Ann. Math., (2) 36 (1935)). 

The quotient space F„\§„ can be compacti- 
fied as follows (I. Satake and W. Daily [15]); 
There exists a positive integer /cq SUch that, 
for any multiple k of /cq? any basis of 
defines in a natural manner a one-to-one 
biholomorphic projective embedding of F„\§„, 
of which the image is a Zariski open set of a 
normal projcctive algebraic variety. Since the 
structure of this projective variety is indepen- 
dent of k, we denote it simply by F„\§„ . In 
other words, F„\§„ is the projective variety 
Proj Sdl(F„) associated with the graded algebra 
3R(F„). Then we have F„\§„ = (J"=o fr\§r- 
When fi^2forr<n, F^\.§r codimension 
^ 2 in F„\§„ , so that (by virtue of tHartogs’s 
continuation theorem) the conditions at the 
points at infinity in the delinitions of modular 
functions and forms become superfluous. In 
fact, for n ^ 2, if we define a Siegel modular 
function of degree n simply as a F„-invariant 
meromorphic function on then it can auto- 
matically be extended to a meromorphic func- 
tion on the compactilication F„\§„, and hence 
be expressed as the quotient of two modular 
forms of the same weight. It also follows that 
the field of all Siegel modular functions of 
degree n is an algebraic function lield of di- 
mension n(n -j- 1)/2. (These results can also be 
obtained from the tpseudoconcavity of F„\§„ 
without using compactilication (A. Andreotti 
and H. Grauert, Nachr.^kud. Wiss. Gottin- 
gen).) For n = 2, the field of Siegel modular 



functions is a rational function lield. On the 
other hand, for Fi = 1 mod 8, n > 9, the field 
of Siegel modular functions is not rational 
(E. Freitag, Abh. M ath. Sem. U niv. Hamburg 
(1978)). 

Let F be a subgroup of F„ of finite index. 

Then F\§„ can be compactified in the same 
way as above [23]. However the singularities 
of F\§„ are extraordinarily complicated (Igusa 
[24] ). When F is the principal congruence 
subgroup F„(iV) of level N > 2, the blowing-up 
of F„{A)\§„ along the cusps is nonsingular for 
n = 2, 3 (Igusa, Math. Ann., 168 (1967)). Fol- 
lowing Igusa’ s idea, it is possible to construct 
an explicit nonsingular compactification of 
r\D, in which D is a bounded symmetric 
domain and F is a suitable arithmetic sub- 
group of Aut(D) ([19]; see also Satake, Bull. 
Amer. M ath. Soc, 79 (1973)). ForD= §x§ 
and F the Hilbert modular group (-^ Section 
G), F. Hirzebruch has given explicit nonsin- 
gular compactilications of F\D (Enseignement 



Math., 19 (1973)). 

The Hecke theory (-^ Section D) can also be 
extended to the Siegel modular case. For any 
integer m > 0, let be the set of 2 n x 2n 

integral matrices M such that *MJM = mJ, 



where 





and /„ is the identity 



matrix of degree n . We define the Hecke 
operator T(m) on by 



T(m)/=^m"'‘“"<"+i >/2 ^ det(CZ 

(c n)=wsF(«.) 

FDj-ViMZ), 

where fs = IJ Mmm) F„M, and V{m) is 

a complete system of representatives of F„\S^. 
Then for any prime number p, the formal 
operator series D,(X)= F(p")A"" is form- 

ally equal to a rational function in X: 



D,{X) 



P^(X) 



in which P,(X) and Q,(X) are polynomials in 
X of degree 2” — 2 and 2", respectively, with 
coefficients in the ring of Hecke operators 
(Satake, Publ. M ath. Inst. HES, 18 (1964)). For 
n = 2 we have (Shimura, Proc. Nat, A cud. Sci . 
US, 49 (1963)) 



D,(X) = ( 1 - { 1 ~ T{p)X + (T(p)^ 

- T{p^)-p^'‘~'^)X^ -p^^-^T{p)X^ 

Let /(Z)e9H[^* be a Siegel modular form of 
degree 2 such that T{m)f — /i,(m)/ for all m ^ 1. 
We define Lj(s) by 

® A(m) 

Ms)=C(2s-2fc+4) X -V, 

m = l m 
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in which is the Riemann zeta function. It is 
known that Lj-{s) can be meromorphically 
continued to the entire plane and satisfies the 
funetional equation 

H^j-{2k-2-s) = ^>j.{s), 

where Y,(s) = (2:?r)“^T(s)r(s~ k + 2)Ly(s)and 
the function (s — /c)(s ^ k + 2)'Fy(s) is entire (A. 
Andrianov, Trudy M ath. /nst. Stekloo., 112 
(1971)). 



G. Hilbert Modular Functions 



Following a suggestion of Hilbert, 0. Blu- 
menthal studied a generalization of modular 
functions of the following type: Let K be a 
totally real talgebraic number field of Imite 
degree, and let be the conjugates 

of K, where n = [K : Q]. (“Totally real” means 
that all the are real.) For a E K, we denote 
by the ith conjugate of a. Let X) be the 

+Principal order of K, and consider the group 



r^ = SL{ 2 ,T 2 ) 

y d 



a, (i,y,SGT>,ocS — fiy—\ 



We define the action of Fj, on the n-fold prod- 
uct of the upper half-plane §" = {z = (zj , . , z„) 
z,.€§} by 



a(z): 



-f ’ ’ y‘"^z„ -I- 



fox a f F then F^, becomes a discontinu- 
\Y d/ 

ous group of the first kind (F^ can also be 
considered as an tirreducible discrete subgroup 
of SL{2, R)“)> The group Fj, is ealled the Hil- 
bert modular group of K (in the strict sense). 
(The defmition is sometimes modified by re- 
placing the condition a6 — /fy = 1 by, say, ad 
jiy =(totally positive +unit).) If the tclass num- 
ber of K is h, the quotient space can 

be compactified by adjoining h points at in- 
finity. Therefore, if n ^ 2, a Hilbert modular 
function can be defined as a meromorphic 
function on invariant under r^; and simi- 
larly a Hilbert modular form f of weight k 
or of dimension -k as a holomorphic function 
/ on §"such that (/| o)(z) =/(o-(z))n^i (y‘'^Zi 
+ y(z) for all (TErj;. (In the latter case, 

fis holomorphic at all cusps, i.e., for every 
aeSL(2, K), f\a has an integral Fourier ex- 
pansion.) For Hilbert modular functions and 
forms, results quite similar to those for the 
ease n = 1 or the case of Siegel modular groups 
have been obtained (Kloosterman, Maass, K. 

B. Gundlach, H. Klingen). 



H. Further Generalizations 

As further generalizations of the notion of 
modular function, we have Hilbert-Siegel 
modular functions (1.1. Pyatetskii-Shapiro, 
Baily, Christian [20], where we can find some 
150 references), Hermitian modular functions 
(H. Braun, Klingen), etc. For the most general 
case, i.e., for an arithmetically defined discon- 
tinuous group acting on a tsymmetric bounded 
domain, a unified theory of automorphic func- 
tions has been established in the works of 
Pyatetskii-Shapiro [21,22] and Baily and A. 
Borel [23,24]. 

On the other hand, exactly as in the classical 
theory, where the elliptic modular function 
gave an invariant of 1 -dimensional complex 
tori, generalized modular functions can be 
viewed as giving an analytic invariant of a 
certain family of (polarized) tAbelian varieties. 
From this point of view, a deep number- 
theoretic (and algebrogeometric) study of 
automorphic functions (initiated by Hecke and 
Eiehler) has been substantially carried forward 
by the work of Shimura (see the series of his 
papers in A nn. M ath. starting from vol. 70 
(1959); see also [9,24,25]). 

From the analytic point of view, the theory 
of automorphic functions is closely connected 
with the unitary representation of G in the 
space L 2 (F\G) (or its adelic analog) [26,27] 

(for adelic analogs -^ [ 12,131). In this respect, 
the Itrace formula of A. Selberg [28], gen- 
eralizing the +Poisson summation formula, 
is of fundamental importance; and actually 
it can be used effectively for calculations of 
the dimension of the space of automorphic 
forms and of the trace of Hecke operators 
(R. P. Langlands, A mer. J. M ath., 85 (1963); 

H. Shimizu, Ann. M ath., (2) 77 (1963), J. Fac. 
Sci. Tokyo, 10 (1963); also [24]). When 
X = G/K is a symmetric domain, we can define, 
for any representation p of K, a (matrix- 
valued) canonical automorphy factor, by 
which we defme (vector-valued) automorphic 
forms with respect to a discrete subgroup F of 
G, and under a further condition (say, F free, 
F\X compact, and the thighest weight of p 
suffteiently large) we obtain a formula for the 
dimension of the space of such automorphic 
forms in terms of the (arithmetic genus of r\X 
and certain numbers related to the “dual” 

= GJK and the representation p [ 15,29,30]. 
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33 ( 1 . 5 ) 

Axiomatic Set Theory 

A. General Remarks 

Axiomatic set theory pursues the goal of rees- 
tablishing the essentials of G. Cantor’s rather 
intuitive tset theory by axiomatic construc- 
tions consistent with modern theories of the 
foundations of mathematics. 

A system of axioms for set theory was first 
given by E. Zermelo [47], and was completed 
by A. Eraenkel [8]. J. von Neumann [30] 
expressed it in tsymbolic logic, gave a formal 
generalization, and eliminated ambiguous 
concepts. P. Bernays and K. Gddel [2, 1 1] 




147 



further refined and simplified von Neumann’s 
formulation. The theories based on the sys- 
tems before and after the formal generaliza- 
tion are called Zermelo-Fraenkel set theory 
(ZF) and Bernays-Godel set theory (BG), 
respectively. 



B. Zermelo-Fraenkel Set Theory 

ZF is a formal system expressed in tfirst-order 
predicate logic with the predicate symbol = 
(equality) and based on the following axioms 
1-9 (- 411 Symbolic Logic). These axioms 
do not contain any predicate symbol other 
than e, where xey is read “x is an element 
of y.” Any formula containing only e as a 
predicate symbol is called a set-theoretic for- 
mula. Following the usual convention, we omit 
the outermost universal quantifier, and use 
restricted quantifiers such as 3x e a, Vx g a. 
Axiom I (axiom of extensionality): 

'ix{xea = xeb)->'a = b. 

This asserts that sets formed by the same 
elements are equal. The formula x g a{x £ b) is 
denoted by a c This means “a is a subset of 
h.” Then Axiom 1 can be expressed by 

a(zbAbcza-*a = b. 

Axiom 2 (axiom of the unordered pair): 

Jx\/y{yex = y = a\/ y = b). 

This asserts the existence, for any sets a and h, 
of a set X having a and b as its only elements. 
This X is called the unordered pair of a and b 
and is denoted by { a, b}. The notion of ordered 
pair is characterized by 

(a, b) = {c,d) = a — c Ab = d. 

An example of such a set is (a, {>) = { {a, a}, 
{a,b}}. 

Axiom 3 (axiom of the sum set): 

3xVy(y£xs3zGa(yez)). 

This asserts the existence for any set a of the 
sum (or union) x of all the sets that are ele- 
ments of a. This x is denoted by Ua or S(a). 

We write a U b for u {a, b}and a' for 
aU{a,a}. 

Axiom 4 (axiom of the power set): 

Vx3y(yex = Vzey(zea)). 

This asserts the existence for any set a of the 
power set x consisting of all the subsets of a. 
This X is denoted by P(a). We have S(P{a)) 
a, so 5 is a left inverse of P, and the products 
SP and PS are idempotent. 

Axiom 5 (axiom of the empty set): 

3xVy(nyex). 
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This asserts the existence of the empty set. The 
empty set is denoted by 0 or 0. 

Axiom 6 (axiom of infinity): 

3x(0gx vVyex(/Gx)). 

This asserts the existence of a set containing 
all the natural numbers, where 0, 1 = 0’ = {0}, 

2= l'-{0, 1}, 3 = 2' = {0,1,2), This defini- 
tion of natural number is due to von 
Neumann. 

Axiom 7 (axiom of separation): 

3xVy(yex=y£flA A(y)). 

This asserts that the existence for any set a and 
a formula A( y) of a set x consisting of all 
elements of a satisfying A( y). This x is denoted 
by { y£a A( y)}. This is rather a schema for an 
intinite number of axioms, for there are an 
infinite number of A( y). This axiom, also 
called the axiom of comprehension or axiom 
of subsets, was introduced by Zermelo. 

For example, the set of all natural numbers 
is introduced by 

{xea| Vy(OeyAVzey(z'ey) ->xey)}, 

where a is a set satisfying Axiom 6. The set of 
all natural numbers is denoted by co or N. 

Axiom 8 (axiom of replacement): 

3x Vyea(3z/4(y, z)^3zexA{y, z)). 

This asserts the existence for any set a of a set 
X such that for any y of a, if there exists a z 
satisfying A( y, z) then such z exists in x. If the 
relation A( y, z) determines a function, then the 
image of a set by the relation is included in a 
set, so by Axiom 7, the image is a set. This 
axiom was introduced by Fraenkel. 

Axiom 9 (axiom of regularity): 

3xA(x)->3x(/1(x)a Vy£x(nA(y))). 

This asserts that if there exists a set satisfying 
A(x), then there is a set x satisfying A(x) but 
every element y of x does not satisfy the prop- 
erty ,4(y). This axiom is also called the axiom 
of foundation. Its dual expression, called e- 
induction, is of course equivalent to Axiom 9. 
Axiom 9’ (axiom of E-induction): 

Vx(Vy e xA ( y) ^ /4 (x)) VxA (x). 

Using this we can show that ~lxex, 

“I (x G y A y G x), etc. If we assume the axiom of 
choice stated below, then this is equivalent to 
the nonexistence of an infinite descending 
sequence 

x„£...ex2exi. 

If a model of set theory satisfies the axiom of 
regularity and has an infinite descending se- 
quence that is not in the model, then such a 
model is called a nonstandard model. 
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Axiom 10 (axiom of choice): 

Vx 6 (x, y) ->• 3y Vx e a A (x, y (x) ). 

This asserts that if for any element x of a there 
is a set y such thatA{x, y), then there is a 
choice function y for the formula, i.e., 

A[x, y(x)) for all X in a. Usually a function is 
represented by its graph. A set/is called a 
function defined on a if 

Vx Vy((x, y) e/^x e a), Vx e a3y((x, y)ef), 

VxeaVy Vz((x, y)e/A(x, z)ef ->y = z). 

This formula is denoted by Fnc(/); then the 
formula A{x,f{x)) is an abbreviation of 

Fnc(/)A3y((x,y)G/Ayl(x,y)). 

The axiom of choice is equivalent to many 
properties, such as the well-ordering theorem, 
Zorn’s lemma, and Tikhonov’s theorem on the 
product of compact spaces, and it is used 
widely and essentially in mathematics. 

The system of axioms 1 to 9 is called 
Zermelo-Fraenkel set theory and is denoted by 
ZF; the system ZF minus the axiom of replace- 
ment is called Zermelo set theory, denoted by 
Z; and the system ZF plus the axiom of choice 
is denoted by ZFC. 

The system Z is weaker than ZF. Indeed, the 
existence of the set o) of all the natural num- 
bers and of P(ro), P(F(o))), can be proved 
in Z, but the existence of the set{co,p(o;), 
P(P(co)), •} cannot be proved in Z. However, 
we can prove its existence in ZF. 

The theory ZF minus the axiom of intinity is 
called general set theory. Its consistency can be 
reduced to the consistency of the theory of 
natural numbers as follows. Let n be any 
natural number and n= 2"' -I- 2"^ -h m + 

< H 2 < < rij^) he its binary expan- 
sion. We can define a model of general set 
theory by identifying a set with the natural 
number n and defining PI 6 n by “m appears as 
one of the Pj-.” In general set theory we can 
show the existence of 0, P(0), P(P(0)), but 
the existence of the set {0, P(0), P(P(0)), .} 
cannot be proved. However, we can prove its 
existence in Z. The set consisting of all heredi- 
tary finite sets is the smallest model of general 
set theory. 



C. Bernays-Godel Set Theory 

The existence of {u| A(u)} for an arbitrary set- 
theoretic formula .4(u) cannot be deduced 
from the axioms of ZF. We call this object a 
class to distinguish it from sets. We introduce 
a generalized logical system of first-order 
predicate logic by adding to the logical system 
used in ZF class variables, class constants, and 



inference rules with respect to quantifiers for 
classes. For any set-theoretic formula A(u) in 
which no tfiound class variable occurs, we 
adopt 

3XVu(u6Ji5/l{u)) 

as an axiom, where capital letters X are class 
variables. The set theory thus obtained is 
equivalent to Godel set theory [11]. Von 
Neumann axiomatized set theory by making 
use of the notion of functions instead of that of 
classes [30]. In refining this theory and intro- 
ducing the notion of classes, Bemays and 
Godel [2, 1 1] initiated Bernays-Godel set 
theory, BG. 

ZF and BG are related as follows: Any 
formula provable in ZF is provable in BG, and 
any set-theoretic formula provable in BG and 
having neither clasS variable nor class constant 
is provable in ZF. In this sense, the systems 
can be regarded as essentially equivalent, but 
as BG has class variables and class constants, 
it is more convenient for expressing set- 
theoretic notions. 

Von Neumann defined the following func- 
tion R by +transfinite induction: 

i!(O) = 0, R(a)=U'W«), 

t)<a 

where y. and are ordinal numbers and 

denotes the set sum of ))3{/?(0)), 
'*]^(P(1)), .... 'iPjPjjS)), . . . (j?<a). The function R 
can be defined by a set-theoretic formula, so it 
exists as a class. Now consider the model 
M = M(a) for a fixed ordinal number a. We 
define sets of the model Af as elements of P(a), 
and classes of the model as subsets of P(a). We 
denote the e relation of the model by e^f. For 
classes X and 7of the model, we write Xe, 
e Y Then a necessary and sufficient 
condition for R(x) to be a model of BG is that 
X is an '•'inaccessible ordinal number (-^ 3 12 
Ordinal Numbers). The existence of an in- 
accessible ordinal number cannot be deduced 
from the axioms of ZF. There is a series of 
studies of axiomatization of set theory in 
which any number of inaccessible ordinal 
numbers is assumed to exist [23]. When P(a) 
is a model of BG (ZF), it is called a natural 
model of BG (ZF). Furthermore, if H is [J^ P(a) 
then H satisfies all the axioms of BG (ZF). As 
we do not need the axiom of regularity for 
defining the class H , we see that BG (ZF) is 
consistent as long as BG (ZF) without the 
axiom of regularity is consistent. 

D. Independence of the Continuum Hypothesis 
and the Axiom of Choice 

These axiomatizations of set theory motivated 
a series of studies from the standpoint of the 
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tfoundations of mathematics on problems that 
remained unsolved after the appearance of 
Cantor’s primitive set theory. Among these, 
the problem of the relation between the con- 
tinuum hypothesis and the axiom of choice 
was central. 

Consistency of the Axiom of Choice and the 
Continuum Hypothesis. Godel [1 1] proved 
that if ZF without the axiom of choice is con- 
sistent, then the system obtained by adding to 
ZF the axiom of choice and the continuum 
hypothesis is also consistent. To show this, he 
constructed a model of ZF satisfying the 
axiom of choice and the generalized con- 
tinuum hypothesis as follows. Assume first that 
M is an arbitrary domain of objects among 
which the e relation is detined. By a formula 
on M, we understand a formula having con- 
stants of M as its only constants, having e as 
its sole predicate symbol, and further having 
exclusively variables whose ranges are re- 
stricted to M. Let us denote by M' the totality 
of subsets of M detined by a formula A(x) on 
M. Now we put {0}, =M^, = 

if jS is a tfimit ordinal number. We 
call X constructible if x e for some ordinal 
number a, assumed to be less than the first 
inaccessible ordinal number, if any. We denote 
the totality of constructible sets by L, and the 
totality of sets of ZF by V. We call the asser- 
tion V= L, that is, every set is constructible, 
the axiom of constructibility. If we add this 
axiom to ZF, the axiom of choice and the 
generalized continuum hypothesis become 
provable. On the other hand, if we regard 
elements of L as sets of the model and the 
original e relation as the e relation of the 
model, we have a model of ZF in which the 
axiom of constructibility holds. 

Independence of the Axiom of Choice and the 
Continuum Hypothesis. Since the result of 
Godel, attempts have been made to prove that 
the axiom of choice is independent of the other 
axioms. Fraenkel constructed a model of set 
theory without satisfying the axiom of choice, 
starting from a countable number of objects 
that are not sets. A. Mostowski constructed in 
ZF a model of set theory having objects that 
are not sets, and he proved that the model 
satisfies the axiom: Every set can be tlinearly 
ordered, but does not satisfy the axiom of 
choice [27]. E. Mendelson constructed a 
model of set theory that does not satisfy the 
axiom of choice by making use of an infinite 
descending chain a, 9 «2 9 3 [25]. These 

models, however, do not satisfy all the axioms 
of ZE or of Zermelo set theory minus the 
axiom of choice, even though they satisfy most 
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of the axioms. Consequently, they are not 
sufficient for proving independence. 

P. J. Cohen [3,4] proved the following 
results in connection with the independence of 
the axiom of choice, the continuum hypo- 
thesis, and V— L\ If ZE is consistent, each of 
the following conditions has a model. (1) The 
axiom of choice as well as the generalized 
continuum hypothesis holds, but there exists 
an a satisfying a ^ L and a c Cd. (2) is not 
well-ordered. (3) The axiom of choice holds, 
but the continuum hypothesis does not hold. 
(4) cannot be linearly ordered. 

By (1), we see that V= L is independent of 
the axiom of choice and the generalized con- 
tinuum hypothesis: (2) shows the independence 
of the axiom of choice; and (3) shows that the 
continuum hypothesis is independent of the 
axiom of choice. In (4), C!P(a))) corresponds 

to the set F of all real-valued functions detined 
on the interval [0, 1], and (4) shows that the 
proposition “F can be linearly ordered” is not 
deducible in ZE without the axiom of choice. 



E. Some Recent Results 

Boolean-Valued Set Theory. D. Scott and 
R. M. Solovay defined models of set theory in 
which the set-theoretic formula has values in a 
complete Boolean algebra. This viewpoint is 
motivated by P. J. Cohen’s original notions of 
the forcing relation and the generic filter. 

According to the relation between sets and 
their representing functions, a function / : A-i- 
B corresponds to a subset of A when B is a 
complete Boolean algebra. S) by transtinite 
induction we put 

()<<x 

where is the Boolean-valued power set 

of A defined by 

ueP^‘^\A) = u:A^B. 

Eet be the union of all then the 
truth values of the formulas |u 6 u] and |u = r] 
are defined by transtinite induction as 

{uev}= X = 

Jt6dom(i;) 

[« = t;| 

— n u{x)^[xen]- n r(x)^[xeu|, 

*6dom(u) X6dom(o) 

where a => is an abbreviation for the element 
—a + bin B. 

In K*®' the following properties are satisfied: 

(i) The truth value of any formula provable 
from ZEC is 1, the largest element of B. 

(ii) Any complete homomorphism h: B, 
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can be extended to 

so that the commutative relation 

is satisfied for every restricted formula; further, 
if h is an epimorphism then it is satisfied for 
every formula. 

(iii) (maximum principle) For any formula A(x) 
there is an element u of such that 

|3x4(x)]= [/((«)]. 

Moreover, if [3xK(x)J = 1 then there an ele- 
ment u of such that l[R(w)] = 1 and 

[3x(RWa/1(x))] = [/((«)]. 

Property (i) means that if [A] < 1, then A is 
not provable from ZFC. Many consistency 
results in set theory are obtained in this way 
by constructing special partial order struc- 
tures, topological spaces, or complete Boolean 
algebras. 

Axiom of Strong Infinity. The axiom of infin- 
ity, Axiom 6, of ZF asserts that there exists a 
set including the set to of all natural numbers. 
The set o) is the smallest infinite cardinal num- 
ber. In general, the axiom of strong infinity is 
stated in a form such that certain special prop- 
erties of CO are satisfied for an uncountable 
cardinal number. 

Though such an axiom asserts the existence 
of large cardinal numbers, their properties 
sometimes reflect, for example, those of the 
real numbers, or of the set of real numbers, etc. 
We state here some typical examples: 

(i) Weakly inaccessible cardinal number. This 
is a regular limit cardinal number, that is, 

cf(mJ = (o^, V/i<a((Up+i <(oJ. 

(ii) Strongly inaccessible cardinal number. 

cf(foJ = m«, Vj5<a(2"/?<mJ. 

(iii) Weakly compact cardinal number, co^ is 
uncountable and the space X = with < co^- 
topology is (Oj, -compact. 

(iv) Measurable cardinal number, co^ is un- 
countable and there is a nonprincipal <OJ^- 
additive 2-valued measure /i: P(o)j,)-»{0, 1). 

(v) Strongly compact cardinal number, co^ is 
uncountable and any product of 2 = { 0, 1 ) with 
< m,-topology is cOa-compact. 

The strength of these properties increases in 
the order presented here. Properties (i) to (iii) 
are compatible with the axiom of constructi- 
bility V=L. And (iv) implies K# L while it is 
compatible with V- L(/z). But (v) implies 
V ^ L(a) for any set a. 

There are many other stronger axioms of 



infinity, and the consistency proofs of these 
“axioms” within ZF or ZFC would involve 
essential difficulties. Inconsistency proofs of 
these properties, if they exist, would be very 
interesting. 

F. Examples of Results 

(1) Cardinality and Cofinality (-- 3 12 Ordinal 
Numbers), (i) (W. B. Easton [6]) Let be a 
model of ZFC (the Zermelo-Frankel axioms 
plus the axiom of choice) in which the GCH 
(generahzed continuum hypothesis) is valid, 
i.e., Va(2“« =£Uj,+j), and let g be a function 
from ordinals to ordinals in such that Va, 

/?(sc <f^=> ^ and Va(m^ <cf(m, ,„,)). 

Then there is a Boolean model 91 of ZFC, 91 D 
9J1, having the same cofinality and satisfying 
2“a =0}g{a) for every regular cardinal. (This 
means that Konig’s condition (cf(2"«) > o)J is 
the only restriction on the cardinality of 
powers of regular m^,.) 

(ii) (J. H. Silver [39]) Suppose 

oi <cf(a) = cf(o)j)< CO,. 

Then for any / < cf(a) 

However, the validity of the implication 

Vu < co(2“" = co„+i )-^2"‘“ = (0^+1 

still remains an open question. And it is 
known that if there is a model for 

Vn<m(2"" = co„+i), 

then there is a model m which there are many 
measurable cardinal numbers. 

(iii) (K. Prikry [31] ) Let Q), be a measurable 
cardinal number in a model M of ZFC. Then 
there is a Boolean extension N of M in which 
the notion of cardinality is not changed but 
the notion of cofinality is changed, that is 
cf(m,) = ft), in M but cf(co,) = to in N. 

(iv) (Solovay) Let co, be a strongly compact 
cardinal number. Then for any strong-limit 
singular cardinal cop > co, the continuum hypo- 
thesis holds at cop; that is, the implication 

d{aop) < cOp, Vy < li{2‘^y < co^)->-2"/* - (Op+^ 

is provable. 

(2) Lebesgue Measurability and the Baire Prop- 
erty. As is well known, every A} (Borel) set 
(and consequently every (analytic) set) of 
real numbers is Lebesgue measurable and has 
the Baire property. 

(i) (Godel) V= L implies the existence of a 
A] set of real numbers that is neither Lebesgue 
measurable nor has the Baire property. 
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Let DC denote the principle of depending 
choice: 

'^xea^yeaPix, y) 

^3/ :aj^aineo)P{f{n),f{n + !))• 

DC is adequate for the development of the 
classical notions of measure theory, such as 
Jordan decomposition, the Radon-Nikodym 
derivate, etc. Let I denote the hypothesis 
3a(cf(o)J = co^A Vj? < a(2“^ < m^)) (strongly 
inaccessible). 

(ii) (Solovay [40]) The consistency of ZFC 
and 1 implies that of ZF and either of the 
following two axioms: 

(a) DC plus the hypothesis that every set of 
real numbers is Lebesgue measurable and has 
the Baire property. 

(b) The axiom of choice plus the hypothesis 
that every set of real numbers definable by an 
0 ) sequence of ordinals is Lebesgue measurable 
and has the Baire property. 

Axiom (a) means that DC is not strong 
enough to construct a (Lebesgue) nonmeasur- 
able set, while Axiom (b) implies that every 
projective set is Lebesgue measurable and has 
the Baire property, and hence implies that the 
set of all constructible real numbers, as a Z'] 
set, has Lebesgue measure 0, and is of the first 
category. 

(3) Martin’s Axiom. Let B be a Boolean 
algebra. We say that B satisfies c.c. (chain 
condition) if the cardinality of every disjoint 
family of positive elements of B is at most Wj,. 
Let B* denote the topological space consisting 
of all homomorphisms h: B-*2 = {0, 1} with 
the open base U{a) — {h\h{a)— l}(aeB);then 
B* is a +Baire space. Then Martin’s axiom 
(MA) is: Let B be an CD c.c. Boolean algebra 
and a < 2”. Then the intersection of a dense 
open sets is dense in B*. Since {h X^h{a^) = 
h(b)} is dense and open in B* if = b, 

MA means the existence of an h e B* preserv- 
ing any given set of a ( < 2”) equations in B, If 
2” = oji, MA merely reduces to the Baire prop- 
erty of B*. However, if 2” > g>i, then the to 
c.c. hypothesis is essential, for there exists a 
B satisfying co^ c.c. such that B* contains 
dense open sets with empty intersection. 

(i) (Solovay and S. Tennenbaum [43]) The 
consistency of ZF implies that of ZFC, MA, 
and 2‘^>cOi. 

(ii) (D. A. Martin and Solovay [24]) ZFC, 
MA, and 2” > coj imply the propositions: 

(a) Vto, < = 2“), hence 2” = 2*^^; 

(b) the totality of the first category sets of 
Lebesgue measure zero sets is a-additive for 
any a < 2”; 

(c) every iTj set of real numbers is Lebesgue 
measurable and has the Baire property. 
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(4) Suslin’s Hypothesis (SH) is: Every dense, 
linear, order complete set without end points, 
having at most at disjoint intervals, is order 
isomorphic to the continuum of real numbers. 

(i) (T. J. Jech, Tennenbaum) The consistency 
of ZF implies that of (a) ZFC, “| SH, and 
GCH, as well as (b) ZFC, “| SH, and 2” > coj 

(ii) (R. Jensen [15]) The consistency of ZF 
implies that of ZFC, SH, and 2” = o>i . 

(iii) (Solovay and Tennenbaum [43]) ZFC, 
MA, and 2” > co^ imply SH. 

(iv) (Jensen [15]) V=L implies n SH. 

(5) Measurable and Real-Valued Measurable 

Cardinals. A cardinal K >CD is said to be 
measurable if there is a measure /r : 

{0, 1} with (a) /i(/c)= 1, (b) Vv<fc(^({v}) = 0) 
and (c) ^4v) = 2v«x and k is said 

to be real-valued measurable if there exists a 
Ii:S^{k) -> [0, 1] that satifies (a), (b) and (c), and 
is not measurable. 

Let MC (RMC) denote the existence of a 
measurable cardinal (real-valued measurable 
cardinal). 

(i) (S. Ulam [45]) The existence of an (y- 
additive measure /t:)J3(A)->[0, 1], with ii{A) = 

1 and Vx6/4(^((x}) = 0), implies RMC or MC; 
RMC implies the existence of an extension 

of Lebesgue measure defined on 5^1 ( [0, 1]); 
every real-valued measurable cardinal is ^ 2”’ 
and weakly inaccessible. 

(ii) Every measurable cardinal is strongly 
(hyper) inaccessible, and (a) 3aF(a)^3a< 

coi F(a) for any 27 J formula E(a) on the ordinal 
numbers, (b) 3af («)->3a </tj f (a) for any 77] 
formula E(a) and the smallest measurable 
(Many results have been obtained concerning 
the ordinal magnitude of cui and the measur- 
able cardinals.) 

(iii) (Solovay [41]) The consistency of ZEC 
and MC is equivalent to that of ZEC and 
RMC. 

(iv) (Martin and Solovay [24]) ZEC, RMC, 
and MA are not consistent. 

(v) (Eevy and Solovay [21]) The consistency 
of ZEC and MC implies that of ZEC, MC, 

MA, and 2 ">a) 2 - 

(vi) (Solovay) ZEC and MC imply that 
every 27] set of real numbers is Eebesgue 
measurable. 

(vii) (Martin, Solovay) ZEC, MC, MA, and 
2” > £02 imply that every 27] set of real num- 
bers is Eebesgue measurable and has the Baire 
property. 

(viii) (Silver [38]) The consistency of ZEC 
and MC implies that of ZEC and MC as well 
as the existence of (Eebesgue) non-measurable 
A] sets of real numbers. 

(6) Axiom of Determinateness. We consider the 
following inlinite game. Eet A be a set consist- 
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ing of functions co to oj. The players I and II 
choose alternatively natural numbers ri; and 
ntj, and, for the resulting inlinite sequence, if 

then 1 wins, otherwise II wins. 

A function a: is called a 

strategy. If II plays according to the sequence 

W = (m],m 2 , . ) and I plays according 

to the strategy cr, then the resulting sequence is 

<y*m 

The strategy (j is called a winning strategy for 

I of the game defined above, if for any play m 
of II, the resulting play a*m is in the set A, and 
similarly for the winning strategy of II. 

The axiom of determinateness AD is the 
assertion that for any A cz co", the player I or 

II has a winning strategy. 

(i) (J. Mycielski [28]) Assume ZF and AD; 
then we have: 

(a) axiom of choice for countable sequence of 
sets of real numbers; 

(b) every set of real numbers is Lebesgue 
measurable and has the Baire property, so the 
axiom of choice does not hold; 

(c) every uncountable set of real numbers 
contains a perfect set. 

(ii) (Solovay) Assume ZF and AD; then we 
have: 

(a) cardinalities of and 2 ” are not 
comparable; 

(b) cOj and CO 2 are measurable cardinals, but CO 3 
is a singular cardinal such that cffcOj) = CO 2 . 

(iii) (Martin) In ZFC, every Borel, namely 
A(, game is determined. 

(iv) (Martin, Solovay) In ZFC: 

(a) if there exists a measurable cardinal, then 
every analytic, namely 77], game is determined; 

(b) if every A] -game is determined, then there 
is a model of ZFC with many measurable 
cardinal numbers. 

Since the axiom of determinateness AD is 
very strong, there is some suspicion that it 
might contradict ZF. But at present there is no 
such evidence. 



References 

[1] J. Bell, Boolean- valued models and inde- 
pendence proofs in set theory. Clarendon 
Press, 1977. 

[2] P. Bernays, A system of axiomatic set 
theory I, J. Symbolic Logic, 2 (1937), 65577. 

[3] P. J. Cohen, The independence of the 
continuum hypothesis I, II, Proc, Acad. 
Sci. US, 50(1963), 1143-1148; 51 (1964), 105- 
110 . 



[4] P. J. Cohen, Set theory and the continuum 
hypothesis, Benjamin, 1966. 

[5] F. R. Drake, Set theory, North-Holland, 
1974. 

[6] W. B. Easton, Power of regular cardinals, 
Ann. Math. Logic, 1 (1970), 1399178. 

[7] G. Foder, On stationary sets and regres- 
sive functions, Acta Sci. Math., 27 (1966), 1055 

no. 

[S] A. Fraenkel, Zu den Grundlagen der 
Cantor-Zermeloschen Mengenlehre, Math. 
Ann., (1922), 230-237. 

[9] H. Gaifman, Uniform extension operators 
for models and their applications. Sets, Models 
and Recursion Theory, North-Holland, 1967, 
1222155. 

[ 10] F. Galvin and A. Haynal, Inequalities for 
cardinal powers, Ann. Math., (2) 101 (1975), 
491-498. 

[ 1 1] K. Godel, The consistency of the axiom 
of choice and of the generalized continuum 
hypothesis with the axioms of set theory, Ann. 
Math. Studies, Princeton Univ. Press, 1940. 
[12] K. Godel, What is Cantor’s continuum 
problem?, Amer. Math. Monthly, 54 (1947), 
5155525. 

[ 133 T. Jech, Non-provability of Suslin’s hypo- 
thesis, Comm. Math. Univ. Carolinae, 8 (1972), 
2299308. 

[ 14] T. Jech, Set theory. Academic Press, 1978. 

[ 15] R. B. Jensen, The fine structure of the 
constructible hierarchy, Ann. Math. Logic, 4 
(1972), 2299308. 

[ 16] A. Kanamori and M. Magidor, The evo- 
lution of large cardinal axioms in set theory. 
Lecture notes in math. 669, Springer, 1978, 
99-275. 

[17] H. J. Keisler and A. Tarski, From acces- 
sible to inaccessible cardinals. Fund. Math., 

53 (1965), 2255308; 57(1965), H9. 

[ 18] K. Kunen, Inaccessibility properties of 
cardinals. Thesis, Stanford Univ. 

[19] K. Kunen and J. B. Paris, Boolean exten- 
sions and measurable cardinals, Ann. Math. 
Logic, 2 (1971), 3599378. 

[20] A. Levy, Basic set theory. Perspectives in 
mathematical logic. Springer, 1979. 

[21] A. Levy, and R. M. Solovay, Measurable 
cardinals and the continuum hypothesis, Israel 
J. Math., 5 (1967), 2344248. 

[22] M. Magidor, How large is the first 
strongly compact cardinal?, Ann. Math. Logic, 
10 (1976), 33-57. 

[23] P. Mahlo, Zur Theorie und Anwendung 
der po'Zahlen, Ber. Verb. Koniglish Sach. Ges. 
Wiss. Leipzig. Math. Phys. Kl., 63 (1912), 108- 
112; 65 (1913), 2688282. 

[24] D. A. Martin and R. M. Solovay, Internal 
Cohen extensions, Ann. Math. Logic, 2 (1970), 

143-178. 

[25] E. Mendelson, The independence of weak 




153 



axiom of choice, J. Symbolic Logic, 21 (1956), 
350-366. 

[26] Y. N. Moschovakis, Determinacy and 
prewellordering of the continuum. Mathe- 
matical logic and foundations of set theory, 
North-Holland, 1970, 24462. 

[27] A. Mostowski, Ober die Unabhangigkeit 
des Wohlordnungssatzes vom Ordnungsprin- 
zip. Fund. Math., 32 (1939), 201-252. 

[28] J. Mycielski, On the axiom of determi- 
nateness, Fund. Math., 53 (1964), 205-224. 

[29] K. Namba, An axiom of strong infinity 
and analytic hierarchy of ordinal numbers, 
Proc. Symp. Pure Math., 13 (1971), 279-319. 

[30] J. V. Neumann, Die Axiomtisierung der 
Mengenlehre, Math. Z., 27 (1928), 6699752. 

[31] K. L. Prikry, Changing measurable into 
accessible cardinals, Diss. Math., 68 (1970), 5- 
52. 

[32] J. B. Rosser, Boolean-valued models for 
set theory. Academic Press, 1969. 

[33] G. E. Sacks, Forcing with perfect closed 
sets, Proc. Symp. Pure Math., 13 (1971), 331- 
355. 

[34] D. S. Scott, Measurable cardinals and 
constructible sets. Bull. Acad. Polon. Sci., Ser. 
Math. Astr. Phys., 7 (1961), 145-149. 

[35] D. S. Scott and R. M. Solovay, Boolean- 
valued models for set theory. Lecture notes. 
Summer Institute on Axiomatic Set Theory, 
UCLA, 1967. 

[36] S. Shelah, A compactness theorem for 
singular cardinals, free algebras, Whitehead 
problem and transversals, Israel J. Math., 21 
(1975), 319-349. 

[37] J. H. Silver, Some applications of model 
theory in set theory, Ann. Math. Logic, 3 
(1971), 45-110. 

[38] J. H. Silver, The consistency of the GCH 
with the existence of a measurable cardinal, 
Proc. Symp. Pure Math., 13 (1971), 391-396. 

[39] J. H. Silver, On the singular cardinal 
problem, Proc. Int. Congr. Math., Vancouver, 

1 (1974), 2655286. 

[40] R- M. Solovay, A model of set theory in 
which every set of reals is Lebesgue measur- 
able, Ann. Math., (2) 92 (1970), 1-56. 

[41] R. M. Solovay, Real-valued measurable 
cardinals, Proc. Symp. Pure Math., 13 (1971), 
4977428. 

[42] R. M. Solovay, Strongly compact car- 
dinal and the G.C.H., Proc. Symp. Pure Math., 
25 (1974), 3655372. 

[43] R. M. Solovay and S. Tennenbaum, 
Iterated Cohen extensions and Suslin’s prob- 
lem, Ann. Math., (2) 94 (1971), 201-245. 

[44] G. Takeuti and W. B. Zaring, Axiomatic 
set theory, GTM 8, Springer, 1973. 

[45] S. Ulam, Zur Masstheorie in der al- 
gemainen Mengenlehre, Lund. Math., 16 
(1930), 140-150. 



34 B 

Axiom of Choice and Equivalents 

[46] P. Vopenka and K. Hrbacek, On strongly 
measurable cardinals. Bull. Acad. Polon. Sci., 
Ser. Sci. Math. Astr. Phys., 14 (1966), 587-591. 

[47] E. Zermelo, Untersuchungen liber die 
Grundlagen der Mengenlehre I, Math. Ann., 

65 (1908), 261-281. 



34 ( 11 . 5 ) 

Axiom of Choice and 
Equivalents 

A. The Axiom of Choice 

In set theory, the following axiom is known as 
the axiom of choice: Lor any nonempty family 
51 of nonempty subsets of a set X, there exists 
a single-valued function /, called the choice 
function, whose domain is 51, such that for 
every element A of 51 the value f(A) is a mem- 
ber of A. This axiom is equivalent to each of 
the following three propositions: (1) If {A,},„ 
is a family of sets not containing the empty set 
and with an index set A, then the Cartesian 
product ri; A, is not the empty set. (2) If a set 
A is a disjoint union (J^ A, of a family of 
subsets {A,},„ which does not contain the 
empty set, there exists a subset B, called the 
choice set, of A such that every intersection of 
B and /4^(/le A) contains one and only one 
element. (3) Lor every mapping f from a set A 
onto a set B, there is a mapping from B to A 
such that /o g = 1, (identity mapping). Also 
equivalent to the axiom of choice are the well- 
ordering theorem and Zorn’s lemma, which 
are discussed in the following sections. 

B. The Well-Ordering Theorem 

In 1904, E. Zermelo [2] first stated the axiom 
of choice and used it for his proof of the well- 
ordering theorem, which says that every set can 
be twell-ordered by an appropriate tordering. 
Conversely, the well-ordering theorem implies 
the axiom of choice. Many important results 
in set theory can be obtained by using the 
axiom of choice, for example, that tcardinal 
numbers are comparable, or that various 
definitions of the tiniteness or infmiteness 
of sets are equivalent. Various important 
theorems outside of set theory, e.g., the exis- 
tence of bases in a tlinear space, tcompactness 
of the direct product of compact ttopological 
spaces (tTikhonov’s theorem), the existence 
of a subset which is not tLebesgue measurable 
in Euclidean spaCC, etc., are proved using the 
axiom of choice. But for those proofs the well- 
ordering theorem or Zorn’s lemma (stated 
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below) are used more often than the axiom of 
ehoice. 

Using the well-ordering theorem, it can be 
proved in the following manner that for every 
field k, any linear spaee X over k has a basis. 
Let (x^veA be an enumeration of X. By ttrans- 
finite induetion, we can define the funetion f 
from X to B = { 0, 1 } such that (i) if Xq = 0, 

/(Xq) = 0; if Xq ^ 0, /{xq) = 1; (ii) for v > 0, if x, 
is expressed as a linear eombination of ele- 
ments of {xJm<v,/(x^)= l},/(x,) = 0; other- 
wise /(xj = 1. Then V = {xeX|/(x)= 1} is a 
basis of X. 



C. Zorn’s Lemma 

An ordered set X is ealled an inductively 
ordered set if every '^totally ordered subset of X 
has an '•'upper bound. A eondition C for sets is 
called a condition of finite character if a set X 
satisfies c if and only if every finite subset of 
X satisfies C. A eondition C for funetions is 
called a condition of finite character if C is a 
condition of finite character for the graph of 
the function. Zorn’s lemma [4] can be stated 
in any one of the following ways, which are all 
equivalent to the axiom of choice. It is often 
more convenient to use than the axiom of 
choice or the well-ordering theorem. 

(1) Every inductively ordered set has at least 
one maximal element. 

(2) If every well-ordered subset of an ordered 
set M has an upper bound, then there is at 
least one maximal element in M, 

(3) Every ordered set A4 has a well-ordered 
subset W such that every upper bound of M 
belongs to W. 

(4) Eor a condition C of finite character for 
sets, every set X has a maximal (for the rela- 
tion of the inclusion) subset of X that satis- 
fies C. 

(5) Eet C be a condition of finite character for 
functions from X to Y. Then, in the set of 
functions that satisfy C, there is a function 
whose domain is maximal (for the relation of 
the inclusion). 

Using Zorn’s lemma (1), we can prove again 
in the following way any linear space X over a 
field k has a basis. Eet be the set of all non- 
empty subsets A of X such that arbitrary finite 
subsets of A are linearly independent ovcr k. 
is not empty. If we order ^ by the relation of 
inclusion, then ^ is an inductively ordered set. 
By Zorn’s lemma (1), there is a maximal ele- 
ment U of Since u is maximal, U is a basis 
ofX. 

The same theorem is proved as follows 
using Zorn’s lemma (4). Condition C for the 
subset A of X, that arbitrary finite subsets of A 
are linearly independent over k, is a condition 



of finite character. Hence there is a maximal 
subset U that satisties C and is a basis of X. 

Concerning recent developments regarding 
the axiomatic basis for the axiom of choice 
33 Axiomatic Set Theory D. 
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Axiom Systems 

A. History 

A mathematical theory is based on a specific 
system of axioms, i.e., a system of hypotheses 
from which the whole theory is deduced with- 
out reliance on other assumptions. 

One of the first deductive methods of math- 
ematical reasoning was utilized by Thales, who 
returned to Greece from Egypt with the knowl- 
edge of surveying methods, and who deduced 
additional results from that empirical knowl- 
edge. His method gave impetus to the devel- 
opment of Greek geometry, which flowered 
with the Pythagorean school and research by 
members of Plato’s Academy. In the course of 
this development, the deductive method led to 
the idea of constructing the whole theory upon 
a system of “absolutely obvious” statements 
from which the whole theory could be de- 
duced. Euclid systematized Greek geometry in 
his Elements utilizing this idea. His work 
became the basis of geometry after the Re- 
naissance, and Greek geometry came to be 
called tEuclidean geometry. In the Elements 
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Euclid called the basic obvious statements 
common notions when they were of general 
nature, and postulates when they were specifi- 
cally geometric. Both were later called axioms 
(or postulates). 

Among the axioms stated by Euclid, the 
“lifth postulate” concerning parallels was 
longer and more complicated than the other 
axioms. Many efforts were made to deduce 
this particular axiom from the other axioms. 
The failure of these attempts suggested the 
possibility of establishing a tp on- Euclidean 
geometry, which was actually done by N. 1. 
Eobachevskii and J. Bolyai, who replaced the 
fifth postulate by its negation and showed that 
the new system of “axioms” was as valid as the 
classical one. This development naturally led 
to a new evaluation of the idea of axioms, and 
eventually the traditional concept of recogniz- 
ing the axioms as obvious truths was replaced 
by the understanding that they are hypotheses 
for a theory. D. Hilbert [ 1] established the 
latter idea as axiomatization and claimed that 
the whole science of mathematics should be 
built upon a system of axioms. His idea be- 
came the foundation of present-day mathe- 
matics. Hilbert reorganized classical geometry 
based upon his idea and published his result in 
Foundations of Geometry (1899). 

B. Systems of Axioms 

The system of axioms of a theory, i.e., the 
system of basic hypotheses from which we 
hope to deduce the whole theory, is written in 
undefined terms (or in terms of undefined con- 
cepts) by means of which all other terms are 
defined. On the other hand, a given theory is 
axiomatized by specifying such a system of 
axioms upon which the theory may be re- 
organized. It should also be noted that a sys- 
tem of axioms determines a tstructure 409 
Structures). 

A system of axioms is considered to be 
mathematically valid if and only if it is consist- 
ent. It is also desirable that the axioms in such 
a system be mutually independent (i.e., the 
negation of any one of the axioms is still con- 
sistent with the others). When such a system is 
not independent, it can be simplified by delet- 
ing redundant axioms from it. 

When any two models of a system of axioms 
are isomorphic to each other, we call the sys- 
tem complete or categorical (^ 409 Struc- 
tures). Eor example, the system of axioms (I)- 
(V) postulated by Hilbert as the foundation 
for Euclidean geometry is complete (^155 
Eoundations of Geometry), whereas the sys- 
tems of axioms for the theories of tgroups, 
Tings, or ^fields are not complete since there 



are nonisomorphic groups, etc. Although it is 
desirable that the systems of axioms postu- 
lated for a given theory (e.g., the ttheory of real 
numbers, or Euclidean geometry) be complete, 
the study of partial systems that may not be 
complete is also important (^156 Founda- 
tions of Mathematics). 
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Banach Algebras 

A. Definition [1-4] 

A real or complex IBanach space R is called a 
Banach algebra (or normed ring) if a multiplica- 
tion law between elements of R is introduced 
that makes R an f associative algebra and if 
il-’cyll ^ \\x\\ ' II yll is satisfied. The complex case 
is generally easier to handle, and a real Banach 
algebra can always be embedded in a complex 
one isomorphically and isometrically. When 
a Banach algebra contains a unity element e 
with respect to the multiplication, i.e., it is 
unital, we can suppose ||e|| = 1. If it is not uni- 
tal, we can adjoin a unity element to it- 



B. Examples 

1. Let Cq( 5[R) be the set of complex- (real-) 
valued continuous functions x(^) on a locally 
compact space that vanish at intinity. 
Endowed with the pointwise operations 
and the tsupremum norm (- 168 Function 
Spaces), it forms a complex (real) Banach 
algebra. In the complex case, it has the invo- 
lution X*((^) = X((^). 

2. Let X be a Banach space. The set 

of tbounded linear operators on X forms a 
Banach algebra if we define addition, multi- 
plication by scalars, multiplication between 
elements, and the norm in the usual fashion. 

3. Let G be a locally compact Hausdorff 
group (- 423 Topological Groups) and /i be 
its ■''left-invariant Haar measure. The Banach 
space L,(G) (with respect to /r) can be made 
into a Banach algebra by defining xy{g) = 
^x{h)y{h~^ g)dfi{h) (- Section L). 

4. Every subalgebra of a Banach algebra 
R, which is closed in the norm topology, is a 
Banach algebra with respect to the original 
algebraic operations and norm and is called a 
closed subalgebra of R. 

Throughout this article, all Banach algebras 
are supposed to be complex Banach algebras 
unless otherwise specified. 



C. Spectrum of an Element 

We define a new operation x ‘ Y in R by setting 
x-y = x-|-y-xy. If x-y = yx = 0, y is called a 
quasi-inverse of x. When a unity element e 
exists, y is a quasi-inverse of x if and only if 
e-y is an inverse of e x. Lor a complex 
number A such that |/l| > |jx||, we see that 



A~^x possesses a quasi-inverse y given by the 
strongly convergent series y= — /“-.""x". 

The set of all complex numbers A such that 
X^^x does not have a quasi-inverse is called 
the spectrum of x and is denoted by Sp(x). 
(When X itself does not have an inverse, in 
particular, when R does not contain a unity 
element, we include 0 in this set.) Sp(x) is a 
bounded closed set in the complex plane, and 
sup{ |/t| AeSp{x)}=lim„^ 3 pix"||^^". In Exam- 
ple 2 , this spectrum is the spectrum of the 
operator x, and the problem of determining 
the behavior of this spectrum constitutes one 
of the central problems in the theories of 
Banach and Hilbert spaces. 



D. Representations of a General Banach 
Algebra [1,2] 

We understand by a representation of a 
Banach algebra R on a Banach space X any 
algebraic homomorphism jr of R into the alge- 
bra J*(X) (see Example 2) satisfying ||;i(x)|| ^ 
||x|| for all XE R . We call X the representation 
space. A Banach algebra always possesses 
an isomorphic and isometric representation. 
Especially important are the irreducible rep- 
resentations. A vector subspace (closed or not) 
Y of X is an invariant subspace if 7r(x) 7 c= Y 
for any xe R . A representation is called alge- 
braically irreducible if the invariant subspaces 
are trivial, i.e., they are only {0} or X. A repre- 
sentation is said to be topologically irreduc- 
ible if closed invariant subspaces are trivial. 

The tkernel of an algebraically irreducible 
representation is called a primitive ideal, which 
can alternatively be defined in the following 
way: A left ideal J ( ^ {0}, R) is regular, by 
definition, if R contains an element u, a unity 
element modulo J, such that x — XU G d for any 
XE R . Such an ideal is always contained in a 
maximal left ideal, which in turn is necessarily 
regular and closed. A two-sided ideal / is 
primitive if it is the set of elements a in R for 
which aR a J, where J is some fixed regular 
maximal left ideal. If R is commutative, a 
primitive ideal is a regular maximal ideal, and 
conversely. The intersection of all primitive 
ideals is called the radical of R , and when it 
is {0}, R is called semisimple. 

The set of primitive ideals 3 is known as the 
structure space of R , in which the hull-kernel 
topology (or Jacobson topology) is introduced. 
The tclosure of a set in 3 is, under this 
topology, the set of primitive ideals containing 
the intersection of the ideals in 21. This topol- 
ogy is rather intractable; even in commutative 
cases, it does not coincide with the Gel’fand 
topology Section E), in general. 
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E. The Gel’fand Representation of a 
Commutative Banach Algebra 

A complex Banach algebra is a field if and 
only if it coincides with the field C of complex 
numbers (Gel’fand-Mazur theorem). This is the 
most fundamental fact in the study of com- 
mutative Banach algebras. Now let i? be a 
commutative Banach algebra. Then every 
regular maximal ideal J of is closed and the 
tquotient algebra RjJ is isomorphic to C, so 
that the quotient mapping R^RjJ is viewed 
as a complex-valued homomorphism of R. 
Conversely, if ^ is a nonzero complex-valued 
homomorphism of R, it has norm ^1 as a 
linear functional on R and its kernel, say R^, is 
a regular maximal ideal of R such that (p is 
exactly the quotient mapping -> R/R^. The 
set %II(R) of nonzero complex-valued hotno- 
morphisms (or regular maximal ideals) of R 
endowed with the tweak* topology of the 
Banach space tdual R' ofR is called the maxi- 
mal ideal space of R. 2R(R) is a locally com- 
pact Hausdorff space, and its topology is 
called the Gel’fand topology. For each xe R we 
define a function X on SfifR) by setting = 
(/)(x). Then the mapping is a homo- 
morphism of R into the algebra of 

all continuous complex-valued functions on 
StR(R) vanishing at infinity. This is the Gel’fand 
representation of R whose image, the Gel’fand 
transform of R, is denoted by R. Concerning 
this, we have (1) Jli(R) is compact if R has a 
unity element; (2) Sp(x) equals the closure of 
the range i(!Di(R)) of x; (3) the representation 
x^x is norm-decreasing if Co(9?t(R)) is en- 
dowed with the tsupremum norm; (4) \\i\\^ 

( = sup{ |x((p)|| (pe9R(R)}) equals lim„^^ 

The tkernel of the Gel’fand representation of 
R is the radical of R, which consists of gen- 
eralized nilpotent elements of R. 

F. Banach Star Algebras 

An involution in a Banach algebra R is an 
operation x-»X* that satisfies (1) (x -i- y)* =x* 
+y*; (2) {Xx)*=ax*U 3) {xy)* =y*x*;(4) (x*)* 

= X. A Banach algebra with an involution is 
called a Banach ^-algebra. A * -homomorphism 

d) between two Banach ^-algebras is an alge- 
braic homomorphism which preserves invo- 
lutions, i.e.,0(x*) = $(x)*. To represent a 
Banach ^-algebra we prefer a ^-representation, 
i.e., a representation on a Hilbert space 

such that is equal to the adjoint T* of 
for any xeR. 

G. C*-Algehras [5-10] 

A Banach *-algebra A satisfying \\x*x\\ = ||xp 
for all X 6 A is called a C*-algehra. Every C*- 
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ilgebra is ^-isomorphic and -isometric to a 
lanach algebra of operators on a Hilbert 
pace (see Example 2) that contains, along 
vith an operator, its adjoint (the Gel’fand- 
Vaimark theorem). A C*-algebra is semi- 
imple, and a commutative C*-algebra A is 
f-isomorphic to Co()0i(A)) under the Gel’fand 
•epresentation. A topologically irreducible *- 
■epresentation of a C*-algebra is also algebrai- 
cally irreducible (R. V. Kadison), and the set of 
unitary equivalence classes of these irreduc- 
ble ^-representations is called the dual space. 

[t becomes a topological space if we introduce 
;he hull-kernel topology inherited from the 
structure space, but other topologies are also 
introduced. Moreover, for the study of separ- 
able C*-algebras, Borel structure in the sense 
yf G. W. Mackey is a very powerful tool. C*- 
algebras also have intimate connections with 
the theory of ’''unitary representations of a 
topological group (see below) and with quan- 
tum physics. Many works have been published 
on * -representations, dual spaces, etc. 

A linear functional cu on a C*-algebra A is 
said to be positive if £o(x*x) ^ 0 for any x g A. 

Eor any nonzero positive linear functional (j) 
of A, there exist a ^-representation {n^, HJj of 
A on a Hilbert space If, and a vector 
such that co(a) = (7r„(u)<^„,([J for all aeA and 
that the subspace {7ro,(a)(^„|(3e Ay is dense in 
We call HJ a cyclic representation of 
A induced by cu. A ^-representation (n, H) of A 
is called universal if for any ^-representation 
(p, K) of A there exists a tc-weakly continuous 
♦-homomorphism p of n(A)” onto p(A)” such 
that p(a) = (po7t)(u)for all aeA, where n{A)'' 
is the double tcommutant of n(A) and so is 
a tyon Neumann algebra (-^ 308 Operator 
Algebras). Von Neumann algebras ti:(A)'' for 
universal representations 7T of A are mutually 
♦-isomorphic, so that they determine the envel- 
oping von Neumann algebra of A. Especially, 
the direct sum n of all cyclic representations 
is a universal representation and there is a 
unique isometric isomorphism of n(A)” onto 
the second dual A** of A that is bicontinu- 
ous in the o-weak topology of n(A ) ” and the 
(t{A**, A*)-topology of A**. So A** is identi- 
fied with the enveloping von Neumann algebra 
of A. 



H. Some Classes of C*-Algebras 

Eet A be a C*-algebra. If is a CCR (liminal 
C*-) algebra if it is mapped to the algebra of 
compact operators under any irreducible *- 
representation. It is a GCR (postliminal C*-) 
algebra if every nonzero quotient C*-algebra 
of A has a nonzero CCR closed two-sided 
ideal. These classes of C*-algebras have inter- 
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esting properties. For example, the dual space 
of a separable C*-algebra is a space (tTi 
space) if and only if the algebra is GCR (CCR) 
(J. Glimm). 

A is called an AF (approximately flnite) 
algebra if it is the uniform closure of an in- 
creasing sequence of fmite-dimensional C*- 
algebras (0. Bratteli). AF algebras are classi- 
fied by so-called dimension groups. To see this, 
we call an ordered Abelian group G (written 
additively) a Riesz group if (a) for any integer 
n> 0 and any 3 € G, ^ 0 implies ^^Oand 
(b) for any ( 1 ^ ^ m, 1 < j ^ n) in G with 

^ hj for all i and j, there exists k^G with 
gi^k ^hj for all i, j. Then the isomorphism 
classes of AF algebras correspond bijectively 
to the isomorphism classes, as local semi- 
groups, of generating upward-directed heredi- 
tary subsets of the positive cones of countable 
Riesz groups, and a dimension group, defined 
otherwise, is exactly a countable Riesz group 
(G. Elliot, E. G. Effros, D. E. Handelman and 
C. E. Shen) (- [II]). 

Now consider two C*-algebras A and B. 

The algebraic ttensor product A ® B over C 
becomes a :»:-algebra in the natural way. A 
norm )? on A (x) B is a C*-crossnorm if ||xy ^ 
ll^ll/j and ||x*x||p= ||x||j for x, ye A ® B 
and ||a 0 h||^^ ||a|| |16|| for aeA, beB. The 
completion of A 0 B under such a norm fi is 
a C*-algebra, which is denoted by A 0^ B. 
There are two special C*-crossnorms on A 0 
i?: II • l^in and ||- The former is called the 
spatial (minimal, injective) C*-crossnorm and 
is defined by ||x|Li„ = sup^,^||((p0)//)(x)||, 
where (p and ifj run over all *-representations 
of A and B, respectively. The algebra A 0„in B 
is called the spatial tensor product of A and B. 
The latter is called the greatest C*-crossnorm 
and is defined by ||x||n,ax = supp l|T(x)||, where 
T runs overall ^-representations of A 0B. 

And the algebra A is called the projec- 

tive C*-tensor product of A and B. Any C*- 
crossnorm j? on A 0 B lies in between these 
two norms, i.e., ||xLi„< l|x||^^ ||xL,, for 
X€i4 0 B. A C*-algebra A is called nuclear if 
A 0 B has a unique C*-crossnorm for any C*- 
algebra B. It is known that any GCR algebra 
is nuclear and that an inductive limit of nu- 
clear C*-algebras (e.g., any AE algebra) is 
nuclear. Given a linear mapping cp between 
C*-algebras A and B, we define for any inte- 
ger u ^ 1 a linear mapping (p„:A 0 M„-»B 0 
M„ by setting cp^x 0 e^) = <p(x) 0 e^, where 
{Cij} are the matrix units for the C*-algebra 
of n X n complex matrices, (p is said to be 
completely positive if cp„ is positive for any n ^ 

1 . A C*-algebra A is then called injective if, 
for any C*-algebras B, C with BEG and any 
completely positive contraction (p: B^A, there 



exists a completely positive contraction ip\ 
C+A that extends q>. A C*-algebra A is 
nuclear if and only if its enveloping von Neu- 
mann algebra A** is injective. Every injective 
C*-algebra B is not necessarily a von Neu- 
mann algebra but every AW*-algebra, i.e., 
every maximal commutative *-subalgebra M 
of B, is monotone complete (i.e., every increas- 
ing net of self-adjoint elements has a least 
upper bound in M) and the lattice of projec- 
tions in B is conditionally complete. 

I. Crossed Product 

Eet A be a C*-algebra, G a locally compact 
Hausdorff group with left-invariant mea- 
sure fi, and a a continuous action of G on A 
as ^-automorphisms geG. Let K(G, A) 
be the linear space of continuous functions 
from G to A with compact support, which is 
a ^-algebra under the multiplication y * z 
and the involution y* given by (y * z)(g) = 

and y*(^) = A( 0 -‘)- 

^giyid ’)*) for y, zeK(G, A), where A is 
the tmodular function of G. By completing 
K{G, A) under the norm || y | j =Jg || y|| d^ig), 
we get a Banach ^ -algebra Lj(G, A). Li(G,A)is 
then made into a C*-algebra by furnishing the 
norm ||x||=sup,j || 7 t(x)jl, where ;rruns overall 
^-representations of L^{G,A). This C*-algebra, 
denoted by A 0^, G, is the crossed product of 
A by G relative to the action a. Crossed prod- 
ucts are useful in the structure theory of G*- 
algebras. 



J. Extensions by C*-Algebras (BDF Theory) 

[ 12 ] 

Eet H he a separable infinite-dimensional 
Hilbert space and the ideal of <^{H) 

consisting of all compact operators. The quo- 
tient C*-algebra ^{H)l ^^{H) is called the 
Calkin algebra. We denote it by J(//) and 
the quotient mapping by By 

an extension of ^^{H) by a separable unital 
C*-algebra A we mean a unital (preserving 
unity elements) ^-isomorphism i of A into 
J(/f). We call two extensions Tj, T 2 :A ->.2(//) 
equivalent if, for some unitary operator u 6 
^{H), 7 t(M)Ti(a) 7 t(u)* = 12 ( 0 ) for any aeA, and 
denote by Ext(A) the set of all equivalence 
classes [t] of extensions t by A. Ext(A) forms 
a commutative semigroup with respect to the 
addition [t] = [tJ -|- [ 12 ], where t(u) = Tj (u) 0 
i 2 (fl) 6 ^(//) © J(H)c J(// © H). Call an ex- 
tension x'.A^^{H) trivial if there is a unital 
*-isomorphism a of A into ^{H) with r = no. 
Then (1) all trivial extensions are equivalent. 
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and we define the unity element of the semi- 
group Ext(/1) (D. Voiculescu); (2) Ext(/1) is a 
group if A is commutative (L. G. Brown, R. G. 
Douglas, and P. A. Fillmore) or, more gener- 
ally, if A is nuclear (M. D. Choi and E. G. 
Effros); (3) Ext(/1) is not always a group (J. 
Anderson). Eor further information 390 
Spectral Analysis of Operators J. 



K. Derivations in C*-Algebras 

A linear operator 5 in a C*-algebra A is a 
derivation in A if its domain D(6) is a dense 
subalgebra of A and ^(xy) = ^{x)y -I- x^(y) for x, 
ye D(6). Sis a ^-derivation if, moreover, xeD(6) 
implies x*eD(S) and 5 (x*) = (5(x)*. <5 is called 
bounded (unbounded, closed) if it is bounded 
(unbounded, closed) as a linear operator. 

Every bounded derivation 3 is expressed 
as d{x) = ax — xa with some d in the envelop- 
ing von Neumann algebra A** of A (R. V. 
Kadison and S. Sakai). The element d can be 
taken from the multiplier algebra MfA) = 
[beA**\bA + Ab^A] of A if A is simple 
(Sakai) or if A has continuous trace (C. A. 
Akemann, Elliott, G. K. Pedersen, and J. 
Tomiyama). If A is separable, we see that all 
bounded derivations in A are given by ele- 
ments in M(A) if and only if A is the C*-direct 
sum of a family of simple C*-algebras and a 
C*-algebra with continuous trace (Elliott, 
Akemann, and Pedersen) [13] and that every 
bounded derivation in the quotient C*-algebra 
A/I, I being any closed two-sided ideal, can be 
lifted to a derivation in A (Pedersen). 

Next we consider unbounded derivation. By 
a =t-autOmorphism group on A we mean a tone- 
parameter group p„ f 6 R, of =i:-automorphisms 
of A such that, for each XE A, p,x is contin- 
uous in t e R. A C*-dynamical system i s a 
pair consisting of a C*-algebra A and a :|!- 
automorphism group on A. The fact that the 
time evolution of a physical system is often 
represented by such a dynamical system has 
made the study of unbounded derivations 
quite active. We have to see if a given deriva- 
tion S is tclosable and if, in case of a closable 
(5, the closure 3 generates a ^-automorphism 
group. Sample results: (1) If a ^-derivation () 
is well-behaved in the sense that for every self- 
adjoint X in D(6) there exists a tstate (p with 
|(p(x)|= |x|| and <p((5(x)) = 0, then 3 is closable 
and its closure is well-behaved (A. Kishimoto 
and Sakai); (2) if a * -derivation 3 is closable, 
its closure 3 generates a ^-automorphism 
group if and only if 3 is well-behaved and 
(1 + ^)D((5) is dense in A (R. T. Powers and 
S. Sakai; 0. Bratteli and D. W. Robinson). 

Eor further results ^[14]. 
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L. Applications to the Theory of Topological 
Groups [2,3,6,15] 

Banach algebras have many applications in 
different branches of mathematics and quan- 
tum physics. Here we mention some that con- 
cern topological groups. Eet G be a locally 
compact Hausdorff group and p be its left- 
invariant measure. We make the complex 
algebra E,(G) of Example 3 into a Banach 
*-algebraby defining x*{g) = x{g~^)A{g~% 
where A is the tmodular function of G. This is 
called the -algebra (or group algebra). It is 
not C* but is semisimple. Considering a uni- 
tary representation of G is equivalent to con- 
sidering a ^-representation of the Lj -algebra. 
Replace the norm of xeLifG) by supll7i:(x)||, 
where the supremum is taken over all the *- 
representations. The new norm satisfies the C* 
condition, and the completion of E,(G) with 
respect to this norm is a C*-algebra, which we 
call the C*-group algebra of G. The dual of the 
C*-group algebra thus defined is called the 
dual of the group G, and this notion plays an 
important role in the study of topological 
groups. Unitary representations of a group G, 
^-representations of the Lj -algebra of G, and 
*-representations of the C*-group algebra of G 
are all characterized by positive definite func- 
tions on G. A function p(g) is positive definite, 
by definition, if it is measurable on G and 

^ p{g-^ h)x{g)x{h)dp{g)dp{h)^0 

for any continuous function x(g) with compact 
support. 

The Abelian case. When G is an Abelian 
group (- 422 Topological Abelian Groups), a 
regular maximal ideal M of the Lj -algebra R 
of G and a character y of G are in a one-to-one 
correspondence by the relation 

x(M) =^x{g)y{g)dp{g) 

(the left-hand side is the value of x at M under 
the Gel’fand representation). Moreover, the set 
of regular maximal ideals of R provided with 
the Gel’fand topology and the set Q of charac- 
ters of G provided with the Pontryagin topol- 
ogy (the tcharacter group of G) are homeo- 
morphic by this correspondence. Therefore the 
Gel’fand transform of an element x of the L^- 
algebra R is seen to be a function xly) on G 
defined by the integral on the right-hand side 
in the above expression, which is properly 
called the Fourier transform of x. Of course, 
the Eourier transform can be defined for other 
classes of functions (e.g., for the L 2 -space over 
G. the Eourier transform in the sense of Plan- 
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cherel), and classical theories of Fourier 
series, Fourier integrals, and harmonic analysis 
(- 192 Harmonic Analysis) are studied from a 
more extensive point of view. Thus the state- 
ment that the Fourier transform of an element 
X of the Lj -algebra of G is a continuous func- 
tion vanishing at inlinity Section E), for 
example, is a version of the classical tRiemann- 
Lebesgue theorem. Bochner’s theorem in 
classical tFourier analysis is restated thus: A 
continuous tpositive definite function on an 
Abelian group can be put in the form 

p{g)=^y{g)My), 

where p is a uniquely determined bounded 
positive tRadon measure on the character 
group (5. Developing these theories further, we 
obtain an alternative proof of the tPontryagin 
duality theorem (H. Cartan and R. Godement). 
A closed ideal / in the L j -algebra R deter- 
mines a set Z(I) in C as the set of common 
zeros of the Fourier transforms of elements of 
I. We ask whether, conversely, / is character- 
ized by Z(l). This question is the problem of 
spectral synthesis, and many important results 
have been obtained. The statement that I must 
coincide with R when Z(l) is empty is a for- 
mulation of the generalized Tauberian theorem 
of N. Wiener. A considerable simplification 
of the proof was accomplished by using the 
theory of Banach algebras, which was the first 
application of the theory (f. M. Gel’fand). 

M. Holomorpbic Functional Calculus 
[16,171 

Let R be a unital commutative Banach algebra 
with maximal ideal space 3R = ilR(R). We de- 
fine the joint spectrum Sp(X) of a finite n-tuple 
X = {xi,...,x„} in R by {(x, {(p), , 

(peiUl}, a nonempty compact subset of C". 

Then the bolomorpbic functional calculus says 
the following: For each X = {x^, . , x,} cz R 
there exists a unique algebra homomorphism 
0 ;^^ of the algebra H{Sp{X)) of tgerms of holo- 
morphic functions on Sp{X) into R with the 
following properties: (1) = 1, the unity 

element of R, and = X;, 1 ^ j ^ fi; (2) if 
n<m, A’' = {xi, . . . . X,} mdF{z^, — zj — 

F(Z 1 , . . . . zj with FeH{Sp{X)), then 

(3) if { k = 1,2, } are holomorphic 
in a fixed neighborhood JJ of Sp(A) and F^-> 

F uniformly on U, then 0;f(FJ-+^);f{F) (L. 
Waelbroeck). From this follows the implicit 
function theorem: Let Xj , • , x„ e F, /e C(93i), 
and let F(w, z , z„) be holomorphic in a 
neighborhood of the set Sp(/,Xi,. . • ,X„) = 
{(/(<p),i, ((p), , x„((p))| Suppose that 



F(/,Xi, ...,xj = 0 but dF/dw^O on Sp(/,Xi, 
...,x,). Then there exists a unique ye R such 
that j) = f and F{y,x ^, . , xj = 0 (R. Arens and 
A. Calderon). As an application we obtain 
Shilov’s idempotent theorem: If K is a closed- 
open subset of i0i, there exists xeR with x = 

1 on K and x = 0 on — X. Set Q{R)={xe 
Then Q(R) is an additive sub- 
group of R and Shilov’s theorem says that 
the Gel’fand representation gives an isomor- 
phism of Q(R) onto Q(C(W)) ( ^ Z)). 

Another theorem of this sort is the Arens- 
Royden theorem: R 7exp(R) = C(93i)“V 
exp(C(9Ji)) = Z), where R "4s the set 

of invertible elements of R and exp(R)= {e^\ 
XE Fy. Lurther extensions related to K- theory 
have been given by J. L. Taylor and others. 
(Lor functional calculus for one variable -v 
25 1 Linear Operators) 
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Banach Spaces 

A. General Remarks 

The notion of Banach space was introduced in 
1922 by S. Banach and N. Wiener, working 
independently. The idea was to apply topo- 
logical and algebraic methods to fundamental 
problems of analysis, such as mapping prob- 
lems in infinite-dimensional function spaces 
(- 168 Function Spaces, 197 Hilbert Spaces, 
25 1 Linear Operators). 

B. Definition of Banach Spaces 

We associate to each element x of a tlinear 
space X over the real (or complex) number 
field a real number ||x|| satisfying the following 
conditions: (i) ||x|| >0 for all x, and ||x|| =0 is 
equivalent to X = 0; (ii) ||ax|| = |a| • ||x|| for any 
real (complex) number a; (in) ||x-l-y||^||x|| + 
llyll. Then ||x|| is called the norm of the vector 
X, and X is called a normed linear space. The 
norm is thus an extension of the notion of 
the length of a vector in a ^Euclidean space. A 
normed linear space X is a tmetric space under 
the distance p{x,y)= ||x — yl|. We write s- 
lim„^^x„ = x or simply x„->x when lim„^^||x„ 
— x|| = 0 and say that x„ converges strongly to 
X. If this metric space X is tcomplete, then X 
is called a Banach space. In normed linear 
spaces the addition and the multiplication 
by scalars are continuous. A closed linear 
subspace M of a Banach space X is again a 
Banach space, and the tquotient space X/M 
becomes a Banach space if the norm of a tcoset 
is defined by ||x-i-M!| =inf„gJ^^ ||x-hmij. The 
subset {x II X II ^ 1} of a normed linear space 
X is called the (closed) unit ball (or unit sphere) 
ofX. 

Examples. tFunction spaces C,Lp (1 < p < 

00 ), M, Wp, Hq, tsequence spaces c, /, m, and 
BV are all Banach spaces (- 168 Function 
Spaces). 



C. Linear Operators and Linear Functionals 

Suppose that a linear subspace D(T) of a 
linear space X is the (definition) domain of 
a mapping T with values in a linear space 
Xi T is called a linear operator if 7(ax + 
j8y) = ctTx + fiTy for any scalars a, and x, y e 
D(T). R{T) — {TxeXi |x6D(T)} is called the 
range of T. In the special case where Tfj is 
the real or complex number lield, J is called 
a linear functional. If X and are both 
normed linear spaces, then T is continuous if 
and only if s-lim,„ Tx, = Tx whenever s- 
lim^^^ x,=x. This is equivalent to the condi- 
tion ®UPjcgD( 7 -),||^||^j \\Tx\\ < CO. In particular, if 
D(T) = X, the linear operator T is continuous 
if and only if the set {Tx ||x|| ^ 1} is bounded. 
In this case, T is called a bounded linear oper- 
ator, and II T|| = supn^n || Tx || is called the 
norm of the operator T . In particular, a linear 
operator T satisfying || T|| ^ 1 is called a con- 
traction. (Sometimes T is called a contraction 
only when ||T|| < 1. In that case, an operator 
with II r II ^ 1 is called a nonexpansive opera- 
tor.) The scalar multiple, sum, and product of 
linear operators are defined by (aT')x = a( Tx) , 
(T +S)x= Tx -I- Sx, and (ST) X = S( Tx), respec- 
tively. The identity operator / in X is defined 
by / • X = X for all x eX. If the inverse mapping 
T~^ of x-> Tx exists, then it is called the in- 
verse operator of T. A normed linear space X 
is said to be isomorphic to a normed linear 
space Y if there exists a bounded linear opera- 
tor T from X onto Y with bounded inverse. 

If T can be chosen to be isometric (i.e., || rx|| = 
||x|| for all xeA'), then X is said to be iso- 
metrically isomorphic to Y. 



D. The Dual Space and the Dual Operator 

The totality of continuous linear functionals 
/ defined on a normed linear space X is a 
Banach space X’ under the previously de- 
fined linear operations and the norm ly’H = 
sup||;cli^i |/(x)|. This X’ is called the dual (or 
conjugate) space of X. In view of useful prop- 
erties of the tinner product in tHilbert spaces, 
it is sometimes convenient to write (x,/> for 
f(x). Let X and Y be normed linear spaces, 
and let T be a linear operator with the dense 
domain D (T) in X and the range R(T) in Y. 

If (/, g) is a pair with /e Y’ and ^6 Y' satisfy- 
ing the equation (Tx, f) = (x, g) for any xe 
D(T), then g is determined uniquely by f. The 
operator T ' defined by T'f = g is linear and is 
called the dual operator (or conjugate or adjoint 
operator) of T. This is an extension of the 
notion of the ttranspose of a matrix in matrix 
theory. If T is a bounded linear operator then 
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T' is also a bounded linear operator such that 

IIT'IU ||7|1. 



E. The Weak Topology and the Strong 
Topology 

Let X be a normed linear space and X’ its 
dual space. Take a finite number of elements 
x\,x' 2 , , x'„ from X’, and consider the subset 
of X:{x6X"|supi^i^J<x,x->Ke}, £>0. If 
we take the totality of such subsets of X as a 
tfundamental system of neighborhoods of 0 
of X, then X is a '''locally convex topological 
linear space, denoted sometimes by X^. This 
topology is called the weak topology of X. If 
a sequence {x„} c X converges to x e X with 
respect to the weak topology of X, then it is 
said to converge weakly. This is equivalent to 
the convergence (x„ f )^<x, f) for any f e 
X’. The original topology of X determined by 
the norm is then called the strong topology 
of X, and to stress the strong topology we 
sometimes write in place of the original 
X. Take a finite number of elements Xj, X 2 , 

. . . , x„ from X, and consider the subset of 

A":{x'6X'|supi^i^„KXj,x'>|^£}, £>0. If 

we take the totality of such subsets of X’ as a 
fundamental system of neighborhoods of 0 of 
X’, then X’ is a tlocally convex topological 
linear space. We write this space as X'^* and 
call the topology the weak* topology of X’. 

The topology of X’ defined by the norm || f\\ is 
called the strong topology of X’ , and to stress 
the strong topology we write X'. The terms 
“weak” and “weakly” are used in reference to 
the weak topology, for instance, weak closed- 
ness and weakly closed. Similar conventions 
are used for the weak* and strong topologies. 

The unit ball of the dual space of a normed 
linear space X is weak*-compact (Banach- 
Alaoglu theorem). Then by the tKrein-Milman 
theorem (->. 424 Topological Linear Spaces) 
the unit ball of X’ is the weak* closure of the 
tconvex hull of its ''"extreme points. If X is a 
Banach space, a convex subset K of X’ is 
weak*-closed if and only if the intersection of 
K and each weak*-compact subset is weak*- 
closed (Krein-Shmul’yan theorem). 



F. The Hahn-Banach Extension Theorem 

Let M be a linear subspace of a real linear 
space X and p(x) a real-valued functional 
detined on X such that p{x + y)^p(x) + p{y) 
and p{?,x) — Xp(x) for all x, yeX and A^O. If 
a linear functional defined on M satisfies 
/i (x) <p(x) for all X e M, then there is a linear 
functional f on X which extends /j and satis- 
fies f(x) ^p(x) for all xeX (the Hahn-Banach 



(extension) theorem). The Hahn-Banach exten- 
sion theorem has many applications. (1) Let M 
be a linear sub space of a normed linear space 
X. Then for any we can construct an 

f eX' such that y(x)=/j(x) for all xeM and 
II /II = II II . (2) For any Xq 9^0 of a normed 
linear space X, we can construct an /q e X’ 
such that /o(xq)= IIxqII and ||/o|| = 1. (3) For 
any closed linear subspace M of a normed 
linear space X and a point Xq^ M we can 
construct fo^X' such that |1 /o|| = 1 , /o(xq) = 
and /q(x) = 0 for all xeM. A 
proposition more general than (3) is Mazur’s 
theorem, which is useful in applications: (4) Let 
a closed subset M of a normed linear space X 
be ■'"convex. Then for any Xq ^ M we can con- 
struct an /qEX' such that /o(.Xo)> /o(x) for all 
XE M . By (4) we can prove, e.g., that a convex 
set of a normed linear space is weakly closed 
if it is strongly closed. This proposition has 
the following corollaries: (5) A convex set A 
containing 0 is closed if and only if (y4°)° = 

A , where is the tpolar of A (the bipolar 
theorem). (6) For two closed convex sets A and 

B, containing 0, the polar of A fl B coincides 
with the weak* closure of the convex hull of 
AO u go Poj- complex linear spaces, most of 
the propositions in this section are valid with 
Re f(x) instead of /(x). 

The Hahn-Banach theorem can also be em- 
ployed to prove the existence of the general- 
ized limit (or Banach limit) Lim„^„ defined 
for all bounded real sequences {C„} such that 
lim inf„^„ C„ ^ Lim„^^ C„ < hm sup„^^ C„ and 
Lim„^^(«(^„ + Mn) = a Lim„^„ + /? Lim„^^ p„. 

G. Duality in Normed Linear Spaces 

An element •’^0 of a normed linear space X 
gives rise to an element Xq of (X()' determined 
by <Xo,x') = <x',Xo> for all x'eX/ If we write 
Xq = Jxq, then J is a linear operator satisfying 
P^oll “ dy (2), and so the space X is 
isometrically isomorphic to a linear subspace 
of (X')' . If X^ coincides with (X^)' under this 
isomorphism, we call X a reflexive (or regular) 
Banach space. A necessary and sufTlcient con- 
dition for the normed linear space X to be 
reflexive is that the unit ball of X be weakly 
compact. A convenient criterion for the reflex- 
ivity of X is that any bounded sequence {x„} of 
X contains a subsequence weakly convergent 
to a point of X (Eherlein-Shmul’yan theorem). 
In this connection, a Banach space X is reflex- 
ive if and only if each x’e X’ attains its norm, 
i.e., there is an Xq eX such that ||xo 1| = 1 and 
Kxq, x')| = IIx'II (James’s theorem). 

A normed linear space is said to be uni- 
formly convex if for any fi > 0 there exists a ^ > 

0 such that ||x|| ^ 1, || yj| < 1 and ||x — y|| 




165 



37 K 

Banach Spaces 



implies ||x + y||^2-’^.A normed linear space 
is said to be uniformly smooth if for any £ > 0 
there is a ^>0 such that ||x|| = 1 and || y|| 
imply ||x + y|| + ||x — y|| ^2 + £||y||. A Banach 
space X is uniformly convex (resp. uniformly 
smooth) if and only if the dual X’ is uniformly 
smooth (resp. uniformly convex). If a Banach 
space X is uniformly convex or smooth, it 
is isomorphic to a Banach space that is 
uniformly convex and uniformly smooth 
(P. Enflo). The space with 1 < p < oo is 
uniformly convex and uniformly smooth. 

Any uniformly convex or uniformly smooth 
Banach space is reflexive (Milman’s theorem). 



H. The Resonance Theorem 

Let {T,} be a sequence of bounded linear 
operators from a Banach space X into a 
normed linear space Y. The uniform bounded- 
ness theorem (resonance theorem or Banach- 
Steinhaus theorem) states that sup„^i [| T„\\ < oo 
if sup„^i||7),x|| <00 for every xeX. As a corol- 
lary, we have sup„^i ||x„|| < oo for any weakly 
convergent sequence of X. Another corollary 
states that the set |xgX limsup„ -ooll7;xii< 

00 } either coincides with X or is a subset of X 
of the tfirst category. This implies the so-called 
principle of condensation of singularities, which 
gives a general existence theorem for func- 
tions exhibiting various kinds of singularities, 
for example, a continuous function whose 
tpourier expansion diverges at every point of a 
tperfect set of points having the cardinal num- 
ber of the continuum. 



I. The Closed Graph Theorem 

Let T be a bounded linear operator from a 
Banach space X into a normed linear space Y. 
If the image of the unit ball of X under T is 
dense in the unit ball of Y, then for any £ > 0 
and any yo £ Y the equation Tx = y, has a 
solution X with ||x || ^ (1 -f £) //y, j| (Banach’s 
theorem). By using the tBaire category theo- 
rem, we can derive from this the open map- 
ping theorem: Every bounded linear operator 
from a Banach space X onto a Banach space 
Y maps each open set in X onto an open set 
in Y. As an application of the open mapping 
theorem, we can prove the closed graph theo- 
rem: A linear operator T from the whole of a 
Banach space into a Banach space is con- 
tinuous if and only if T is a closed operator, 
i.e., s-lim„^gQX„ and s-lim,„ Tx, = y imply 
Tx = y. This theorem plays an important role 
in modern treatments of linear partial differen- 
tial equations. 



J. The Closed Range Theorem 

Let X and Y be Banach spaces and Ta linear 
closed operator with domain D(T) dense in X 
and with range R(T) in Y. Under these con- 
ditions, the following four propositions are 
mutually equivalent. (1) R(T) is a closed set in 
Y. (2) R(T') is a closed set in X’. (3) R(T) = 

{ye Tj <y,y*> = 0 for all y*eD(T) such that 
T'y* = 0}. (4) R{T') = {x* eX'\(x,x*') = 0 for 
aU X e Z)( T) such that Tx = 0}. These four 
propositions, as a whole, are called the closed 
range theorem. This theorem implies (5) R(T) 

= Y if and only if T' has a continuous inverse; 
and (6) R( T') = X’ if and only if T has a con- 
tinuous inverse. The following theorem is of 
similar nature: the following three propositions 
on two closed linear subspaces M and N of a 
Banach space are mutually equivalent. (7) M + 
N is closed. (8) -h N° is strongly closed. 

(9) M° -I- N° is weak*-closed. 

The Hahn-Banach theorem, the resonance 
theorem, the open mapping theorem, the 
closed graph theorem, and the closed range 
theorem can be extended to various classes of 
tlocally convex topological linear spaces. By 
virtue of this extension, we are able not only to 
treat various fundamental problems of analysis 
from a unified viewpoint but alsO to develop 
the theory of functional analysis itself in a new 
direction (-^ 424 Topological Linear Spaces; 
concerning linear operators on a Banach space 

68 Compact and Nuclear Operators, 251 
Linear Operators, 390 Spectral Analysis of 
Operators). 

K. Differential and Integral Calculus of 
Functions with Values in Banach Spaces 

Calculus involving functions from a set to a 
Banach space is also an effective tool in vari- 
ous problems. A function x(t) defined on an 
interval [a, ft] with values in a Banach space X 
is said to be strongly (weakly) continuous if x(t) 
converges strongly (weakly) to x(tQ) as 
Lor a strongly (weakly) continuous function 
x(t), the Riemann integral can be defined in a 
standard way, using strong (weak) convergence 
of the Riemann sum 

I x(Odr-lim^x(t/)(t,-+i-tf). 
jifl 

The tfundamental theorem of calculus, i.e., 
strong (weak) differentiability of the indefinite 
integral, remains true. Various definitions of 
integrals of a Banach space-valued function 
on a tmeasure space are discussed elsewhere 
(- 443 Vector-Valued Integrals). 

Now let x(A) be defined on a domain H in 
the complex plane with values in a complex 
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Banach space X. x(/l) is said to be holomorphic 
if f{x{X)) is tholomorphic in Q for every /g X’ . 
If x{X) is holomorphic, then there exists an X- 
valued function y(A) on Q such that 

||r‘(x(i + C)-x(x))-yW))||-.0 a s C-0. 

Therefore there is no difference between 
“strong” and “weak” in analyticity. tCauchy’s 
integral theorem remains true for a holo- 
morphic function x(/l) with values in X, and 
the t Laurent expansion 

oo 

x(^)= ^ a„(A-Ao)", 

n= -00 

an=^.\ x(A)(A-Ao)-"-idA, 

is valid with the integral taken in the Rieman- 
nian sense. Banach space-valued holomorphic 
functions on complex (or real) tanalytic mani- 
folds of higher dimension can be defined in a 
natural way by means of power series expan- 
sion. A function 0(A) defined on a domain in 
the complex plane with values in the Banach 
space B{X, Y) of bounded linear operators 
from X to Y becomes holomorphic if /(0(A)x) 
is holomorphic for every x e X and /g Y’ . An 
operator-valued holomorphic function is often 
called an analytic operator function. 

L. The Approximation Property 

A Banach space X is said to have the approxi- 
mation property if there is a family {S,} of 
bounded linear operators of tfmite rank in X 
such that inf;i ] {S;^ — I)T\\ = 0 for all tcompact 
linear operators T in X. There is a Banach 
space that fails to have the approximation 
property (Enflo [9]). More surprisingly, the 
space of all bounded linear operators on an 
infinite-dimensional Hilbert space fails to have 
the approximation property (A. Szankowski). 
The approximation property plays a deci- 
sive role in the theory of ttensor products of 
Banach spaces. A Banach space X is said to 
have the hounded approximation property if a 
family {S,} in the definition of the approxi- 
mation property can be taken bounded, i.e., 
suPa 115,11 < 00. The bounded approximation 
property does not follow from the approxi- 
mation property. The bounded approximation 
property is closely related to the existence of a 
basis. A sequence {e,} in a Banach space X is 
called a Schauder basis or simply a basis (or 
base) for X if to each xe X there corresponds 
a unique sequence of numbers {cc,} such that 
lim„^„ ||x-2fc=i II —0. Most separable 
Banach spaces appearing in analysis have 
bases. A separable Banach space has the 
bounded approximation property if and only 
if it is isomorphic to a complemented linear 



subspace (-^ Section N) of a Banach space 
with a basis (A. Pelczyhski). 

M. Injective and Projective Banach Spaces 

Banach spaces of the type tC(Q) with compact 
Q and of the type t|j(Q) on a set Q play a spe- 
cial role in the theory of Banach space. This 
is already seen in the fact that every Banach 
space is isometrically isomorphic to a subspace 
of a space C(nri as well as to a quotient space 
of a space /j (0,). A Banach space X is said to 
be injective if for any Banach space Y and 
its linear subspace M, each bounded linear 
operator T from M to X can be extended to a 
bounded linear operator f from Y to X, that 
is, Tx = Tx for all XE JVf. A Banach space is in- 
jective if and only if it is isomorphic to a com- 
plemented linear sub space of the space C(R) 
for a compact Hausdorff space Q with the 
property that the closure of every open set is 
open. Such a topological space is called tex- 
tremally disconnected. The tmaximal ideal 
space of the tBanach algebra L^(Q) is extrem- 
ally disconnected. Hence the Banach space 
L^(Q) is injective. Whether every injective 
Banach space is isomorphic to a space C(0) is 
still an open problem. However, a Banach 
space is isometrically isomorphic to a space 
C(Q) with Q externally disconnected if and 
only if it is injective with the property of norm- 
preserving extension, i.e., || T|| = jj T|| is always 
possible (Nachbin-Goodner-Kelley theorem). In 
this connection, the following propositions on 
a Banach space X are mutually equivalent. (1) 
X’ is isometrically isomorphic to a space 
tLri/t). (2) For any Banach space Y, its linear 
subspace M, and e > 0, each compact linear 
operator T from M to X can be extended to a 
compact linear operator f from Y to X with 
II -T£)||T|| (J- Lindenstrauss). A Banach 

space X is said to be projective if for any 
Banach space Y and its closed linear sub space 

M, each bounded linear operator S from X to 
the quotient space Y/M is lifted to a bounded 
linear operator S from X to Y, i.e., (p{Sx)= Sx 
for all xeX, where (p is the quotient mapping 
from Y to Y/M. Projectivity of X is character- 
ized by its being isomorphic to the space l^iQ) 
on a set Q. A Banach space is isometrically 
isomorphic to a space /j (R) if and only if it 

is projective with the property of norm- 
preserving lifting, i.e., |5|| = ||S|| is always 
possible (- [8]). 

N. Complemented Subspaces Problems 

A linear subspace M of a Banach space X is 
complemented, i.e., there is a closed linear 
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subspace N such that M D N = {0} and M + 

N = X if and only if M is the range of a 
bounded projection P, i.e., = P and R(P) = 

M. Each nonzero closed hnear subspace of a 
Hilbert space is complemented, or more pre- 
cisely, it is the range of a projection of norm 
one (- 197 Hilbert Spaces). This property 
distinguishes Hilbert spaces from general 
Banach spaces: (1) A Banach space of more 
than 3 dimensions is isometrically isomorphic 
to a Hilbert space if each nonzero closed linear 
subspace is the range of a projection of norm 
one (Kakutani’s theorem). (2) A Banach space 
is isomorphic to a Hilbert space if each closed 
linear sub space is complemented (J. Linden- 
strauss and L. Tzafriri [10]). 

0. Quasi-Banach Spaces and Frechet Spaces 

Let X be a linear space over the real (or com- 
plex) number field. Suppose that a real-valued 
function ||x|| on X satisfies (i) and (ii) of Sec- 
tion B and (iii’) Hx + yH ^/c(||x|| -|- ||y||) with a 
constant k ^ 1 independent of x and y. Then 
||x|| is called the quasinorm of x, and X equip- 
ped with a quasinorm is called a quasinormed 
linear space. Let 0 < p ^ 1 be the root of the 
equation k = 2*^^^* *. Then there is a distance 
d{x, y) = d(x -y) depending only on x -y such 
that d{x-y)^\\x-y\\^^2d{x-y). Hence a 
quasinormed linear space is a metric space in 
which a sequence x„ converges to x if and 
only if ||x„ — X || -»0. If a quasinormed linear 
space X is complete under this metric, then X 
is called a quasi-Banach space. The tfunction 
space Lp is a quasi-Banach space for 0 <p< 1. 

If we denote by ||x|| the distance d(x-O) of 
a quasinormed hnear space, then it satisfies (i) 
and (iii) of Section B and (ii’) || — x|| = |[x|| and 
lim„^^ ||a„x„ — ax|| =0 whenever and 

=0- ^ functional ||x|| satisfying 

(i), (ii’), and (iii) is called a pseudonorm. If a 
hnear space X equipped with a pseudonorm is 
complete, then X is called a Frechet space (in 
the sense of Banach). The tfunction space S(0) 
is a Frechet space. Quasinormed linear spaces 
and Frechet spaces are ttopological linear 
spaces, but they need not be t|ocally convex. 
Hence it is possible that there is no continuous 
linear functional except for zero. However, the 
open mapping theorem and the closed graph 
theorem hold for linear operators from a 
Frechet space into a Frechet space. 

References 

[1] S. Banach, Theorie des operations fin- 
eaires, Warsaw, 1932 (Chelsea, 1963). 

[2] N. Dunford and J. T. Schwartz, Linear 
operators I, Interscience, 1958. 



[3] K. Yosida, Functional analysis. Springer, 
1978 . 

[4] M. M. Day, Normed linear spaces. 

Springer, 19.58. 

[5] H. E. Lacey, The isometric theory of class- 
ical Banach spaces. Springer, 1974. 

[6] J. Lindenstrauss and L. Tzafriri, Classical 
Banach spaces I and II, Springer, 1977, 1979. 

[7] J. Diestel, Geometry of Banach space. 
Springer, 1975. 

[8] A. Pelczynski, Some aspects of the present 
theory of Banach spaces. Oeuvres de Stefan 
Banach II, Warsaw, 1979. 

[9] P. Enflo, A counterexample to the approx- 
imation problem in Banach spaces, Acta 
Math., 130 (1973), 3099317. 

[10] J. Eindenstrauss and E. Tzafriri, On 
complemented subspaces problem, Israel J. 
Math., 9 (1971), 2633269. 



38 (XXI.1 4) 

Bernoulli Family 

The Bernoullis, Protestants who came origi- 
nally from Holland and settled in Switzerland, 
were a signiticant family to the mathematics 
of the 17th Century. In a single Century, the 
family produced eight brilliant mathemati- 
cians, all of whom played important roles in 
the development of calculus. 

The brothers James (1654-1705) and John 
(1667- 1748) and Daniel (1700- 1782), John’s 
second son, were especially outstanding. James 
and John were close friends of G. W. t Leibniz, 
with whom they exchanged the correspon- 
dence through which it might be said that 
calculus developed. James studied problems 
related to the ttautochrone and tbrachisto- 
chrone, as well as problems in geometry, dy- 
namics, and other fields, including the tjsoperi- 
metric problem. He was the first to change the 
name calculus summatoris to calculus integralis 
(1690). His Ars conjectandi was published after 
his death in 1713; in it is found the flaw of 
large numbers, which made his name promi- 
nent in the theory of tprobability. James had 
little guidance, learning mathematics on his 
own. He was a professor of experimental 
physics at the University of Basel and later 
became a professor of mathematics. He taught 
mathematics to his brother John, who suc- 
ceeded him as professor at the University. 
John’s many achievements appeared in such 
publications of the time as Acta eruditorum 
and Journal des savants, in 1701, the begin- 
nings of the Icalculus of variations were seen in 
his solution to the isoperimetric problem. He 
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was the first to use the term functio, the root of 
the present term /unct/on (1714). 

Despite discord between the brothers and 
also between fathers and sons, the Bemoullis 
were ardent teachers and brilliant researchers, 
who instructed not only their sons but also 
such mathematicians as tEuler. Their achieve- 
ments were numerous in consolidating the 
content and form of calculus and also in ex- 
panding its application. Daniel was especially 
outstanding in the theory of probability; he 
also made contributions to the field of thydro- 
dynamics and to the tkinetic theory of gases. 
The eldest John’s eldest son Nicolas (1695- 
1726) achieved distinction as a professor of 
mathematics in St. Petersburg. Daniel’s 
youngest brother, John (1710-1790), suc- 
ceeded his father, John Sr., as a professor at 
the University of Basel. The son of John, Jr., 
also named John (1744- 1807), was the chair- 
man of mathematics at the Academy of Berlin. 
His brother, another James (1759-1789), was 
a professor of experimental physics at the 
University of Basel. Nicolas (1687-1759), a 
grandson of the founder Nicolas (162331708) 
and son of Nicolas the painter (1662-1716), 
held Galileo’s old chair of mathematics at 
Padua from 1716 to 1719. 

Nicolas 

1623-1708 



James Nicolas (painter) John 
1654-1705 1662-1716 1667-1748 



Nicolas Nicolas Daniel John 
1687-1759 169551726 1700-82 1710-90 



John James 
1744-1807 1759-89 

In this article, the first names have been 
given in English. The German names corre- 
sponding to James, John, and Nicolas are 
Jakob, Johann, and Nikolaus, respectively. 
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Bessel Functions 

A. General Remarks 

Bessel functions were first introduced in order 
to solve +Kepler’s equation concerning plane- 
tary motions and were systematically investi- 
gated by F. W. Bessel in 1824. Since then they 
have appeared in various problems and have 
become important. 



B. Bessel Functions 



Separating variables for the Helmholtz equa- 
tion A'F -I- = 0 in terms of cylindrical 

coordinates, we obtain Bessel’s differential 
equation 



d^w 1 d w 
dz^ 2 dz 




(1) 



for the component of the radius vector. The 
following two linearly independent solutions 

of(l): 






“izsin^+ivi 



'^di, 



JL, 



^ Jl, 



( 2 ) 



are called the Hankel functions of the first and 
second kind, respectively, where the contour Ly 
of the first integration is a curvC from ( — tj -i- 0) 
-hioo to — 0 — ioo, and L 2 is a curve from -i-O 
— ioo to (ti — 0) -I- too. If both z and v are real, 
we have 



where z is the complex conjugate of z. Hence 

iV,(z)5r,(z) = (H;‘'(z)-H™(z))/2i ( 4 | 

are real functions. If both z and v are complex, 
the functions J,(z) and N^{z) defined in (4) are 
also called Bessel functions and Neumann 
functions, respectively. The other names for 
J,(z), iVv(z), and H,(z) are Bessel functions of 
the first, second, and third kind, respectively. 
Each of them satisfies the following recurrence 
formulas: 



dQ(z) 

dz 



= Cv_i(z)-C,+i(z), 



(2v/z)C,(z) = C,_i(z) + C,+i(z). (5) 

In general, functions satisfying the simulta- 
neous tdifferential-difference equations (5) are 
called cylindrical functions. Every cylindrical 
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function C,(z) is represented in the form C,(z) 
= ai{v)H[^\z) + a 2 {v)Hl^'*{z), where a,(v) and 
a^iv) are arbitrary periodic functions of period 
1 with respect to v. 

If V = (an integer) we have 



7,„(z) = (-l)"J„(z), 

N_„(z) = (-l)"iV„(z), (6) 

which show the linear dependency of and 
and N_„ and N„, respectively. If v n (an 
integer), as the fundamental solutions of (1) we 
can take a pair and or and iV_^. In 
(2) if we take as a contour of integration a 
curve from ( — tc + 0) + /oo to (7i-0) + (oo, we 
obtain an integral representation for J,(z), 
which yields the relations 

J,(z), 

J.ize'n = e''"'^^J.,{z). ii) 

If V = n (an integer) and Re z > 0, we obtain 



2n 



^i2sin^ + in^ 



dC 



1 f" 

— cos{z sin C~r]C)dC„ 

"Jo 



( 8 ) 

( 9 ) 



which is called Bessel’s integral. These repre- 
sentations imply the following expansions by 
means of tgenerating functions: 

n= - CO 

CO 

cos(zsinC) = yo(z)-l-2 ^ i2n(^)cos2nC, 
n=l 

X 

sin(zsinC) = 2 ^ J2„+i(z)sin(2n-h !)(. 

n=0 

Making a change of variable u = exp( 

(2), we obtain 

where L is a contour starting at the point at 
infinity with the argument — n, encircling the 
origin in the positive direction, and tending 
to the point at infinity with the argument Ji, 
From (12) we obtain a power series expansion 

2 00 (—1)'" 

„?om!r(v + m+lO 2 ’ 

obtained also from (1) by a power series ex- 
pansion at z “ 0, which is a tregular singular 
point of (1). Substituting (13) into 

N,{z) = (cos vzJ,(z) - J_,(z))/sin vti, (14) 

we obtain a power series expansion for N^(z). 

A power series expansion for N,(z) for an inte- 
ger n is obtained by taking the limit V->tJ (— ^ 
Appendix A, Table 19). In particular, if v = 



( 10 ) 

(H) 

iQ in 
( 12 ) 
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n + 1/2 (n is an integer), we have 



J 



„,l/2(^) = (-l)" 



(2z)"^’/2 

Jn d{z^y 




n = 0, 1, 2, 

which is represented by elementary functions 
and is sometimes called simply the half Bessel 
function. Bessel functions for half-integers have 
appeared also as radius vector components 
when the variables in the Helmholtz equation 
are separated by spherical coordinates. The 
function 

j„{z) = ^/n/2zJ„+i,2i^) 

is called the spherical Bessel function. 

We have the following addition theorem: 

W/>(kp)e"-= Y J.(kr 2 )Hi!‘Ukr,)e'"\ 

m= - X 

^=1,2, 

where 



P = \/f 1 + d - 2r 1 T2 cos (p , 

pcos\j/ = r^—r2COS(p, psint/f^rjsintp. 



C. Zero Points of the Function Ji(z) 

From the differential equations satisfied by 
Jy{az), we have 

(a^-j5^) [* zJ„{az)J^{l^z)dz 
j(0 

= R(a)j;(/?) oiJ'MJM (15) 

By letting /3->a in (15), we have 

z(J,(az))2dz 

“^)(*^v(a))^ + (-^i(a))^) (16) 

If a and (5 are distinct roots ofJ,(z) = 0, we 
have from (15) 

I zJ^(az)J^{fiz)dz = 0, Rev> -!• (17) 

j'O 

The integral formulas (15), (16), and (17) are 
called Lommel’s integrals. 

As for the zero points of J ,(z), the following 
facts are well known: J,(0) = 0 ifv >0. J,(z) has 
no multiple zero points other than z = 0. — a) 

= 0 if J^(a) = 0. Every zero point of J ,(z) is real 
if V > -1. Between two adjacent zero points 
that are positive, there exists one and only one 
zero point of Jv-i(^) Jv+i(^)’ respectively. 

J.(z) has a countably inlinite set of zero points 
on the real axis. When v ^ 0 is rational, every 
zero point of J,(z) except z = 0 is a ttranscen- 
dental number. The transcendentality of K is 
a special case of this result for v = 1 /2. 
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D. Expansion by means of Bessel Functions 



Let f{r,(p) be defined for 0<r< 1, -n<(p<n, 

and 0£„ j , (X„_2) • • • > ^n.s > • • ■ (0 < tXn.s ' s = 

1, 2, for every n) be zero points of J„{x) {n = 

0, 1, 2, ... ). Then we have an expansion 

CO 00 

/(^ <P)= I I (n„„,cosn(p + 6„^,smn(p)J„{a„,/), 

n=0 s=l 

(18) 

which is called the Fourier-Bessel series. The 
coefficients a„ ^ and b„ ^ are determined by the 
properties of tpourier series and (16) and (17) 
as follows: 






f{r, r d(p dr; 

sm n(p 



£o = l, £i=£2 = --- = 2. 
The integral transformation 

f ® 

d(y)= xf{x)J„{xy)dx (19) 

Jo 



is called the Fourier-Bessel transform. If f(x) is 
sufficiently smooth and tends rapidly to zero 
as x->00, the following inversion formula 
holds: 



fix) 



■*00 

Jo 



yg{y)J„{xy)dy. 



(20) 



There are other types of series expansions in 
terms of Bessel functions as follows: Dini’s 
series 



m=l 

(2„ is the mth positive root of xJ((x) + HJ^{x) 
= 0, where H is a real constant); Kapteyn’s 
series 



GO 

Z amJy+miiv + m)x)- 

m=l 

Schldmilch’s series 

Cl ^ 

I. 

2. m-0 

and the generalized Schlomilch series 

1 ^0 ^ a^ J,(mx) + b„H,(mx) 

2r(v+ l)"*"m=i (mx/iy 

where H^(mx) is the Struve function (- 
Section F). 



E. Asymptotic Expansion 

If z| or V is sufftciently large, the asymptotic 
representation for Bessel functions is obtained 



by applying the tmethod of the steepest de- 
scent for (2). If |z| > I v|, we have 









JAz)' 



NAz)' 



n n 
expii z — V — 



-7i<argz<27t, 



—exp -I Z--V 



cos Z V — 



/ 2 . / n 71 
' — sin z — V — 
7tz \ 2 4 



— 27r<argz<7r, 



— 7i:<argz<7r, 



Hence HA\z) tends to zero as |z|-+oo in the 
upper half-plane, and becomes large exponen- 
tially as |z| -> 00 in the lower half-plane. The 
results for are obtained by interchanging 
“upper half-plane” with “lower half-plane” in 
this statement. 

If both |z| and |v| are sufficiently large, we 
have the Debye asymptotic representations. 

For example, if z = v sec j? (v > 0, > 0), we have 

HA ' ^*(v sec /5) ~ {tzv tan jS/2)"^^^ 

X exp( ± i(v(tan — 15) — tt/ 4)). 
Ifz=vsecha (v>0, a>0), we have 
Jv (v sech a) 

~ (271V tanh exp v(tanh a — a), 

Av A sech a) 

~ (nv tanh a/2)~'^^ exp v(a — tanh a), 

If |v| ~|z|, we have 
tanjS 



//i'’^*(vsec)5)' 






xexp( ±M — 






tanjS — -tan^/I — 



xH[)i^\(v/3)tan^ P) + 0{v~^), 
which is called Watson’s formula. 



F. The Wagner Function 



As an application of Bessel functions to the 
theory of nonstationary aircraft wings, T. 
Theodorsen introduced the function 



C(z) = H[^Az)/iH^o^Az)+ 

[6], and H. Wagner considered the function 



^1(5) = :^ 

ZTU 



2C(-iw) 

gws 

w 
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[5], where are Hankel func- 

tions, and Jgr means a Bromwich integral 
giving the inverse Laplace transform. Then 
C(z) and k,(s) are called the Theodorsen func- 
tion and Wagner function, respectively. The 
function k,(s) is equal to the lift coefficient 
when a 2-dimensional flat wing suddenly 
proceeds forward a distance s at an angle of 
incidence 1 /tt. 



G. Functions Related to Bessel Functions 



[6] T. Theodorsen, General theory of aero- 
dynamic instability and the mechanism of flut- 
ter, NASA Tech. Rep. 496 (1935). 

Also references to 167 Functions of Conflu- 
ent Type, 389 Special Functions. 
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The following functions are closely related to 
Bessel functions (- Appendix A, Table 19.IV). 

(1) Modified Bessel functions. 

I,(z) = 

2 sm V7i 



(2) Kelvin functions. 
ber^(z) ± i bei„(z) ^ 
her,(z)± /hei„(z) = 
ker,(z)= -{7r/2)hei,(z), 
kei,(z)=(7r/2)her,(z). 

(3) Struve function. 



H.(z)= X 



(~f)-(z/2)^'^ 



F[m + (3/2)] r [v + m + (3/2)] 

(4) Anger function. 

1 



Jv(2) = - 



cos(v0 — zsin 9)dd. 



When V is an integer n, we have J,(z)= J„(z). 

(5) H. F. Weber function. 



1 f" 

E„(z) = - sm{v9 — zsin0)d0. 

^ Jo 

The last three functions satisfy certain in- 
homogeneous Bessel differential equations. 
Many other functions, such as tAiry’s integral, 
can be represented by Bessel functions. 
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A. General Remarks 

Biometrics is the branch of science that ap- 
plies mathematical and statistical methods 
to biological problems, and deals with all the 
phenomena that affect the physical, social, and 
psychological well-being of human beings. 
These phenomena involve the relationships 
of groups of human beings to other human 
beings, to animals, microbes, and plants, and 
to physical and chemical elements in the en- 
vironment. In dealing with these problems the 
biometrician encounters such theoretical tasks 
as analyzing autocorrelated data in time series, 
and such practical undertakings as cost-vs- 
benefit evaluations of health programs. 

Biometrics began in the middle of the 17th 
century when Sir William Petty and John 
Graunt developed a new method of analyzing 
the London Bills of Mortahty. Petty and 
Graunt essentially invented the field of vital 
statistics by studying the reports of christen- 
ings and causes of death and proposed a 
method called “political arithmetic.” 

Some fields to which biometries is relevant 
are given below. 



B. Statistical Genetics 

After early development in vital statistics, 
statistical genetics was founded on the new 
ideas emerging in statistics. Major contri- 
butions were made by Charles Darwin (1809- 
1882), Francis Gallon (1822-1911), Karl 
Pearson (1857- 1936), and Ronald A. Fisher 
(1890-1962). 

Gallon was the first to use the term “regres- 
sion” in statistics (-^ 403 Statistical Models 
D), when he observed that sons regressed 
linearly on their fathers with respect to stature. 
He called the phenomenon a “regression to 
mediocrity” because the deviations of the 
stature of sons were less than those of fathers. 
This gave rise to the measurement of cor- 
relation in the bivariate normal distribution 
by means of the coefficient of correlation 
(Pearson, 1897). Pearson is credited with the 
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creation of the discipline of biometry (bio- 
metrics), and he established the journal Bio- 
metrika to promote studies in the tield. Fisher’s 
major contributions were to genetics and 
statistical theories. His General theory of nat- 
ural selection appeared in 1930. This landmark 
book, along with earlier and later publica- 
tions, represented Fisher’s attempts to give 
quantitative form to Darwin’s views and to 
frame a statistical theory of evolution. 

C. Bioassay 

Bioassay is a set of techniques for evaluating 
the effectiveness of dosages of drugs by moni- 
toring biological responses. It entails the use of 
special transformations, such as probits and 
logits, as well as the application of regression 
to the estimation of dosages that are p percent 
effective within stated confidence limits. Prob- 
lems to be solved include measuring relative 
potency, slope-ratio assays, and quantal re- 
sponses vis-a-vis tolerance distributions. 

D. Demography 

Demography, which includes traditional vital 
statistics, rates and ratios, life tables, com- 
peting risks, actuarial statistics, and census 
enumeration techniques, is a part of bio- 
metrics. In this category, many tabulations of 
data consist of time series of events or rates 
classified by age. For the analysis of such data, 
the cohort analysis techniques described in 
Hastings and Berry [3] are employed. 

E. Epidemiology 

The quantitative description of an epidemic 
should state the sensitivity and specificity of 
any diagnostic tests, as well as the true in- 
cidence or prevalence of the epidemic from 
survey results. Within an epidemiological 
theory, a disease is studied by the use of deter- 
ministic and stochastic models, wherein the 
theory of Markov chains can be applied (- 
260 Markov Chains). Differential equations 
involving probability-generating functions or 
tmoment-generating functions can be solved to 
yield the tprobability distribution, tmean, and 
^variance of the number of infected individuals 
as functions of time [4,5]. Fundamental to 
this whole field of application is a clear under- 
standing of causality and association. 

When clinical trials are possible, two groups 
of persons, “treated” and “untreated,” are 
monitored over a period of time with regard to 
the incidence or recovery from the disease 



under study. The techniques in this procedure 
include compiling reports on persons to be 
observed, using double-blind techniques, and 
combining multiple -response variables by use 
of multivariate analysis (- 280 Multivariate 
Analysis). 

If clinical intervention is forbidden on 
ethical grounds, e.g., studying congenital mal- 
formation by infecting pregnant women with 
German measles, the relative risk of expo- 
sure is estimated from retrospective studies. 

In practice, almost all statistical studies in 
epidemiology are retrospective with the sole 
exception of clinical trials. The research is ex 
post facto because investigators are mostly 
confined to describing and analyzing sudden 
and/or obvious events in the etiology of the 
disease. 

F. Clinical Trials 

In clinical trials, many problems have arisen 
for which biometricians have had to develop 
special techniques. One SUch technique takes 
account of unexpected adverse effects of drugs, 
and the consequent early termination of a 
trial. Moreover, when data demonstrate a 
trend earlier than expected, investigators will 
desire to end the accession of patients and to 
stop further treatment with what may be an 
inferior regimen. This means that the bio- 
metrician must be familiar with the problems 
of multiple examinations of data, multiple 
comparisons, and other adjustment procedures 
required by the ex post facto dredging of data. 

G. Future Trends 

There are two areas in which the biometrician 
has played a leading role recently. These are 
pertinent in many different applications and 
problems, and considerable methodological 
research has been devoted to the two areas. 

The areas “mathematical modeling” and 
“effects of hazardous substances.” 

Mathematical Modeling. The relationship 
between a set of independent variables and the 
dependent or response variable(s) is usually 
referred to as a mathematical model. The 
model may take the form of a standard multi- 
ple regression analysis with a single response 
variable or with multiple response variables as 
in multivariate analysis (- 280 Multivariate 
Analysis). 

It is generally assumed that specialists with 
substantive knowledge of the specitic field 
of applications (epidemiology, toxicology, 
pharmacology, radiology, genetics, etc.) play a 
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crucial role in determining a model or rela- 
tionship between a set of independent vari- 
ables and response variable(s). However, it is 
mainly the biometrician who finally selects the 
specific model establishing the functional 
relationship and who attempts to measure the 
strength or influence of the independent vari- 
ables therein. The biometrician is also ex- 
pected to contribute substantially to the deci- 
sion as to whether the relationship is causal, or 
merely one of association or correlation. For 
example, in the measurement of carcinogenic- 
ity of a food additive or drug, questions arise 
as to whether a substance can be judged harm- 
ful if it “accelerates” the appearance of a tumor 
even though it does not increase the incidence 
of the abnormal growth. In general, the answer 
to this question is in the affirmative when an 
unequivocal dosage-response relationship 
is indicated between the substance and the 
tumor. 

Effects of Hazardous Substances. With the 
successful conquest of most of the infectious 
diseases that have plagued mankind through- 
out history, health authorities have recently 
been concentrating on two chronic diseases 
whose etiology is yet to be determined: cardio- 
vascular disease and cancer. In both cases 
there is no disagreement with the thesis that 
heredity exercises a determining influence, but 
the role of the environment in causing many 
cases is also unquestioned. Measurement of 
the risk due to potentially toxic substances in 
the environment, principally with respect to 
these two diseases, represents the greatest chal- 
lenge to the biometrician today. The social 
benefits of SUCCCSS make this a tantalizing area 
of research, though exceptional complexities 
are involved. 

For related topics 263 Mathematical 
Models in Biology. 
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Boltzmann Equation 

A. Introduction 

The Boltzmann equation is an equation of 
motion of a rarefied gas given by L. Boltzmann 
in 1872 [1]. He used it successfully in his 
pioneering work on the kinetic theory of gases 
and on the more general statistical mechan- 
ics (- 402 Statistical Mechanics), but the 
equation was held to be inconsistent with the 
^classical mechanics used in its derivation; 
objections were raised by, e.g., J. Loschmidt 
(1895) and E. Zermelo (1896). Through the 
controversies it became widely recognized that 
the Boltzman equation should be justiiied by 
means of the ttheory of probability, and many 
such justifications have been proposed. Also, 
this equation has been studied extensively 
as a significant nonlinear partial differential 
equation. 



B. Boltzmann Equation 

Let / = f{t, X, be the density of gas molecules 
having position x £ and velocity ^ £ R^ at 
time t. The Boltzmann equation is a conser- 
vation law for f of the form [1-3] 

( 1 ) 

Here a{t, x) denotes the external force and Q is 
a quadratic nonlinear integral operator in 
space describing binary collisions of the mole- 
cules. The integral kernel of Q, called the colli- 
sion cross section, depends on the intermole- 
cular forces. The two classical examples are the 
hard ball model (a gas of rigid spheres) and the 
inverse power law potential (one proportional 
to r *, where r is the intermolecular distance 
and 5 > !)• The latter gives rise to singularities 
in the kernel, and a cutoff is customarily em- 
ployed to avoid this difficulty. Grad’s hard 
(s ^ 5) and soft (s < 5) cutoff potentials may be 
effectively used in the study of (1) [3]. 

If the gas is contained in a vessel (domain), 
then f must also satisfy boundary conditions 
determined by the assumed law of reflection 
at the walls (boundaries) (specular reflection, 
random reflection, etc.). For spatially homo- 
geneous gases with no external forces, (1) re- 
duces to 

f,-QUl ( 2 ) 

with f = f{t, ^). The Maxwellians, those solu- 
tions satisfying the tMaxwell-Boltzmann dis- 
tribution law, are the only stationary solutions 
of (2); they describe equilibrium states. 
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C. Justification by tbe Theory of Probability 

Consider (2). The collision process is a tMar- 
kov process, and the tmaster equation ob- 
tained from the tChapman-Kolmogorov equal- 
ity is equivalent to (2) under the assump- 
tion of “propagation of molecular chaos” (G. 
Uhlenbeck (1942)). This assumption becomes 
valid in the limit as N (total number of mole- 
cules) ->co (M. Kac (1955), H. McKean 
(1967), F. Griinbaum (1971)) [4]. Also, non- 
linear Markov processes defined by (2) (with 
probability density fjN) have been studied 
(McKean (1967), H. Tanaka [5]). No such 
results have yet been established for (1). 

D. Existence of Solutions 

T. Carleman [2] gave the first solutions of the 
Boltzmann equation, solving globally in time 
the TCauchy problem for (2) for the hard ball 
model. His result has been extended to a wide 
class of potentials. 

The spatially inhomogeneous case (1) is also 
known to have global solutions if a(t, x) = 0 
and if the initial data for f are nearly Max- 
wellian. This was first proved by S. Ukai [6] 
assuming periodicity in x, and then proved for 
the Cauchy problem by Ukai [7] and T. 
Nishida and K. Imai [8], and for the initial 
boundary problem by J. P. Guiraud [9] (ran- 
dom reflection) and Y. Shizuta and K. Asano 
[ 10] (specular reflection) for bounded domains, 
and by Ukai and Asano [1 1] for exterior 
domains, all assuming Grad’s hard cutoff 
potentials. The case of soft cutoff potentials 
was also solved (Ukai and Asano, R. Caflisch 
(1980)). All the solutions are unique and 
tend to Maxwellians as oo, with certain 
decay rates. There are some results also on 
1 -dimensional shock wave solutions (B. Nico- 
laenko (1975)) and stationary solutions (J: P. 
Guiraud (1972) (bounded domains), Ukai and 
Asano (1980) (exterior domains)). 

For initial data far from the Maxwellian, (1) 
remains unsolved even locally in time for hard 
cutoff potentials. As for noncutoff potentials, 
the existence theorem in [5] is the only result 
known so far for either (1) or (2). 

E. Hilbert Expansion 

Put f f/N with s= 1/N.Jf has a power 
series expansion in a, then (1) gives an infinite 
system of equations for the coefficients. D. 
Hilbert (19 12) proposed a method of solving 
this system as an application of his theory of 
integral equations, initiating the attempt to 



solve the Boltzmann equation. No proofs of 
convergence exist, but his expansion and the 
improved one by S. Chapman (1917) and D. 
Enskog (19 17) made possible the applications 
of (1) to hydrodynamics. The zeroth-order 
approximation to the expansion gives rise to 
the compressible Euler equation, and the first- 
order approximation yields the compressible 
Navier-Stokes equation. The solutions of 
the Cauchy problem for (1) with nearly Max- 
wellian initial data converge to those of the 
compressible Euler equation locally in t as 
£^0 (Nishida [12], Caflisch (1980), and 
to those of the compressible Navier-Stokes 
equation asymptotically as t 00 when e> 0 is 
fixed [13]. 
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A. Boolean Algebras 

Boolean algebra was introduced by G. Boole 
to study logical operations (-411 Symbolic 
Logic). It is now included within the more 
general concept of ■^lattice or lattice-ordered 
set (- 243 Lattices) and appears not only in 
logic but also very often in analysis in the form 
of a particular lattice of sets, e.g., the lattice of 
tmeasurable sets. 

Let L be a given set and suppose that to any 
pair of its elements x, y there correspond two 
elements xUy,xriyofL (called join and meet 
of X and y, respectively) such that the follow- 
inglaws arevalid: (1) xUy = >'Ux, xny = yrix 
(commutative law); (2) x U (y U z) = (x U y) U z, 

X n (y n z) = (x n y) n z (associative law); (3) x U 
(y n x) =(x U y) n X =x (absorption law); (4) x U 
(y n z) = (x U y) n (x U z), X n (y U z) = (x n y) U 
(x n z) (distributive law); (x, y, zeL). Lrom 
(1), (2), and (3) it follows further that x U x = 

X n X = X (idempotent law). If x ^ y is defined 
to meanx U y = y, L becomes an ^ordered set 
with respect to the ordering ^ Now suppose, 
moreover, that the following law holds: (5) 
there exist a least element 0 and a greatest 
element /, and for any element x there exists an 
element x’ satisfying xUx’ = /, xrix'=0 (law of 
complementation). Then L is called a Boolean 
algebra (or tBoolean lattice). In this case x’ is 
uniquely determined by x and is called the 
complement of x. The binary operations 
(x, y)-x U y, X n y together with the operation 
X-»x' are called Boolean operations. These 
operations obey de Morgan’s law (x.UyJ' = 
x'ny',(xny)'=x'uy'. 

B. Generalized Boolean Algebras 

Suppose that a ^ b holds for two given ele- 
ments a, b of an ordered set. Then the set of all 
elements x satisfying a^x^bis denoted by 
[a, b] and is called an interval. An interval of a 



Boolean algebra is also a Boolean algebra with 
respect to the induced operations U and (1, 
where the least and greatest elements are a and 
b, respectively, and the complement of x in 
[a, b] is equal to a U (x’ fl b) =(a U x’) fl b. More 
generally, if a set L with two operations U, fl 
satisfying (l)-(4) above has a least element 0 
and if each interval of L satisties (5) (i.e., is a 
Boolean algebra), then L is called a generalized 
Boolean algebra. 

C. Boolean Rings 

A ring L satisfying the condition xx = x for all 
X e L (i.e., all of its elements are tidempotent) is 
called a generalized Boolean ring, and if it has 
a unity element then it is called a Boolean ring. 
A generalized Boolean ring L satisfies x -i- x = 0 
for all X e L and is necessarily a commutative 
ring. A (generalized) Boolean algebra L be- 
comes a (generalized) Boolean ring if for any 
elements x, y of L the sum x -i- y is defmed to 
be the complement of x fl y in the interval 
[0, X U y], and the product xy is delined to be 
X (ly. A nonempty subset J of a (generalized) 
Boolean algebra L is an ideal with respect to 
the corresponding structure of the ring if and 
only if xUysJ for x, yeJ and xHyeJ for 
xeJ, yeL. More generally, in any lattice, a 
nonempty subset that satisfies these conditions 
is sometimes called an ideal of the lattice. 

D. Representation of a Boolean Algebra 

Any Boolean algebra L is isomorphic to a 
Boolean lattice of subsets in a set X. If L is of 
finite theight, then L is isomorphic to the 
Boolean lattice ^(A) of all subsets of X. In 
general X can be taken to be the set of all 
max im al ideals of L. Let aeL and let 0(a)= 
(mlmeX, a^m}. The isomorphism is obtained 
by the mapping a-^0{a). If we define a topol- 
ogy in X such that (0(a) aeL} is the topen 
base, then X is a compact, totally discon- 
nected space and 0(a) is characterized as 
a compact open set in X. Such a space X is 
called a Boolean space (M. H. Stone [3, 4]). 

In any complete Boolean algebra L, the 
complete distributive laws hold: (sup/X,)n y = 
supj(x; n y) and its dual. These are equivalent 
to the stronger relations: (sup|X;)n(supjyj) = 
sup;j(x,- n Tj) and its dual. In order for a 
Boolean algebra L to be isomorphic to the 
Boolean algebra ']3(2f) of all subsets of X, it is 
necessary and sufficient that the following 
strongest complete distributive laws hold: 
inb(supj(,.)Xi^.) = supF(inf;X,. ^(i)) (where F is the 
set of all functions (p assigning to each ie I a 
value (p(i)GJ(i)) and its dual. 
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A. General Remarks 

A complex-valued function defined on a subset 
E of the complex z-plane is called a bounded 
function defined on E if its range f(E) is 
bounded, that is, if there exists a positive 
constant M such that |/(z)| ^ M on £. How- 
ever, when studying the theory of bounded 
functions, we usually restrict ourselves to the 
consideration of tanalytic or tharmonic func- 
tions. On the other hand, the classes of func- 
tions f(z) satisfying conditions such as Re/(z) 
> 0 or a < arg/(z) < rather than \f{z)\ ^ M 
are studied by a method similar to that ap- 
plied to the study of bounded functions. 

tSchwarz’s lemma, tLiouville’s theorem, and 
tRiemann’s theorem on the removability of 
singularities (which will be explained later) are 
among the classical theorems in the theory of 
bounded functions. 

B. Maximum Principle 

When a function f(z) is holomorphic and not 
constant in a domain D of the complex plane, 
|/(z)| never attains its maximum in the interior 
of D . In particular, when f(z) is continuous on 
the bounded closed domain D = D U dD, the 
max im um of |/(z)| on D is taken on its bound- 
ary dD. This fact is called the maximum (mod- 
ulus) principle. 

As a direct application of the maximum 
principle, we can deduce Schwarz’s lemma: If a 
holomorphic function f(z) in |z| <R satisfies 
|/(z)KMand/(0)=0, we have |/(z)|^M- 
|z|/R (|z| <R). The equality at Zq, 0<|zo| <R, 
occurs only for the functions f(z) = e'^Mz/R 
(where A is a real constant). 



C. Lindel5f’s Principle 

E. Eindelof extended Schwarz’s lemma and 
obtained various extensions of the maximum 
principle, from which he, together with E. 
Phragmen, deduced several useful theorems on 
the behavior of a function that is single-valued 
and holomorphic in a neighborhood of the 
boundary. We mention some representative 
theorems: 

Eet z = (p{Q and w = tjj{Q both be tmero- 
morphic and tyni valent functions in |{|< 1 that 
map ICI < 1 onto and D„,, respectively. Set 
(P(0) = Zq and t/^(0) = WQ. Let D,(p) and D,(p) 
denote the images of|(|^p(0<p<l) under 
the mappings (p and \j/, respectively. Under 
these circumstances, if a function f(z) that is 
holomorphic in satisfies f{D^) c and /(zq) 
= Wq, then mip)) c D,(p). Eurthermore, 
unless f(z) maps onto D.^ univalently, 
f{D^{p)) is contained in the interior of D,(p) 

(an extension of Schwarz’s lemma). 

Eet f(z) be analytic in a bounded domain D 
but not necessarily single-valued. Suppose that 
|/(z)| is single-valued. Suppose, furthermore, 
that there is a positive constant M such that, 
for each boundary point ( of D , except for a 
finite number of boundary points and for each 
£ > 0, the inequality |y(z)| < M +e holds on the 
intersection of D with a suitable neighborhood 
of C, and suppose also that each of the excep- 
tional points has a neighborhood such that 
/(z) is bounded on the intersection of D with 
this neighborhood. Under these assumptions 
we have |/(z)| < M . Moreover, if |/(zo)[= M at 
a point Zq of D , then f(z) is a constant (an 
extension of the maximum principle). 

Eet f(z) be holomorphic in an angular 
domain W : arg z| < a7t/2. Suppose that there is 
a constant M such that, for each £ > 0, each 
finite boundary point has a neighborhood 
such that |/(z)| < M -f £ on the intersection of D 
with this neighborhood, and that for some 
positive number fi> CL and for sufficiently large 
|z| the inequality |/(z)| <exp|z|'^^ holds. Under 
these assumptions we have |/(z) ^ M in D 
(Phragm^n-Lindelof theorem). 

Eet f(z) be a function that is holomorphic 
and bounded in a closed angular domain W: 
(X^argz^f] except for the point at infinity. 
Suppose that f{z)-*a as z^ oo along a side of 
the angle and that f{z)-*b as z-^oo along the 
other side of it. Then we have a= b and /(z)->a 
uniformly as z-> oo in W (Lindelof ’s asympto- 
tic value theorem). 

D. Bounded Functions in a Disk 

If f(z) is a bounded holomorphic function in 
the unit disk |z| < 1, it has a limit at every 
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point Zq on the circle C: |z| = 1, except for a set 
of 1 -dimensional measure zero, as z tends to Zq 
from within an angle with vertex at Zq and con- 
tained in |zj < 1 (or along a +Stolz’s path at Zq) 
(Fatou’s theorem). Under the same assump- 
tion, if the boundary value function = 

lim,._.i_o/(r^‘^), which exists by Fatou’s theo- 
rem, is equal to a constant a for a set of posi- 
tive measure on the circle C, then f(z) = a in 
\z\ < 1 (F. and M. Riesz theorem). These theo- 
rems are valid for some kinds of meromorphic 
functions in |z| < 1 (- 272 Meromorphic 
Functions D). 



E. Three-Circle Theorem and Related 
Theorems 

Let f(z) be a function that is single-valued, 
holomorphic, and not identically zero in an 
annulus p< |z| <i^. Set M(r) = maX| 2 |=,|/{z)| 

(p < r < R ) for f(z). Then log M(r) is a tcon- 
vex function of log r in log p < log r < log R 
(Hadamard’s three-circle theorem). The same 
assertion holds for a function f[z) that is not 
necessarily single-valued, as long as |/{z)| is 
single-valued. When f(z) is single-valued, a 
stronger assertion can be obtained (0. Teich- 
mtiller). The following theorems are regarded 
as the analog of Hadamard’s three-circle 
theorem for the respective basic regions: 

Set L{(t) = sup ^ |/((7 + it) \{oi<a<^) 
for a function that is bounded and regular in a 
strip a < Re z < /?. Then log L(a) is a convex 
function of (j in a < (T < j? (Doetsch’s three-line 
theorem). 

Set /(ff) = lim sup,_,^ |/{(T + it)| (a <(J <(i) for 
a function that is holomorphic and bounded in 
a half-strip a < Re z < j?, hn z >0. Then log 1(a) 
is a convex function of a in a < c < ^ (Hardy- 
Littlewood theorem). 

Set 

271 J« 

for a holomorphic function in a disk |z| <i?. 
Then for every p > 0, log i,(d is an increasing 
convex function of logr for — oo < logr < log R 
(Hardy’s theorem). 



F. Hardy Class 

Hardy’s theorem motivates us to introduce a 
class of functions. An analytic function / in the 
unit disk is said to belong to the Hardy class 
//<’ (0< p< ») if /,(rj) = (271) Ifire^Wde 
remains bounded as r->l. For the case p= oo, 
/ e if /co(r,/) = maX| 2 |=d/(z)| is bounded. 



An analytic function fis said to be of the class 
N if Jo" log'*' |/(re'®)| d9 is bounded for r< 1. 

/ e N if and only if f = cp/ij/, where 
f E H’’ and only if \f\’’ has a tharmonic majo- 
rant. If 1 ^ p ^ oo, //t’ is a tBanach space with 
the norm ||/l|p = (sup,<i Ip{r,f))^^^ for p< go 
and ll/ll ^ = sup,<i /«,(r,/) for p = 00, andifO< 
p < 1, //T’ is a complete metric space with the 
metric sup^<i lpir,f~g). For Q<p<q< oo the 
inclusion relation N D 3 holds, so 

that Fatou’s theorem and that of F. and M. 
Riesz are valid for functions of (p > 0). 

For a nonnegative integer p and a sequence 
of nonzero complex numbers a„ |a^| < 1, the 
infinite product B(z) = z7’II*=i (««(«„ - z)!\ a„ | ( 1 
a„z)) converges locally uniformly in the unit 
disk if and only if S“=i(l -|«„|)<oo. Then 
B(z) is called a Blaschke product and { a, } a 
Blaschke sequence. If the sequence {a,}U {0} is 
the set of zeros of an analytic function / (an m- 
tuple zero appears m times in {«„}, and z = 0 is 
a p-tuple zero), B(z) is said to be generated by 
the zeros of f. UfsN, then Z®=i(l 
for the set {a,} of zeros off: An analytic func- 
tion f in the unit disk is called an inner func- 
tion if its modulus is less than 1 and its non- 
tangential limit on the unit circle is of modulus 
1 almost everywhere. A Blaschke product is an 
inner function. An inner function f without 
zeros is said to be singular if f(0) > 0. A sin- 
gular inner function f can be represented by 
/(z) = exp{ -Jo"(e‘®-Fz)/(c'®-z)dp(0)} with a 
positive measure p ^singular with respect to 
Lebesgue measure. An outer function F(z) for 
//^ ( P > 0) is an analytic function that can be 
represented by 

F(z) = e‘''exp|— -p^Jogil/(0)d0j, 

where r is a real number and i^(0) is a non- 
negative function such that i//(0)e L’’ and 
logi//(0)eLf An f/T'-function /(p> 0 ) is fac- 
torized uniquely as f(z) = B(z)S(z)f(z), where 
B(z) is a Blaschke product (generated by the 
zeros off), S(z) a singular inner function, and 
f(z) is an outer function for (with \l/{0) = 
|/(g‘®)l) (factorization theorem). Conversely, 
such a product belongs to H’’. The inter- 
polation problem raised by R. C. Buck asks 
whether or not there exists a sequence of dis- 
tinct points {z„ lz„| < 1} such that, given any 
bounded sequence of complex numbers |w„}, 
there is a function /e fT” for which f{z„) = w„. 
The sequence {z,} in the problem is called an 
interpolating sequence. L. Carleson proved that 
a necessary and sufficient condition for {z,} to 
be an interpolating sequence is that there be a 
^>0 such that n„^kl(z„-zt)/(l -ZfcZ„)|^(5 
(k = 1, 2, . . . ). An interpolating sequence is a 
Blaschke sequence, and any sequence {z, 1 1 z„ 

-♦ 1} contains an interpolating subsequence 
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tending to the unit circle. An analogous theo- 
rem is obtained for a function of //^ (p ^ 1). 



■''rational approximation of bounded analytic 
functions. 



G. Corona Problem 



The Hardy space //“on the unit disk D is 
furthermore a commutative ''Banach algebra 
with identity. Let M denote the 'maximal ideal 
space of //“. It is a compact Hausdorff space 
with respect to the tGel’fand topology. To 
each point z£ D, there correspond a homomor- 
phism (p^' <Pj(/) =f(z) for f £ and a maxi- 
mal ideal = { /e //“ |/(z) = 0 } as its kernel. 
This correspondence gives a continuous injec- 
tion of D into the maximal ideal space M of 
//“. By identifying z and M^, D is regarded as 
an open subset of M , and the corona problem 
then asks: Are there points of M outside the 
closure of D ? The negative answer was given 
by L. Carleson, that is, D is dense in A4 (corona 
theorem). The corona theorem is obtained as a 
direct consequence of the following theorem 
and is equivalent to it. For functions f-e //“, 
i=l,2,..,n, satisfying 2 l/i(^)| > ^ in D with 
some ^>0, there exist such 

that Z/i{z) 0 ,(z) = 1 in D . As a trivial conse- 
quence of the corona theorem, we have the 
following cluster value theorem: LetCaC) 
denote the 'cluster set of /£ //“ at ( with 
I I = 1 ; then C{f,Q=f (M^), where / is the 
Gel’fand transform off and is the 'fiber of 
M over M is decomposed into pairwise 
disjoint 'Gleason parts. A Gleason part is a 
connected open set or a singleton according as 
each point of it is captured in the closure of an 
interpolating sequence or not. Each point of 
the 'Shilov boundary T of M forms by itself a 
trivial Gleason part, and F is 'adherent tO a 
sequence S in D with the property that the set 
of its nontangential limit points covers the unit 
circle (Brown-Shield-Zeller theorem). Let D 
be, in turn, a general bounded domain. The 
corona theorem is proved for some classes of 
such domains. A Shilov boundary F lies only 
over the set of points on dD which are not 
'removable for bounded analytic functions. It 
is adherent to a sequence S in D on which 
supsl/(z)| = ll/IL- For any/e//“ and CedD, 
the boundary of/(M^) is contained in /(T^) if 
= F n is not empty. This, together with the 
cluster value theorem, has an important im- 
plication in the theory of cluster sets. The ex- 
tremal problem of maximizing f '(zo)|, Zq£ D , 
among all /e //“ satisfying H/l| oo ^ 1 has the 
unique normalized solution G such that G’{Zq) 
= max |/'(zq)|. G(z) is called the Ahlfors func- 
tion. The Ahlfors function has unit modulus on 
the Shilov boundary (Fisher’s theorem). The 
theory of //” is also applied to the problem of 



H. Applications of the Maximum Principle 

Theorems of the following type are useful for 
some problems of holomorphic functions: 

Let D be a domain. Suppose that there 
exists an arc of angular measure a that is on a 
circle of radius R centered at a point Zq of D 
and not contained in D . Let C denote the 
intersection of the boundary of D with the disk 
|z — ZqI < R. If f(z) is a single-valued holom- 
orphic function that satisfies /{z)| and if 
lim sup 2 _^ f (z)| ^ M for every (e C, then the 
inequality /(zq)| ^ holds for every 

positive integer n satisfying 27c/n a (Lindelof ’s 
theorem). 

Let D be a domain bounded by two seg- 
ments OA, OB both starting from 0 and mak- 
ing an angle na, and a Jordan arc AB, and 
let R be the maximal distance between 0 and 
the points on AB. Suppose that f(z) is holo- 
morphic in D and that lim sup] /{z)) as z-» 
dD with z£ D is not greater than M for 
( e OA U 0 B and m for ( e A B . Then we have 
(where /i = (lzl/R)^^*) at every 
point on the bisector of the angle /_ A 0 B in D 
(Carlemann’s theorem). 



I. Holomorphic Functions with Positive Real 
Parts 



Holomorphic functions with positive real parts 
are intimately connected with bounded func- 
tions. Concerning these functions we have the 
following classical result, which is equivalent 
to Schwarz’s lemma: If f(z) is holomorphic in 
z| < R , Re f (z) ^ 0 in the same domain, and 
m = 1, then {R -\z\)/{R + |z|) < Re/(z ) ^{R + 
|z|)/(R — |z|)(|z| < R). The right or left inequality 
becomes equality for some Zq, 0 < |zq| < R, only 
if /(z) = (Rzo +Zoz)l{R ±ZoZ), respectively. 

In order to prove various results for the 
class of functions with positive real part, 
Herglotz’s integral representation, which is 
based on Poisson’s integral representation and 
unique to this class, can be used effectively. It 
is given by 






z = 



^ +i_ 

0 



■dp{(p), \z\<R, 



where p{(p) is monotone increasing (real- 
valued) with total variation 1 and is deter- 
mined uniquely up to an additive constant by 
f(z). An analogous integral representation is 
introduced for a holomorphic function in an 
annulus. 

Similar to the case of holomorphic functions 
is the notion of Hardy classes to harmonic 
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functions, that is, a harmonic function u in the 
unit disk is said to be of the class /i'’ (0 < p ^ 
oo) if Ip{r, u) is bounded for r < 1 . M e if and 
only if M + ive H’’, where v is conjugate to u.h^ 
is equal to the vector space of functions which 
are representable as a difference of two non- 
negative harmonic functions. The Herglotz 
theorem stated above is then a corollary of the 
theorem of integral representation for h^- 
functions. 



J . Coefficient P roblems 



There are many classical results for partial 
sums and coefficients of the Taylor expan- 
sion of bounded functions in a disk. Let f(z) 

= Taylor expansion of a 

bounded function in |z| < 1. Set its partial sum 
s„(z) = Sv=oCvZ'’(n = 0, 1, • ••)• and let t,(z)= 
(l/(n-t- l))Xv=o‘^v(2) {« = 0, 1, ... ), which is the 
sequence of the arithmetic means of the par- 
tial sums (the Fejer sums). Then l/(z)l ^ 1 in 
|z|< 1 if and only if |f„(z)|<l for lz| = 1 (n = 

0, 1, ) (L. Fejer). Thus the sequences ft,(z)} 
for bounded functions f(z) are uniformly 
bounded, whereas the sequences fs„(z)} are nol 
uniformly bounded. Indeed the maximum 
value of |s„(l)| over the set of function f satis- 
fying |/{z)|^ 1 for|z|< 1 is 






J=l 



/l-3-...-(2/-l)^2 



J=0 



2-4-...-2J 



(G„~7i“' logn as n->oo). 

The following result is decisive for coeffi- 
cient problems: Set = Xj=oCp-jC^.j (/i v), 

for f(z) = 2^=0 Cr.2", let be the 
maximal eigenvalue (a nonnegative real num- 
ber) of the Hermitian matrix ( — /Lv)I1,v=o> and 
let m =lim m„( ^ 0). 

Let H be a Hermitian form in inlinitely 
many variables given by 






v=0 


/J,v = 0 


GO 


OO 


OO 2 






Z ^'v^^ + v 


v = 0 




v=0 



Then a necessary and sufficient condition for 
|/(z)| ^ 1 when |z| < 1 is that H be positive 
semidetinite, i.e., the sequence of the tprincipal 
minor determinants of H , 



1 2 
1 2 



m = 1,2 



is all positive or positive for an initial finite 
number of them and zero for the remainder 
(I. Schur). 



A corresponding result for functions with 
positive real part in a disk can be stated in a 
simpler form: A holomorphic function f(z) = 
l/2-l-X®=iC„z" in |zl<l satisfies Re/(z)>0 if 
ind only if 



1 Cl . . 

^ 1 



^n~i 



^0, n=l,2, 



1 



(C. Caratheodory). Furthermore, for n = 

1,2, , when we regard (cj, . , c,) as a point 
of complex n-dimensional Euclidean space, we 
can determine the domain of existence of 
points satisfying this criterion by Caratheo- 
dory. This result is generalized for coefficients 
of the Laurent expansion of a function that is 
holomorphic and single- valued in an annulus. 

Next, there are some results for a function 
that omits two values: If f(z) = a, -i- a, z-|- 
... (ai 9 ^ 0 ) is holomorphic in |z|<R and 
/(z)^0, 1 in the same domain |z| < R, then 
there exists a constant L{aQ, a,), depending 
only on a, and a„ such that R ^ L{aQ, Uj) 
(Laudau's tbeorem). Under these circum- 
stances, \ f{z)\^S{aQ,6) in for O<0< 1, 

where S(a„ 0) is a constant depending only on 
a, and 9 (Scbottoky's tbeorem). These theo- 
rems have applications in value distribution 
theory. On the other hand, coefficient prob- 
lems have been investigated as extremal prob- 
lems in //P-spaces under general conditions. 



K . Angular Derivative 

Let f(z) be holomorphic in |z| < 1. If /(z)-»Wo 
uniformly as along Stolz paths with end 
point at Zq and if the limit lim^^z^ff/fz)— Wq)/{z 
— Zq)) = D exists, we call D the angular deriva- 
tive of j”(z) atzg. In the case of the half-plane 
Re z > 0, the angular derivative at the point Zq 
on the imaginary axis is similarly defined. It 
should be noted that f(z) — Wg is replaced by 
l//(z) for Wq = 00 and l/(z - Zg) by z for Zq = oo. 
In the latter case, a Stolz path is a path con- 
tained in an angular domain |argz| ( < n/2) 
and tending to oo. The study of angular de- 
rivatives was initiated by G. Julia (1920) and 
J. Wolff (1926) and was further advanced by 
Caratheodory (1929) and E. Eandau and G. 
Varilon (1929). 

A fundamental theorem for angular deriva- 
tives can be stated as follows: If a holomorphic 
function f(z) in Re z > 0 satisfies Re/(z) ^ 0, 
there exists a constant c (0 ^ c < -f-oo) such that 
f(z)/z-^c and f'(zj->C uniformly as z^oo 
along every Stolz path. Moreover, the pth 
derivative of f(z) for an arbitrary positive 
p, denoted by D^f(z), has the property that 
2 ?-! £)P/(z)->c/r(2 —p) uniformly. Further- 
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more, the inequality Re f(z) ^ c Re 2 holds 
everywhere in Re z > 0. An analogous theorem 
is valid for the unit disk. 

An important problem in the theory of 
conformal mapping is to find some condition 
for a mapping w =/(z) of the unit disk (or half- 
plane) G onto a simply connected domain B to 
have a nonzero and finite angular derivative at 
a boundary point Zq, that is, the condition for 
conformality at the boundary point. Cara- 
theodory showed that a sufftcient condition is 
the existence of two circles that are mutually 
inscribed and circumscribed at the boundary 
point Wq —/{Zq) ^ CO of R and that lie inside 
and outside of B, respectively. Ahlfors later 
established a necessary and sufftcient condi- 
tion for the existence of the angular derivative 
by making use of his tdistortion theorem for a 
strip domain. The angular derivative was 
used by Wolff in his research on the iteration 
of conformal mappings. 
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Branching Processes 

A. General Remarks 

A branching process is a mathematical model 
for random motion of a family of particles 
each of which is in an isolated process of multi- 
plication and death. Examples of such ran- 
dom motions are population growth, miosis of 
genes, growth of the numbers of neutrons in an 
atomic chain reaction, and cascade showers of 
cosmic rays. The simplest and most fundamen- 
tal branching process is discussed in Section 

B. Eor a historical introduction to the study 
of branching processes Kendall [2] and 
Harris [3]. 



B. Galton-Watson Processes 



Although there is a similar process with con- 
tinuous parameter teR, we consider here 
only the case of a discrete time parameter 
{i = 0, 1,2,.). Suppose that we are given a 
family of particles of the same kind. Each 
member of the family splits into several par- 
ticles according to a given probability law 
independently of the other members and its 
own past history. Eet Z„ be the number of 
particles of the family at a moment (or gener- 
ation) n; then {Z„} gives rise to a iMarkov 
chain. This is called the Galton-Watson pro- 
cess. A precise mathematical description of the 
process is as follows: A Galton-Watson pro- 
cess is a Markov chain [Z„ n = 0, 1, . . . } on the 
nonnegative integers with ttransition proba- 
bility defined in terms of a given tprobability 
distribution k = 0, 1, }, ^ 0, 

by 



Py'~F[Z„ + , — ]|Z„ — t] 



p/‘ ifi>l, _/>0, 
(5ij if i = 0, j>0, 



where the probability distribution | p,^‘ k = 

0, 1, . } denotes the i-fold tconvolution of 
the probability distribution {pj, k = 0, 1, . . . } 
and dij is the tKronecker delta. Then p^ is 
interpreted as the probability that an object 
existing in the nth generation has k children in 
the (n -|- l)th generation. We assume that Zq = 

1 a.s. for the rest of this section. An impor- 
tant tool in the analysis of the process is the 
^generating function f(s) = |s| ^ 1. 

Denote the generating function P(Z„ = 
fc)s'‘ (a'< 1) of Z„ by f„(s). Then we have /o(s) = 

s, /i (s) = f{sl fi+jis)^fiifj{s)) {i,j = 0, 1, 2, . ), 

and the texpectation of Z„ is given by E{Z„) = 
m", where m= /'( 1) = E{Z]) is the expecta- 
tion of Z,. A Galton-Watson process is said 
to be subcritical, critical, or supercritical if 
m<l,= l,or>l, respectively. If we have Z„ = 
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0 for some n, then = Z„^_2 = . . = 0. The 
probability q = P (lim, Z = 0) is called the 
extinction probability. The case f^{s) = s is 
excluded in the following. The extinction pro- 
bability q of the process |Z„} is the smallest 
nonnegative solution of the equation s —f[s). 
It is 1 if 1 and < 1 if m> 1. Moreover, 
P(lim„Z„= oo)= 1 -q when m> 1, and hence 
the process {Z„} is ttransient. 

If we put W„ = Z^lm" when w < oo, 
gives rise to a tmartingale, and the limit W= 
lim„_,^ W„ exists with probability 1. If m> 

1, then the tmoment-generating function 

(p(s) = £(exp( -siy)) of W satisfies the Konigs- 
Schroder equation (p{ms) -f{(p{s)), Re s ^ 0. If 
m ^ 1, then P(Z„ = 0 for some n) = 1. The fol- 
lowing theorem on the conditional distribu- 
tion of Z„, given that Z„ 0, was first proved 
by Yaglom [6] under moment restrictions. If 
m<U there exist = lim, P{Z„ = k Z„^0), k = 
1,2 ,....Then {bk\k—i,2 }isaprobability 
distribution and its generating function g(s)= 
2^=1 |s| ^ 1, is the unique solution of 

the equation g{f{s)) = mg{s)+ 1 -m, |s| ^ 1, 
among generating functions vanishing at 0. 
Furthermore, when < 1, jbj< oo if und 
only if 1*1 Pj{j\ogj) < co (i.e., E[Z^ logZj] < 
oo). 

If m = 1 and = variance of Zj < oo, then 
lim„P[Z„/n> u| Z„ 7^0] =exp[ - 2 m/( 7^], u ^ 0. 
If = 00, then this result is still correct with 
the interpretation that the limit in the left- 
hand side equals 1. 

For further details regarding the theory of 
Galton-Watson processes - [3, S]. 



C. Multitype Galton-Watson Processes 

These are generalizations of the Galton- 
Watson process, involving k types of particles 
(k ^ 2), say, 7i, T 2 , , T^. Let p'(ri, r2, . , rj be 
the probability that a particle of type 7^ splits 
into r„ particles of type T^(m= \,2,. . ,k), and 
set /'{s,,S2,. ,.sJ = S,p'(r,,r2 ,. . , 

. . sjT The number of particles Z„ =(Z„\ Z„^, 

... ,Z*) at a moment n gives rise to a tMarkov 
chain over Z+ , and its transition probability 
is determined by p‘{r^, ^2, , rj as for Galton- 
Watson processes. Here Z+ denotes the space 
of all k-dimensional tlattice points whose 
components are nonnegative integers. We 
now define a mathematical model of the 
process mentioned above. A Markov chain 
{Z„ [ n = 0, 1 , . . . } on Z + is called a multi (k)- 
type Galton-Watson process if its transition 
probability is given by 

P(i, j) = P[Z„+i = j Z„ = 1] = coefficient of 

k k 

fls/' in n(/'(si,S2,...,s^))'' 

1=1 LI 



for i = (/i , I2 , .... Ifc), i = 0'i ,j2, • • • • Jk)eZ ’[ , n= 

0, 1,... and s = (si,S2 5 where = 

{s = (Si,S2, 

When m,y = 5/‘{l, , l)/dsj< 00 (ij= 1,2, 

. , k), the tconditional expectation of Z„+^, 
given Z„, is given as E[Z^+„ ZJ = Z^M\ 
where M is the matrix (mij). From now on we 
assume that niij < 00 (i, j = 1, 2, . , kj. A multi- 
type Galton-Watson process {Z„ n = 0, 1, . } 
is said to be singular if the generating func- 
tions S2, ... , Sjj), /= 1,2 ,..., k,arealllinear 

in Si, S 2 , , S|^ with no constant terms. We 
assume nonsingularity throughout. If there is 
a positive integer N SUCh that every compo- 
nent of is positive, the process Z„ is said 
to be positively regular. In this case, M has a 
positive ^eigenvalue A that is simple, and X> 

Ip I for all other eigenvalues p. The eigenvalue 
A plays the role of m in the case of the Galton- 
Watson process. For every i, \ ^i^k, set q’ = 
P[Z„ = 0 for some n|Zo = e,], where ei—(e],ej, 
...,ef), ej— dij. Then q' is called the extinction 
probability of the process {Z„ n = 0, 1, . } 
starting with a single particle of type 71; let 

q = {q^,q^, q''). Suppose that the process 

{Z„ n = 0, 1, . . . } is positively regular and not 
singular. Then q = (l, 1, . . . . 1) if 1, and 0^ 
^'< 1 (1 = 1,2, ...,k) if /1>1, and q satisfies the 
equation q = {f^{q),f^{q), ■ ,/'‘(q))- Further- 
more, the process {Z„ n = 0, 1, . . . } is ttran- 
sient, i.e., F[Z„ = j infinitely often] = 0 for any 
j ^ 0, j e Z + Moreover, if 2 > 1 , then W„ = 

ZJX'' converges with probability 1, and 

lim Z„/2" = V IT a.S., 

n-MX) 

where v denotes the positive left teigen vector 
V = (Vj , . , v^) for X of the matrix M, and W is 
a nonnegative trandom variable. P[ W> 0] > 

0 holds if and only if 

£[Z)'j log Z[']]< 00 fora 11 (1) 

Here Z)‘j denotes the number of type 7) par- 
ticles in the first generation for a process with 
Zq = e,-, I ^i^k. If condition (I) holds, the 
tmoment-generating functions of W, (p.{a) = 
£[exp[-alF] |Zo = eJ, i= 1,2, ...,/c, a>0, 
satisfy <p,(a)=/'((Pi (a/2), (pjic^/X ), . , %(a/2)), 
l^i^k. 

When A 1, a theorem analogous to Ya- 
glom’ s holds. For further information Ioffe 
and Spitzer [J. Math. Anal. Appl. 19 (1967)) 
and Athreya and Ney [S]. 

D. Markov Branching Processes 

The branching processes dealt with in Sections 
B and C are limited in the sense that gener- 
ation times are fixed. We now formulate a 
continuous-time version of branching pro- 
cesses. The treatment in this section is limited 
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to Markov processes that are extensions to 
continuous time of the Galton- Watson pro- 
cess. A Markov process Pj on Z+ is 
called a Markov branching process if the tran- 
sition probability P{t,iJ)= P.[X,=j] (iJeZ + , 
t ^ 0) satislies 

CO / ao \‘ 

lP(i,uV= IC'. i.;V 

j=o v=0 / 

for all (6Z + and 1. Then there exist 
a positive constant a and a probability 
distribution { k = 0, 2, 3, . } such that 
lim,|o( 1 ^P{t,hi))/t = iaiUeZ\sLnd 
lim,|o P{t, ij)/t = iapj^i+i if i - 1 , i The 

+Kolmogorov equations are 

^P{t,iJ)= -jaP{t,i,j) + a Y 
at 

(tforward equation) 
and 

^P(t,U)= -iaP{t,iJ) 
at 

ao 

+ ia X Pi-i+iP(tJJ) 
(tbackward equation). 

Then a and {Pi, k = 0, 2, 3, . } are interpreted 
as follows. An object existing at t has a proba- 
bility adt of dying in the time interval [f, t + dt> 
of length dt. If it dies at any time t, the proba- 
bilities are Po, P 2 ^ Pi, ■ replaced by 

0,2,3,. objects. When P{t, \, k) < 1, the 
number of particles attains -t-co in a finite time 
interval with positive probability. In order that 
Hk=o i, k) = 1 for all iG ZV , it is necessary 
and sufticient that for each 1 >fi>0, 

-w)“' du = 00 , where f(s)= jjt 1 PjS^ for 
Isl^l. 

Most of the theory of the Gal ton- Watson 
process carries over to the continuous case. In 
particular, concerning the thmit distribution of 
X„ more precise results have been obtained. 
(For detailed discussion of Markov branching 
processes [3,8].) 

Some work has also been done on the 
asymptotic behavior for temporally inhomo- 
geneous cases of multitype Markov branching 
processes. 



E. Branching Markov Processes 

We now give a systematic treatment of gen- 
eral branching processes in which an object is 
characterized by a parameter x in a tcompact 
'’’Hausdorff space S with a countable open 
'''base. The results are formulated in terms of 
strong Markov processes. Set S° = {d}, where 



d is an extra point. For every positive integer 
n, let S" = S X S X ... X S/~, where ~ isthe 

n 

tequi valence relation given by the tpermu- 
tations of coordinates. S” is compact with 
respect to the '^'quotient topology, and its 
ttopological direct sum S = S" is tlocally 
compact. Let § = S U { A} be the one-point 
foompactification of S. A point x e § is de- 
noted by x = [x,,X 2 , ... ,x„] if x is the tequiv- 
alence class containing (Xj, X 2 ,. . , x„gS"). 

X ^ [x] is denoted simply by x. Set B(S) = 

{J\f a bounded Borel measurable function on 
S}, B*(S)={/eB(S)|||/||<l},C(S)={/|/a 
bounded continuous function on S}, and 
C*(S) = C(S) n B=!=(S). B(§), C(S), . are defined 
similarly. For / gB*(S'), define/ gB(§) by /(x)== 
1 ifx = o,=ri'’=i/(Xj)ifx-[xi,X2,...,x„]G 
S", and = 0 if X =A. A '''strong Markov pro- 
cess X = {X„ P,} on § is called a branching 
Markov process if its semigroup {T,} (- 261 
Markov Processes) satisfies 

t/(x)=(t;7J(x) for every xe§ and 



where for i^GBjS), is the restriction of 
g on S [10-12]. We set(i) Z, = n if X,e5", 
n = 0, 1, ..., 00 , where S”= {A}, and (ii) T = 
inf{ t Z, /Zq). Z, and t are called the number 
of particles and the first splitting time, respec- 
tively. Let X = {X(, be a strong Markov 
process on S. We assume that the semigroup 
{H,} of X is a strongly continuous semigroup 
on C(S). Let (p, be a nonnegative continuous 
^additive functional of X. Let p,(x), u = 0, 2, 

3,. be a sequence of nonnegative functions 
in B(S) such that D”=o,„ 7^1 P-(x) = T Con- 
sider a sequence of stochastic kernels ;r„(x, dy), 
n — 0,2,3,. . . . on S X S", i.e., for fixed A g ® (S"), 
7 t„(x, A ) is a measurable function of x, and 
for fixed x it is a probability on (S", 
where ®(S") is the ttopological cr-field on S”. 
Set 7t(x, D) = 1 Pn{x)n„{x, D f1 S") for 

Z)€®(S), xeS, where S(S) is the topological (T- 
field on S. Then there exists a unique branch- 
ing Markov process X = {X„ P,} on § satisfy- 
ing the following: (i) The Markov process {X,, 

1 < T, P;c} on S obtained from X by shorten- 
ing its tlifetime is equivalent to the Markov 
process obtained by tkilling X at a rate dtPf 
(- 261 Markov Processes), (ii) For any De 
S(S) and A>0, £^[exp[-2i]; X,g£> 1X,_]- 
£x[exp[-/lT]|X,_]7r(X,^,/)) a.S. on {t<co} 
for every x eS. (hi) With probability one, X, = 
A for all t ^ lim„|„ r„, where t„ denotes the nth 
splitting time [10]. This process X = {X„ P,} is 
referred to as the |x„ 7r}-branching Markov 
process. 

Example 1. Let 5 = {ai,a 2 , ...,0^}; then S can 
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be identified with Z + and S with ^ = Z + U 
{ oo}. Therefore a branching Markov process X 
on § is a Markov process on ^ such that hs 
transition probability P{t, i, j), i, jeZ + satisfies 

I F(t, i,0'i J 2 ,. st 

-n z jk)) 

(=1 




P{t, GO, 00 ) = 1 for i = (t\ , I 2 , . . . . 4) e Z\ , 

s = {s,,S2,...,sJeDfc. 

Then the process X is called a multi (Q-type 
Markov branching process. 

Example 2. Let S = [0, 00 ], k(x) be a non- 
negative locally integrable function on [0, 00 ), 
and p,(x), n = 0, I, , be a sequence of non- 
negative measurable functions on [0, 00 ) such 
that ^ PiM = 0. We extend 

k(x) and p,(x) (n as functions on [0, 00 ] by 
setting them to 0 at 00 , and set p^( 00 ) = 1- 
Define a stochastic kernel 7i(x, dy) on S x S by 




Let X = {x„ Px} be the uniform motion on S 
with the semigroup {H,} such that for /eC(5), 
HJ{x) = f{x + t) if xe[0, 00 ) and = /(oo) if 
X = 00. Set (Pt = ^Q /c(Xj) ds. Then we have a 

(p„ n}-branching Markov process, and we 
call it an agc-dcpcndcnt branching process. 

We now discuss some fundamental prop- 
erties of {x„ (pf, n} -branching Markov pro- 
cesses. For the sake of simplicity we assume 
the following: (i) (p, = Jo k{x^)ds,keC^{S)^ 
{/£C(S)|/^0},(ii) p„(x)eC+(S)and 



for Z)eS(S"). Then {TJ is a strongly con- 
tinuous semigroup on C,(S) = |/gC(S) 
limx^A/(’^) = 0}- Let A be the tinfmitesimal 
generator of {//,} (- 261 Markov Processes) 
and ^{A) be the domain of A. If / e^(A)C] 
C*(S), then u(t,x) = {TJ)^g{x)eS^{A), and it 
satisfies du/dt = Au + k{F{-;u) u), u(0 -f , x) = 

/(x), where F{x-,g) = y{y)n{x,dy) [10], 

Let be the space of all nonnegative 
IRadon measures on S and be the sub- 
space of consisting of all probability Radon 
measures on S endowed with the topology of 
tweak convergence. Set M = {<5}, where b 

is an extra point and [0, 00 ] x Jl^. Define 



a mapping p:..# pQ.,AQ)eJi by 
p(I, 2 q) = aXq if 2 < 00 and = (5 if A = 00 , and 
define the topology of ./# as the strongest of all 
the topologies rendering p continuous. Set 
C++(S) = {/eC(S)|/>0}, md for /eC^+(S), 
detine a function on ^ by Oj(A) = 
exp [ — (A,/)] if yi,e.y# and =0 if 2 = ^, where 
(2, /) = yf{x)X(dx). Consider a mapping ij/ : S-> 
^ defined by ij/{x) = 0 if x = p, = Xfc=i if 
x = [xi,X 2 , ...,xJeS”, and =S ifx=A. Let 
J^p be the subspace of M consisting of al 1 non- 
negative integer-valued Radon measures on S, 
i.e., ^p=|(/'(x)|xe§}. We denote by ^ the 
space of all right-continuous mappings v: 

[0, x'S)^Jip whose discontinuities are at most 
of the first kind and such that -b for f ^ 

S if Vj = 6. Consider a branching Markov 
process {X„ P^}, and let P^ be the tprob- 
ability law on § of the stochastic process 
{v, = i//(X,);0^ t< go} with X,=x, where p^ = 
Then the branching property can be 
rewritten as follows: For every t^O, /eC'^'^(S) 
and Ml, M 2 6'^p,£„,+^,[exp[-K,/)]] = 

£K,[exp[-(v„/)]]£^Jexp[-(v„/)]] (- 
Silverstein, Z. Wahrscheinlichkeitstheorie und 
Verw. Gebiete, 9 (1968); for a treatment of 
branching processes in the framework of 
tmartingale problems Holley and Stroock, 
Publ.Res. Inst. Math. ScL, 14 (1978)). 

There are analogous processes which pre- 
serve the basic features of the branching 
property. A Markov process { P^} on ^ 

with 5 as a 4trap is called a continuous-state 
branching process if it satisfies the property 
that for every t>0 , /gC^^(S), and p^, 
^^,+/.,[exp[-(M„/)]] = £;.,[exp[-(/i„/)]] 
Fn,[exp[ -(Mt,/)]]- This concept was intro- 
duced by Jiiina [ 15] for some special cases. 

For further information -i Jifina (Third 
Prague Conf., 1964), Lamperti (Bull. Amer. 

Math. Soc., 73 (1967)), Watanabe (J.Math. 

Kyoto Univ., 8 (1968)), and Fujimagari and 
Motoo (Kodai Math. Sent. Rep., 23 (1971)). 
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A. General Remarks 

R. Brown, an English botanist, observed in 
1827 that the minute particles comprising the 
pollen of plants, when suspended in water, 
exhibit peculiarly erratic movements [1], The 
physical explanation of this phenomenon is 
that haphazard impulses are given to the sus- 
pended particles by collisions with molecules 
of the fluid. Eet X(t) be the x-coordinate of a 
particle at time t. Then X(t) is treated as a 
trandom variable, and the distribution of X(t) 
-X(s) is a tnormal distribution ^(0,1)11 — 51), 
with +mean 0 and t variance D t — s |, where D 
is a positive constant. To be more exact, such 
a family of random variables {X(t)} is now 
considered as the family of random variables 
determining a tstochastic process. Various 
aspects of the theory were analyzed by A. 
Einstein [2], E. Bachelier, N. Wiener, P. Eevy, 
and others. 



B. Wiener Processes 

Eet T be the real line or a subinterval. A 
tstochastic process {X(t)},„ defined on a 
■'■probability space (Q, S, P) is called a Wiener 
process on if it satisfies the following three 
conditions: (1) X(t,m)eR‘^(f6 T,coeQ). (2) The 
R‘^-valued random variables 2f(tJ, 2((t^) — 
X{tj-i), j = 2, 3, , n, are independent for 
any t^<t 2 < ■■■ <t„, tjS T {j= 1,2, ...,n), where 
n is an arbitrary positive integer. (3) If 2f-(t) is 
the ith component of the vector x(t), then the 
{2f((t)},gj- (I ^ d) are independent as sto- 
chastic processes and every increment Y,(t) 

— 2f,(s) is normally distributed with mean 0 
and variance t — sj. A Wiener process is also 
called a Brownian motion. A Wiener process 
is a ■'■temporary homogeneous additive pro- 
cess. A tseparable Wiener process has continu- 
ous paths with probability 1. Conversely, if 
{A'(f)},gj. is a temporary homogeneous addi- 
tive process on R^ whose tsample function is 
continuous with probability 1 and the incre- 
ment X(t) ™ X(s) has mean 0 and variance 
t — s|, then it is a Wiener process (- 407 Sto- 
chastic Processes, 406 Stochastic Differential 
Equations). Eet be an tincreasing 

family of (7-subfields of 23. We assume that 
is right continuous. A d-dimensional 
continuous process X=(X(t)),„„ is called a 
d-dimensional {.^[-Brownian motion if it is 
■'■{ -adapted and satisfies 

E[exp[i(lX(t)-X(s))]l^J 

= exp[-(t-s)Kp/2] 

a.S. for every e R'', 0 < s < t, 

where |<^| denotes the tnorm of ^eR'* [9]. Then 
X satisfies conditions (l)-(3) mentioned above, 
and hence it is a Brownian motion on R'* (- 
406 Stochastic Differential Equations). 

Eet {X^(a))}(/c = 0, 1, . ..) be a sequence of 
independent random variables defined on a 
probability space (Q, ©, P) such that each X^ 
has the normal distribution (V(0, 1) with mean 
0 and variance 1. Then the series 

-^Xo(oj)+ I^YX„{cof-^, tG[0,7r] = r, 

converges uniformly in t G T with probability 1 , 
and its limit, denoted by X{t, w), is a Wiener 
process [3,10]. 

Eet W^= C( [0, oo)->R‘') be the space of all 
continuous functions w: [0, ool^R** endowed 
with the topology of uniform convergence on 
finite intervals and ®( W**) be the ttopological 
(T-field. Eet X=(X(t)),„„ be a Brownian 
motion on R*^ and fl be the probability law of 
X(0). The probability law of the Brownian 
motion X on ( S( W^)) is called the d- 
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dimensional Wiener measure with the initial 
distribution [9, 1 1 , 1 2]. For a probability n 
on (R^, ©(R*^)) the d-dimensional Wiener mea- 
sure with the initial distribution ^ is a prob- 
ability on ( W‘, 33 ( W^)) characterized by the 
property that for every 0 = to < ti < < t„ and 

[w(f i) G zl j , w(t 2 ) G ZI 2 , . . . , w(t„) G 




X exp 






2(0- 0-1) _ 



^{dxQ)dxi ...dx„, 



where ®(R‘*) denotes the topological cr-field on 
Rd. 



C. Brownian Motion as a Diffusion Process 

In terms of the general framework of '•'Markov 
processes, Brownian motion is a typical exam- 
ple of a diffusion process (-^115 Diffusion Pro- 
cesses). Let X = {X,(w), R'^,P^}be a continuous 
Markov process on R*^ 26 1 Markov Pro- 

cesses) with the ttransition probability 

P(t,x,B)=| X -_^jx-y]^dy, 

f>0, xeR^ Be»(R‘'). 

For each xe R“^, the process (X,(w), P^} is a 
Wiener process in the sense mentioned above. 
The Markov process X, which is a collection 
of Wiener processes {X„ P^} starting at x, is 
said to be a d-dimensional Brownian motion. A 
d-dimensional Brownian motion possesses the 
tstrong Markov property. Let ® be the tgen- 
erator of the tsemigroup corresponding to 
X. A bounded uniformly continuous function 
f defined on R** belongs to the domain of © if 
its partial derivatives df/dx^ and d^fjdXidXj, i,j 
= 1,2, . d, are bounded and uniformly con- 
tinuous. For such a function f we have ®/(x) 

= (l/2)A/(x), where A is the tLaplacian[8, 12]. 

D. Brownian Motions and Potentials 

For a > 0, the function Gj,(x) detined by 

^ e ~^‘{2nt) exp ^ 1 X p ^ dt 

is said to be the cc-order Green’s function. 

Since Brownian motion is tnonrecurrent for 
d ^ 3, the limit Go+(x) = lim^io Gct(-x), xe R*^, 
exists for ^3, and G,-i-(x) is equal to K,(x) = 
(F(d/2 — l)/47r‘^^^)|x which is the kernel 
for the '•'Newtonian potentials. Brownian 
motion is ■•■ recurrent when d 2 and G,-i-(x) = 
-boo, xeR*'. In this case, K,(x) is defined by 



Ko(x) =lim,^o(G,(x) -GJxo)), andKo(x) = 
(l/27!:)log{l/|x|), when d = 2 and |Xol= L 
This is the kernel for the '•'logarithmic poten- 
tials. When d = 1 and Xq = 0, K,(x) = — |xl/2. 
Using this relationship, we can express many 
concepts of classical potential theory in an 
elegant form in probability language. Let 
X = (X(t), P^} be the d-dimensional Brown- 
ian motion. Given a set A , set cr^ = inf{f t > 

0, 2f(t) £ A }, where the infimum over the 
empty set is understood to be -foo. Then is 
called the thitting time of X for the set A (-^ 
261 Markov Processes). For a Green domain 
(i.e., a domain which is a tQreen space) D in 
R“^ (d > 2), set 2g°{t, x, y)dy = P^{X{t)edy, 
t), x, ye D . Then the right-hand side of this 
equation is the ttransition probability of the 
Brownian motion on D with the tabsorbing 
barrier dD. Then G®(x, y) = Jq 3 ®(t, x, y)dt is 
tGreen’s function of D . If P is a compact sub- 
set of a Green domain D or an open subset 
with compact closure Bcz D, then the hitting 
probability p(x) = PxiOg < ^^do) *^be equilib- 
rium potential of B relative to D . 

Suppose A is an tanalytic subset of R**. 
tBlumenthal’s O-l law implies that Px(o^ = 0) 

= 1 or 0. The point x is said to be regular for 
A if this probability is 1 and irregular for A 
otherwise. Let B be a compact subset in R'^ 

(d ^ 2). Then tWiener’s test 120 Dirichlet 
Problem) states that x is regular or irregular 
for B according as the following series diverge 
or converge: 

00 

k=l 

tkC(B,), d^2, 

k = l 

where C(B*) is the '•'Newtonian capacity (the 
logarithmic capacity relative to a bounded 
domain when d = 2) of the set B,^ = {/ 
|y-x|<2~'‘}riB [12]. Suppose!) is a bounded 
domain in R'^ (d ^ 2). A point x e dD is regular 
or irregular for —D according as x is regu- 

lar or irregular in the sense of the tDirichlet 
problem for D . Given a continuous boundary 
function / on dD, u(x) = Ex{f(X{G^j))) is the 
solution of the generalized Dirichlet problem 
for D . Given xeD, the distribution h{x, B) = 
Px{X{a^,p)EB) (Be33(dD)) used in the solution 
u(x) = dy) of the generalized Diri- 

chlet problem is the tharmonic measure of dD 
as viewed from x. 



E. 1-Dimensional Brownian Motion 

In his monograph [6], P. Levy gave a pro- 
found description of the fine structure of the 
individual 1 -dimensional Brownian path 
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{X(t,0))},^o- Let us set 

= the hitting time to the point a in R’, 
min X(s, co), 

M{t,o})= max J6(s,o>), 
yo(r,co) = |J^(t,m)|, 

tKCToiw), 
and 

|A:(t,a)), t<Goi(o), 

2 ’ ^ ~ |x(t, t^o-o(w). 



(5) The diffusion process {X(t, m), 0 ^ 1< Gq, P^} 
(x e (0, oo)) obtained from a 1 -dimensional 
Brownian motion by shortening its tlifetime is 
called a Brownian motion on (0, oo) with an 
'^'absorbing barrier at the origin, and its tran- 
sition probability is given by 

P{t,x, B) 

r 1 

J B yjlnt 

Be©([0, oo)), t>0, xe( 0, oo). 

The arc sin law is valid for many functionals 
of 1-dimensional Brownian motion. For 
example, 



Then we have (1) PQ(M(t)>a) = 2pQ{M{t)> 
a,X{t)>a) = 2 Pq{M (t) > a, X (f ) <a) = 

Fo(|X(t)| >a) (reflection principle of D.Andre), 
(2) The stochastic process {a^, 0^ a < oo, Pq} 
is a tone-sided stable process with exponent 
1/2, that is, it is additive and homogeneous 
with the law 



’ b-a 

— J 0 .yjlns 






0^a<b, t^O. 

(3) Let (p~'(t,co) be the right continuous in- 
verse function of 



(pit, 0)) — 



Z[0,oo)(A'(5, oj)) ds. 



where Z[o,oo)( ') L the tindicator function of the 
interval [0, oo). Set Y^{t,(o)^X{(p'^it,Q)),oj). 
Then the four processes { }^(t, w), 0^1 < oo, P^} 
(xg[ 0, oo), 0^/<3) on [0, oo) have the same 
probability law. Each of them is a diffusion 
process with transition probability 



P{t,x,B) 



1 



ie 



~\x-y,\^l2t 



+ e 



-|x . 



^^') dy, 



f*o I Zto, 00 (s. w)) ds < ^ arc sin 

0<d^t, 

^o(L(o>)^5) = (2/7r)arcsiny^, 0<s<t, 

where T,(a)) = sup{s|X(s,m) = 0, 

The visiting set ^{(X))~ {t) X(t, m) = 0} of a 
Brownian path is a ttotally disconnected 
set. Its tLebesgue measure is 0, and the 
tHausdorff-Besikovich dimension number of 
^{(o) is 1/2. 

Consider a 1 -dimensional Wiener process 
{X(r)}Q^,<^ starting at the origin. Let us set 

fO t = 0, 

^^^^“|tX(l/f) t>0, 

and 

B2{t) = cX{t/c^), f^O, (oO). 

Then the stochastic processes {Fj(f))o:g,<oo 
and {B 2 ( 0 }o<t<oo likewise 1 -dimensional 
Wiener processes starting at the origin. Hence 
the properties of the Wiener process starting at 
the origin in a neighborhood of r = 0 (t = oo) 
can be obtained from those in a neighborhood 
of t = oo (t = 0). 




F. d-Dimensional Brownian Motion 



BgS([ 0, oo)), t>0, xg[ 0, oo), 

and is said to be a Brownian motion on [0, oo) 
with a treflecting barrier at the origin. (4) As a 
consequence of (3), if X(0, co) = 0 a.S., then, for 
fxed t, M(t), -m(t), and l/(t) (0 < i < 3) have a 
common distribution. For example, 

Po(M(t) ^ a) = Po(a, ^ t) = 2Po(X(t) > a) 

a>0, 

Po(X (r) 6 da, M{t) e db) 




Almost all paths of d-dimensional Brown- 
ian motion are continuous but are not of 
tbounded variation on any finite interval. 
Accordingly, they cannot have lengths. A 
positive, continuous, increasing function (p 
defined on [(q, oo) with to > 0 is said to belong 
to the upper (lower) class with respect to local 
continuity if Po((inf{t ||X(t,a))l>^/t ^(l/t), t> 
0}) > 0) = 1 (0). Kolmogorov’s test states that 
(p belongs to the upper class or to the lower 
class with respect to local continuity according 
as 




[2h — a)e 



0 ^ a < ^. 



00 

r\(p{t)fe-‘^"^‘^i^dt 

*0 
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converges or diverges. For example, 

9(f) = (21og(2) t+(d + 2)log(3,t-f 21og(4)t+ . 

+ 21og(„_i,t + (2 + (5)log(„)t)^/2 

belongs to the upper or lower class with re- 
spect to local continuity according as <5 > 0 or 
(5^0, where log(„, t = log log(„ _ i , t and log, i > t = 
log t. The law of the iterated logarithm for 1- 
dimensional Brownian motion, 



Pq imsup 



|2f(t, oj) — 2f(s,o))| 

^ y2|t-s|loglogl/|t-s| 






is a special case of this example. For the case d 
= 1, consider the space-time Brownian motion 
{{-t,Xit)),Po}. Set D^^{{s,x)\ -1/to^s^O, 
<p( -1/s) ^x<oo}. Then 0=(0,0) is a 
regular point of or an irregular point of 
for the space-time Brownian motion accord- 
ing as (p belongs to the lower or upper class 
with respect to local continuity. (- 261 Mar- 
kov Processes). Thus Kolmogorov’s test is the 
Wiener test for space-time Brownian motion. 
Let \j/ he a positive, continuous decreasing 
function defined on [f^, oo) with > 0. Then 



^^0 



inf{t| | 2 f(t, m)| < 







belongs to the upper or lower class with re- 
spect to uniform continuity according as () > 0 
or ^ ^ 0. The following theorem on the uni- 
form continuity of 1 -dimensional Brownian 
motion is a special case of this criterion: 



[limsup 



\X{t, m)-2f(5,co)| 
y 2 |r-s|logl/|t-s| 



= 1 . 



Now we state some other properties of 
Brownian paths. Let A be a set of zero touter 
capacity in (d ^ 2). Then P^{X{t)e A for 
some f > 0) = 0 for any x e R**. Let A be a plane 
set with positive tinner capacity. Then P^iXit) 
G A for intrnitely many t larger than any given 
S > 0) = 1 for any x g R^. With probability 1, 
the Lebesgue measure of the set {2f(t, co) |0 
t < oo} in R** is zero for d ^ 2. With proba- 
bility 1 this set is everywhere dense in R^ 
when d = 2 and nowhere dense in R*^ when 
d ^3 [ 13, 15, 17]. Almost all 2-dimensional 
Brownian paths have k-fold multiple points 
for any integer k ^ 2. In the 3-dimensional 
case, almost all paths have tdouble points but 
cannot have any triple point. In the d( >4)- 
dimensional case, almost all paths have no 
double point [18-20]. 



=0 or 1 . 
We have 



G. Ito’s Formula and Brownian Local Time 



-{ip{t)Y = 00 or < 00 when d ^ 3, 



and 



'oc 

J“o 



dt 

t\\ogilj(t)\ 



= 00 or < 00 



when d = 2. 



To describe uniform continuity of a path on 
the interval [ 0 , 1], take a positive, continuous 
increasing function ij/ defined on [t^, oo) with 
to > 0 and set (p{t) = ^/t i/t( 1/t). Then ijj is said 
to belong to the upper class with respect to 
uniform continuity if almost all paths X{t, to) 
(0^ t ^ 1) satisfy the 'tLipschitz condition rela- 
tive to (p, that is, for almost all 0) there exists 
an E(w) > 0 such that 0 < t-s\< fi(m) implies 
I J(t,ft»)- J(s,co)|<(p(|t-s|). And ij/ is said to 
belong to the lower class with respect to uni- 
form continuity if almost all paths X(t, to) 

(0 ^ t ^ 1) do not satisfy the Lipschitz condition 
relative to (p. Then ^ belongs to the upper or 
lower class with respect to uniform continuity 
according as 



converges or diverges [16]. For example, 
(// (t) = (2 log t -r(d + 4)log(2) t 3 - 2 log, 3 , t + . 
+ 2 log(„-i) t + (2 -I- ^)log(„, 



The following formula of Ito [21] is of funda- 
mental importance in the theory of stochastic 
processes. 

Ito’s formula. Let X={X(t) =(X ,(t), Xjit), 
...,X^(t)), Px} be a d-dimensional Brownian 
motion and /(x) =/(x, , X 2 , . . , xj be a C^- 
function delrned on R*^. Let us set <p(t) = 
/(^W)-/{2£^(0)). Then {(p(t)},^o ^ continu- 

ous tadditive functional of the Brownian mo- 
tion X (- 261 Markov Processes), and we 
have 



/(X(t))-/(X( 0 ))=^ 



dx 



-f]{X{s))dX,{s) 



{Af)iX{s))ds, t^O, 



where the first term in the right-hand side is a 
tstochastic integral with respect to the Brown- 
ian motion X [22] (- 406 Stochastic Differ- 
ential Equations). In particular, if / is a C^- 
function defined on R^ Ito’ 's formula can be 
rewritten as 



f{X{t}) -f{X{0}) . 



1 

1 = 1 



^’(A/)(X(s))o«,(s), 



t^o. 



where the right-hand side is a stochastic inte- 
gral of the tStratonovich type with respect to 
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the Brownian motion X [9] (- 406 Stochastic 
Differential Equations). 

Consider a 1 -dimensional Brownian motion 
X = (X(t), P^}- By the local time or the sojourn 
time density of X we mean a family of non- 
negative random variables {(p(t,x)} (te [0, oo), 
xeR*) such that, with probability 1, the fol- 
lowing holds: 

(a) the mapping [0, oo)x 3(f,x)i-^’(p(t,x)eR‘ 
is continuous, 

(b) for every Borel subset A of R’ and f ^ 0, 



0 JA 



(p{t, x)dx. 



The notion of the local time of Brownian 
motion was first introduced by Levy [6]. The 
family of random variables { (p{t, x)} (t ^ 0, 

XE R*) defmed by 

<p(t,x) = (X(t)-x)^-(X(0)-x) + 



- Xi.,a^{s))dX{s) 

J(0 



satisties properties (a) and (b) mentioned above 
and hence it is a local time [9, 22]. Here a-i- is 
the bigger of a and 0. It is clear that the local 
time q)(t, x) of X is given by 

1 f' 

(p{t,x) = hm — I Xix-s,x+c)(X (s)) ds 

for every xeR^t^O. 

Furthermore, we have 

pj Itosup 

\h-a=5l0.fl<f. ,y<5 log 1/(5 

^ 2.Vmax <p{t,a) = 1 , 

aeR‘ / 

and 



{ \(pit,S)-(pit,0)\ 

Fq hmsup — , 

V >510 y/(51oglog \/6 



=1- 



Levy [6] studied the fine structure of the local 
time. Define the visiting sets = {t | 

=0} and .2^“ = {t| yi(t,cn) = 0}, where yo(t,m) 
and Y, (t, co) are the stochastic processes de- 
fined in Section E above. Then we have 



^0 





[the number of flat stretches of 



M{s,o)) of length =M{t,o)), 




Since the two diffusion processes X ^ = 
{YM^^t<oo,P^} SLudX ={Y,{t),0^t< 

00 , P^} define the same probability law on 
W( [0, oo)), there exists a functional co) of 
X corresponding to M(t, co) of X ”. Here, 



W{ [0, 00 )) denotes the space of all continuous 
functions w: [0, co)^[0,oo). The flat stretches 
of the graph of co) are the open intervals 
(n^ 1) such that = [0, oo) — 

Then we have 

/Tce 

Fn hm / — X [the number of intervals 

^ [0, t) of length ^ a] = 9 ^ (t, co), t ^ 0^ = 1 . 
Furthermore, we have 



Fg lim / — X [the total length of the inter- 
\4o y 2e 

vals c [0, t) of length <e] = (^ w), 

t^oVi 

and 



Fgj lim (2a) ^ 



X[o.oO"o(s,(u))ds 



= (p (t,co),t^0 )= 1- 



Let dg(t, co) be the number of times that the 
reflecting Brownian path yg(s, Co) crosses down 
from £ > 0 to 0 before time t. Then 



"off 



Fo|^lini£d£(t><^) = </>^(t»to)H^O )=1. 



H. Flows and Random Distributions 

The flow derived from the 1 -dimensional 
Wiener process [X,} _j^<«(j( 3 is tKolmogorov’s 
flow. It has tmixing properties of all orders 
and is tergodic (- 136 Ergodic Theory, 39.5 
Stationary Processes). The tstationary process 
with independent values at every point corre- 
sponding to the ^characteristic functional 

exp(-^| cp{tfdt 

on the tSchwartz space Y defines the same 
probability law with the stationary process 
obtained by differentiation of the Wiener 
process in the tdistribution sense [23] (- 
395 Stationary Processes, 407 Stochastic 
Processes). 

I. Generalizations of Brownian Motion 

In addition to the Brownian motion described 
above, there are several stochastic processes 
that are also called Brownian motion. A Gauss- 
ian system {X(a)}agRW defined on a proba- 
bility space (Q, ®, P ) is said to be a Brownian 
motion with an Al-dimensional time parameter 
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if(i) £(.V^(a)) = 0,(ii) E{X{a)X{b))=^{\a\ + 

|^)| — |a — /)|), (iii) P(A!^(0) = 0) = 1. Leta*bethe 
tspherical inversion of a e with respect to 
the unit sphere. Set X*{a) = \a\X(a*) {a^O) 
and X*(0) = 0. Then {A'*(rt)}aeRiv defines the 
same probability law with a Brownian mo- 
tion with an N-dimensional time parameter. 
Almost all paths of a Brownian motion with 
an A-dimensional time parameter are con- 
tinuous. A positive, continuous, increasing 
function (p defined on [Iq, ao) with Jq >0 is said 
to belong to the upper (lower) class with re- 
spect to local continuity if the probability 
that the closure of the set {a |X(fl, w)| 

(p{ l/|fl|), |fl| >0} contains the origin 0 is equal 
to 0 (1). Then (p belongs to the upper or lower 
class with respect to local continuity according 
as the integral 



(For general information about Brownian 
motion with a multidimensional time para- 
meter P. Levy [6] and H. P. McKean, 
Theory of Prob. Appl. 8 (1963).) 

Let X (t) be a Wiener process. L. S. Omstein 
and G. E. Uhlenbeck based their investigation 
of the irregular movements of small particles 
immersed in a liquid on Langevin’s equation 

dU{t)=^-oiU{t)dt + lidX(t), 

where U (t) is the velocity of a particle. The 
first term on the right-hand side of this tsto- 
chastic differential equation (- 406 Stochastic 
Differential Equations) is due to frictional 
resistance or its analog, which is thought to be 
proportional to the velocity. The second term 
represents random external force. The station- 
ary solution of this equation is given by 



J'o ^ 

converges or diverges. Eor example, 

(p{t) = (2 log(2) r + (2N + 1 )!og(3) t + 2 log,^) t + 

+ 21og(„_i,t + (2 + (5)log(„)t)i/2 

belongs to the upper or lower class with re- 
spect to local continuity according as ^ > 0 or 
^ ^ 0. As a special case, we have 

pfitosup— J£lfd!==i)=i. 

V .-0 V2|a|loglogl/|a| / 

Take a positive, continuous increasing func- 
tion (p defined on [to> oo) with t© > and set 
ij/{t) = (p{\lt). If almost all paths X{a, o>) 

(|a| ^ 1) satisfy the Eipschitz condition relative 
to then (p is said to belong to the upper 
class with respect to uniform continuity. It is 
said to belong to the lower class with respect 
to uniform continuity if, with probability 1, 
these paths do not satisfy the Eipschitz con- 
dition relative to Then cp belongs to the 
upper or lower class with respect to uniform 
continuity according as 

Jto 

converges or diverges (T. Sirao, Nagoya Math. 
L, 17 (I960)). Eor example. 



(pit) = (2Nlog 1-f {4N + l)log( 2 ) 1+ 2 log, 3 ) t 

+ . . . + 2 log(„_i) t T (2 + 5)log,„, 

belongs to the upper or lower class with re- 
spect to uniform continuity according as ^ > 0 
or ^ 0. As a special case. 



\X(a,m)-Xikm)\ 



lim sup 

a-ft|-.o.|a|,|b|<i y/2Af|a — h|log l/\a — b\ 



:^1 =1 



U{t) = I dX(»). 

J -00 

The stochastic process {L(t)}_oo<,<^ is a 
stationary tGaussian Markov process with 
covariance function y{t) = (/5^/2a)c“®''L This 
process is called Ornstein-Uhlenheck Brownian 
motion [24] (-^ 176 Gaussian Processes). 

Brownian motion has been defined on state 
spaces that are tRiemannian spaces or -i-Eie 
groups, and its properties are being investi- 
gated (^ 5 Additive Processes, 115 Diffusion 
Processes). 
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46 (X.32) 

Calculus of Variations 

A. General Remarks 

One of the first objects of differential calculus 
was to systematize the theory of the extrema 
of functions of a linite number of indepen- 
dent variables. In the calculus of variations 
we consider functionals (i.e., real-valued or 
complex-valued functions delined on a func- 
tion space {u} consisting of functions delined 
on a certain domain B), originally so named 
by J. Hadamard. Such a functional is denoted 
by J\u B] (or simply 7[u]). A function u, 
which is considered an independent variable of 
the functional, is called an argument function 
(or admissible function). 

Concrete examples of functionals are the 
length of a curve y =/(x) and the area of a 
surface z = z(x, y), which are expressed by 

^[y]= [ Vl+(/(x))^ dx 
Jxo 

and 



tions. On the other hand, there are so-called 
conditional problems in tbe calculus of vari- 
ations. A typical example of this sort is the 
tisoperimetric problem, i.e., the determination 
of the curve that bounds a domain with maxi- 
mal area among all curves on a plane with 
given length. In general, an extremal problem 
of a functional under a subsidiary condition 
that the value of another given functional 
remain h'xed is called a generalized isoperi- 
metric problem. In addition to these, there are 
Lagrange’s problem, in which a finiteness 
condition is imposed, and Hilbert’s problem, in 
which a condition consisting of differential 
equations is imposed. 

The birth of the calculus of variations was 
almost simultaneous with that of differential 
and integral calculus. Johann Bernoulli, Jakob 
Bernoulli, L. Euler, and others had dealt with 
several concrete problems of the calculus of 
variations when in 1760, J. L. Lagrange intro- 
duced a general method of dealing with vari- 
ational problems connected with mechanics. 
Then an equation bearing the name of Euler 
or Lagrange was introduced. 



S[z] = 



yj\ +zl + Zy dx dy. 



B. Euler’s Equation 



respectively. Eurthermore, consider a curve y = 
y(x) connecting two given points (x„ y^) and 
(Xj, y,) with y, >y,. The time in which a parti- 
cle slides down without friction from (x„ y,,) 
to (xi, Pi) along this curve under constant 
gravity acting in the direction of the positive 
y-axis is expressed by the functional 






'^1 



Ji^+y'^)liy-yo)dx, 



k constant. 



( 1 ) 



Let f ( . . . ) be a known real- valued function 
that depends on a certain number of indepen- 
dent variables and on argument functions of 
these variables, together with derivatives of 
these functions up to a certain order. Then a 
typical problem in the calculus of variations is 
formulated as an extremal problem of a func- 
tional that is expressed by the integral with 
f ( . . . ) as the integrand, for example, 

7[u] = r F{x, u(x), u'(x), . . y"'\x))dx, 

Jxfl 



J[u,v]= F{x,y,u{x,y),u^,u ,,, 

S Sb 

v{x, y), v^,Vy,...)dxdy. 

Eor instance, a curve minimizing (1) is the 
curve of steepest descent. 

In extremal problems, suitable boundary 
conditions can be assigned to argument func- 



As an example, consider the simplest vari- 
ational problem 



J[yl 



'*1 

F{x, y(x), y'(x))dx 

Jxq 



min. 



( 2 ) 



Let the boundary condition y{Xo) = y„ yjXj) 
=y, be assigned to the argument function 
y(x). Consider a family of admissible functions 
y(x;^) = y(x)-f£r/(x), where rj{x)is any h'xed 
function vanishing at both endpoints and 
is a parameter. If y(x) gives the minimum 
of J[yl then the function of £, J[U must 
attain a minimum for £ = 0. The condition 
(dJ[ Y]/&),„ = 0 is written in the form 



0= {F t] + F ,rj')dx 



by taking into account the boundary con- 
dition. By making use of the arbitrariness of 
f/(x), we conclude that 



d 

dx 



^^f^y-77/y^Py-fyx- y'Py'y - /Fyy 



( 3 ) 



holds, in view of the following lemma: Let (p{x) 
be continuous in [xq, xJ and r](x) be a func- 
tion of class C'’ that satisfies rj(xo) = = • • 

= = 0, f/(xi) = rf'(xj) = ...= t]^‘^\xi) = 0 

(O^q^p). If J^jj>;(x)(p(x)dx = 0 holds for any 
such rj{x), then (p{x) = 0. (Here class can be 
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replaced by class C" when q is supposed finite. 
If then ^ = oo is admitted.) This is called 

the fundamental lemma in the calculus of vari- 
ations. We call (3) the Euler-Lagrange differ- 
ential equation (or Euler’s equation for the 
extremal problem). Since this equation is of 
the second order, y(x) can be determined by 
means of the boundary condition. 

r n d 

The quantity iFjy — F^ — ^F , contained in 
ax 

equation (3) is called the variational derivative 
of F with respect to y. Furthermore, dy = qdfi 
and bJ = {dJ [ ¥]/&),„ dc are called the first 
variations of the argument function y and of 
the functional ^[y], respectively. If q(x) is not 
subject to the condition that it must vanish at 
the endpoints, then the first variation of J 
becomes 



bJ = 



'*1 

Jxo 



[F~\ybydx+[FybyY^'^. 



In comparison with an ordinary extremal 
problem /(xj, ,x„) = niin in differential 
calculus, and bJ correspond to gradfand 
df, respectively. In general, a solution of the 
Euler-Lagrange differential equation [F]j, =0 
is called a stationary function for the vari- 
ational problem, and its graph is called a 
stationary curve. 

For a variational problem involving several 
argument functions, we have only to write the 
system of Euler-Lagrange differential equa- 
tions corresponding to them. For a problem 

J[y]= ‘ f (x,y, = min 

Jxq 

whose integrand involves derivatives of higher 
orders of an argument function, the Euler- 
Lagrange differential equation is 

m 

For a problem involving a double integral 



J[u]= F{x,y,u,u^,yy)dx dy = min, 

S Sfi 

the equation is 



0=[F], 




d 

Sy 



For a generalized isoperimetric problem, for 
example J[y] = min, K [y] = c, the Euler- 
Lagrange equation becomes 

[F -f — 0 

with the so-called Lagrange multiplier X, where 
F and G denote the integrands of J and K, 
respectively. Two integration constants con- 
tained in a general solution of this differential 
equation of the second order and an undeter- 



mined constant X Can be determined, for in- 
stance, by the boundary condition y{Xo) = yo> 
y{Xi) = y, and a subsidiary condition K [y] = C. 
As an example, for the classi cal pro per isoperi- 
metric problem F = y, G = -l-y'^, the equa- 

tion is 1 — X[y' +y''^)' — 0, which after 
integration leads to (x — a)^ + (y = X^ 

(->■ 228 Isoperimetric Problems). 

Besides the case of fixed endpoints, there is 
a boundary condition, for instance, that an 
endpoint (Xj, y,) of the argument function 
y = y(x) must lie on a given curve T{x, y) = 0. 
For the case of such a movable endpoint, the 
extremal function is subject to the condition of 
transversality, 

x=x,. 



C. Sufficient Conditions 

A. M. Legendre introduced the notion of the 
second variation, corresponding to differential 
quotients of the second order in differential 
calculus, in order to discuss sufficient con- 
ditions. Concerning the simplest problem (2), 
the inequality Fyy,{x, yo W, y'o(^)) ^ 0 is neces- 
sary in order for yo(x) to give the minimum. 
Conversely, the inequality F^.^. > 0 and Jacobi’s 
condition (which is stated below) imply that 
y = yo(x) gives a weak minimum. Here “weak 
minimum” means the minimum when a family 
of admissible functions | y — y, < e, y’ _ y^ | < 
e} is considered a neighborhood of y,. 

Jacobi’s condition: Let u be a solution of a 
linear ordinary differential equation of the 
second order. 




m{xo) = 0; 

then the smallest zero of u that is greater than 
Xq (i.e., the conjugate point of Xg) is greater 
than the right endpoint x,-. 

K. Weierstrass derived suflicient conditions 
for a strong minimum by extending the range 
of admissible functions to { y — Vo <e}. Re- 
sults that were obtained until about that time 
constitute the content of what is usually called 
the classical theory of the calculus of variations. 

If for the variations problem (2) there exists 
a unique curve through every point in a do- 
main on the xy-plane that belongs to a one- 
parameter family of stationary curves of the 
functional 

J[y]= F{x,y,y')dx, 

J Xq 

then the domain is called a field of stationary 
curves. Let the parameter value of the curve 
through a point (x, y) in such a family of 




46 D 

Calculus of Variations 



194 



stationary curvcs y = (p{x; a) be denoted by a 
= ol{x, ^.The slope p{x, y) = [(p'{x; a)]a=„(x,y) is 
called the slope of the tield at the point (jc, y) 
or, by regarding x, y as variables, the slope 
function of the field. The value of a curvilinear 
integral 

/c = /[v] 



= iF{x,y,p)-{y'- 

c 



-p)Fy{x,y,p))dx 



is then determined and depends only on the 
two endpoints of the curve C. We call 
Hilbert’s invariant integral. In view of the 
property mentioned above, we can denote the 
value of the functional J for a function y = y(x) 
representing a curve C by J^.. Then any admis- 
sible curve C that passes through a treld em- 
bedding a stationary curve Cq satisfies 



O^AJ 

Here 



j^i{x,y;p,y')dx. 



^{x,y,P,Y’) 

= Fix, y, y') - Fix, y, p) - iy'-p)Fy.ix, y, p) 

is the (f-function introduced by Weierstrass. 

For Cq to give the minimum of J[y], it suffices 
that S"'^0 hold for every point (x, y) in. the field 
and every value y’ 279 Morse Theory). 



D. Optimal Control 



Let a system of differential equations 

dxjdt =fiixi, . . , x„; «,,, Ufc), 

(uj, . ..) Uk)eQ; x,(to) = x,-, i=l- n, (4) 

be given, where Mi, . . , are parameters. In 
general, a problem of optimal control is to 
determine Uj = Mj(t) (to ^ ^ ^ ti) such that the 
value of a functional 



J[u] 



'V(Xi 



. . X,j, Ml,.... Mj^) dt 



assumes a minimum, where v/t) are the solu- 
tions of (4) and are considered functions in 
U^,. . . ,Ui^, and t. Such a problem is a kind of 
conditional variational problem. But since the 
existence region of u is restricted, certain con- 
ditions in the form of inequalities are imposed, 
and furthermore u is not necessarily continu- 
ous, and in many cases the problem cannot 
be treated within the classical theory of the 
calculus of variations (-^ 86 Control Theory). 



E. The Direct Method in the Calculus of 
Variations 

In mathematical physics tvariational principles 
are derived from discussions of formal corre- 



spondence between a functional J [u] to be 
minimized and Euler’s equation for J[m]. This 
is certainly one of the important methods in 
the calculus of variations, but it is also possible 
to investigate a stationary function Uq on the 
basis of its stationary character and indepen- 
dently of Euler’s equation. This is called the 
direct method in the calculus of variations. It 
plays an important role in the theoretical 
treatment of the existence and uniqueness of 
solutions, and it is also significant as a tech- 
nique for approximate or numerical solutions. 
When a differential equation is given inde- 
pendently of the calculus of variations, it is 
possible to apply the direct method if a func- 
tional whose Euler’s equation is the given 
differential equation can be constructed. 

Eet D be a bounded domain in m- 
dimensional space and / eL2(D) be a real- 
valued function. Consider the variational 
problem of minimizing the functional 

r r 



J[m] = 



lgradul^dx-2 fudx. 



Jd 



Here we suppose the set of admissible func- 
tions, denoted by Aj, to be the Hilbert space 
obtained by completing the function space 
Cq{D) with respect to the norm 

N(u) =^j* |gradMpdx-f J M^dx^ 

Utilizing E. Riesz’s representation theorem in 
Hilbert spaces, it can be shown that there 
exists a minimum value / in A, which is 
uniquely realized by certain Uq e Aj. Since the 
function Uq belongs to A„ it can be shown that 
the boundary condition 

wbn-0 (5) 

is satisfied in a generalized sense. Eurthermore, 
in view of J[Uq] ^ J{Uq + (p] being valid for 
any cpeC^iD), it can be verified that the 
equation 



-Am=/ (6) 

is satisfied in D in the sense of differentiation 
of tdistributions. In other words, the station- 
ary function Uq is a solution in the wide sense 
(a tweak solution) of the classical boundary 
value problem for tPoissOn’s equation for- 
mulated by (6) and (5). If a function space A, 
with AjZ)AjZ> Cq (D) is taken as the set of 
admissible functions, the value I— J[Uq^ be- 
comes the greatest lower bound of J in Aj. 

In this case, if is any minimizing se- 

quence from Aj, that is, if u„ e A, n = L 2, . . ; 
J[m„] ->/ (n^ oo), then it converges to Mq in the 
sense that 



N(u„-Uo)->0 (n-^oo). (7) 

In other words, the solution in the wide sense 
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Uq of the boundary value problem can be 
constructed as the limit of a minimizing se- 
quence that consists of sufficiently smooth 
functions vanishing on the boundary. In the 
proof of the fact that a solution in the wide 
sense Uq coincides with the classical solution of 
the boundary value problem under an assump- 
tion of suitable smoothness for / and dD, a 
standard argument has been established for 
proving the regularity of a solution in the 
wide sense. 

The technique of obtaining the solution of 
the boundary value problem as the limit of 
a minimizing sequence was proposed by B. 
Riemann concerning the classical tDirichlet 
problem and was completed by D. Hilbert. 

This pioneering work led to the recent treat- 
ment of boundary value problems by utilizing 
Hilbert spaces. For -i-Self-adjoint boundary 
value problems, the method stated for the 
above example has been generalized almost 
directly to the cases of differential operators of 
higher order and with variable coefficients (- 
323 Partial Differential Equations of Elliptic 
Type). By making use of some auxiliary argu- 
ments, this technique can be applied exten- 
sively to the construction of several kinds of 
mapping functions in the theory of functions 
of a complex variable, to the solution of 
integral equations of the second kind, and 
also in other fields [3,4]. 

The eigenvalue problem, which is formu- 
lated by 

Hu=/lu, U9^0, (8) 

with a +Self-adjoint operator H in a Hilbert 
space, can also be transformed into a vari- 
ational problem for the t Rayleigh quotient 

R[u']^(Hu,u)/jjup (9) 

298 Numerical Computation of 
Eigenvalues). 

F. Solution of Differential Equations by the 
Direct Method 

In view of the convergence shown in (7), a 
minimizing sequence can be regarded as an 
approximating sequence for a solution of the 
boundary value problem or a stationary func- 
tion Uq. Eet a function 

u„= u„(x;c)= u„(x;Ci, ■ ■ , c,) (10) 

involving an n- vector c = (cq, . . . , c,) as a para- 
meter be admissible for any c. If J [w„( ■; c)] = 
E(c), obtained by substituting u„ into J, is 
minimized at c = c°, then u„{ •; c°) is considered 
as a function that approximates Uq most pre- 
cisely within the family u„{ •; c). This vector 
is obtained, in general, by solving the simulta- 
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neous equations 

0J[m„(-;c)]/5cj=O, 7 = (11) 

The function u„ appearing in (10) is often taken 
to be a so-called linear admissible function. 

Eor instance, in the example above, let 

be a system of independent functions complete 

in Aj, and set 

u„ = CiV>i + ...-l-c„(p„. (12) 

The method that constructs a minimizing 
sequence U 2 ,. by determining the value of 
the in (12) by (1 1) is called Ritz’s method, 
and (p^ is called a coordinate function in this 
method. As for the rate of convergence in 
approximation by Ritz’s method, as well as 
for the estimation of errors, there are several 
results by the Soviet school in addition to 
those of E. Trefftz [3,5]. (Other methods of 
constructing minimizing sequences are stated 
in detail in [3]; concerning a connection with 
tGalerkin’s method 304 Numerical Solution 
of Partial Differential Equations.) 

Ritz’s method applied to eigenvalue prob- 
lems is called the Rayleigh-Ritz method. Since 
a stationary value of the Rayleigh quotient is 
itself an eigenvalue in this case, the precision of 
approximation is far better for eigenvalues than 
for eigenfunctions, so that it is a convenient 
method for the approximate computation of 
eigenvalues. 
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Cantor, Georg 

George Cantor (March 3, 1845- January 6, 
1918), the founder of set theory, was bom in 
St. Petersburg into a Jewish merchant family 
that settled in Germany in 1856. He studied 
mathematics, physics, and philosophy in Zu- 
rich and at the University of Berlin. After re- 
ceiving his degree in 1867 in Berlin, he became 
a lecturer at the University of Halle and served 
as a professor at that university from 1879 to 
1905. In 1884, under the strain of opposition 
to his ideas and his efforts to prove the tcon- 
tinuum hypothesis, he suffered the first of 
many attacks of depression which continued 
to hospitalize him from time to time until 
his death. 

The thesis he wrote for his degree concerned 
the theory of numbers; however, he arrived at 
set theory from his research concerning the 
uniqueness of ttrigonometric series. In 1874, he 
introduced for the first time the concept of 
-i-Cardinal numbers, with which he proved that 
there were “more” ttranscendental numbers 
than talgebraic numbers. This result caused 
a sensation in the mathematical world and 
became the subject of a great deal of con- 
troversy. Cantor was troubled by the oppo- 
sition of L. tKronecker, but he was supported 
by J. W. R. tDedekind and G. Mittag-Lefiler. 

In his note on point-set theory, he wrote, in 
connection regard with his concept of infinity, 
“The essence of mathematics lies in its free- 
dom!” In addition to his work on cardinal 
numbers, he laid the basis for the concepts of 
torder types, ttransfmite ordinals, and the 
theory of real numbers by means of tfunda- 
mental sequences. He also studied general 
point sets in Euclidean space and defined the 
concepts of '^accumulation point, 'iclosed set, 
and topen set. He was a pioneer of the point- 
set theory that led to the development of 
general ttopology. 
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Capacity 

A. General Remarks 

The electric capacity of a conductor in the 3- 
dimensional Euclidean space is defined as 
the ratio of a given positive charge on the 
conductor to the value of the potential on the 
surface. This definition of capacity is indepen- 
dent of the given charge. The capacity of a set 
as a mathematical notion was defined lirst by 
N. Wiener (1924) and was developed by 0. 
Erostman, C. J. de Ea Vallee Poussin, and 
several other French mathematicians in con- 
nection with potential theory. 

B. Energy 

Eet Q be a tlocally compact Hausdorff space 
and d)(x, y) be a +lower semicontinuous func- 
tion on Q X Q such that — oo <0 oo. A mea- 

sure n will mean a nonnegative tRadon mea- 
sure with compact tsupport Denote by 
<I)(x, /t) the tpotential J d>(x, y)tf/i(y) of a mea- 
sure fi with kernel 0 and by ^) the tenergy 
of 11 . Eet X be a set in Q, and denote 
by the class of normalized measures ^ (i.e., 
of measures fi satisfying fi[Q) = 1) with c. X. 
Eet X be a nonempty compact set in Q. Set 
IE(X) = inf(ju,/i) for and W{0)=oo for 

the empty set 0. Eor $(x, y) = l/|x — y|in Q 
= R^, the general solution u(x) of the tDirich- 
let problem (texterior problem) for the bound- 
ary function 1 in the unbounded component of 
R^ — X is equal to the potential of an tequilib- 
rium mass-distribution. Therefore, if S is a 
smooth surface surrounding K and normals 
are drawn outward to S, the integral -( 1/4 n)- 
^s{du/dn)d(7 of the normal derivative is equal 
to \IW{K). This is the capacity of X delined 
by N. Wiener [J. Math. Phys., MIT, 3 (1924)) 
when X is a closed region. Vallee-Poussin 
(Ann. Inst. H. Poincare, 2 (1932)) called the 
supremum of /i(R^) the Newtonian capacity of 
a bounded tBorel set E, where fi runs through 
the class of measures fi with E whose 
^Newtonian potentials are not greater than 1 
in R\ If E is compact, the Newtonian capacity 
coincides with Wiener’s capacity. Eor the 
tlogarithmic potential in R^, c is called 
the logarithmic capacity. When tGreen’s func- 
tion g{z,cc) with the pole at the point at infin- 
ity exists in the unbounded component of R^ 
-X, lim2^^(0(z,oo)-log|z|)is called Robin’s 
constant and can be shown to be equal to 
W(K). (Eor the relation between Robin’s con- 
stant and treduced extremal distance ->.143 
Extremal Eength.) In the case of a general 
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kernel it is difficult to define capacity as above 
by means of W(K), and hence the value of 
W(K) itself instead of the capacity of K is often 
used. When W(K)= oo, we can say that K is of 
capacity zero. The minimum value of the 
tGauss integral (/t, /t) - 2 JJ dn is a generaliza- 
tion of W(K), where and f is an upper 
semicontinuous function bounded above on K. 



C. Minimax Value 



Suppose that we are given the kernel as 
above. For a set X c Q and a measure ji, 
set {7(/t; X) = sup^g;^ 0(x, fi) and K(/i; X)= 
inf^g;^^d)(x, fx). Next, for Tc Q, set U(Y) = 
inf Uin; S^), V(Y) = sup S,), U{X, Y) = 
inf U{ p; X), and V(X, Y) = sup V{ p; X), where 
If 0(x, y) = @(y, x) is taken as a kernel 
instead of <I)(x, y), then the notations u/(K), 
C(p;X), F(p; X), are used correspondingly. 
For any compact set K the following relations 
hold; 



W(K)= W{K)^C\{K)= ii(K) 



< 



U{K,K)= V{K,K)1 
UiK,K)=V(K,K)\ 



I f7(aK)=F(X,Q) 
^\U{Q, K)= K(/C,Q) 
I V(K)= V(K). 



Examples show that all the inequalities can be 
strict. The tminimax theorem in the ^theory of 
games plays an important role in the proof of 
these inequalities [7]. Even if the kernel is 
symmetric, the inequalities can be strict except 
for the equality W(K)= U(K). When the kernel 
is positive, we can define the quantities which 
correspond tO U(Y), V(Y), V{X, Y), V(X, Y) 
by considering the class of p with c Y and 
(p, p) = 1 instead of ^y. 



D. Transfinite Diameter 

As k^oo, 

D,(X)-/c ‘(/c-1)-' inf E0)(x,.,x,) 

decreases and the limit D (K) is equal tO W (K). 
Eor the logarithmic kernel in M. Eekete 
defined D(K) and called e the transfinite 
diameter of K (1923). E. Eeja and his school in 
Poland studied relations between transfinite 
diameter and tconformal mapping. Next, set 

kR^{X, Y) = sup inf ^ xj. 

X, i = l 

Then R(Y,Y) = lim R^(Y,Y) exists as k->oo, 
and we have R{K, Y)= V{K, Y). 

Eekete introduced R(K)=R(K, K) in 
(1923). G. Polya and G. Szego computed D(K) 
and R(K) for special K and cc-kemel r “ (a ^ 0) 
in and R^ [12]. The equality D{K) = R{K) 
holds for the logarithmic kernel in and the 



Newtonian kernel in R^. The maximum of the 
absolute value on K of ^Chebyshev’s poly- 
nomial (^ 336 Polynomial Approximation) 
of order k with respect to K in is equal to 
exp(^kRfc(K)). 

E. Evans’s Theorem 

In order that K be of Newtonian capacity 
zero, it is necessary and sufficient that there 
exist a measure p on K such that the New- 
tonian potential of p is equal to go at every 
point of K. This result was proved by G. C. 
Evans and H. Selberg independently (1935) 
and is called Evans’s theorem (or the EvanS- 
Selberg theorem). The corresponding theorem 
in R^ is often applied in the theory of functions 
[ 15, 16]. A similar potential exists in case of a 
general kernel if and only if R{K,K)= 00. 



F. Nonadditivity of Capacity 

Many kinds of capacity satisfy the inequality 
cap( [J„ X,) ^ cap Y„, where a capacity is 
denoted by cap. Even the Newtonian capacity 
C is not necessarily additive, but it satisfies 
C(Ki U K^] + C(K , n K2) < C(K,) + CiK^) (G. 
Choquet [2]). Choquet [3] proved that X can 
be divided into mutually disjoint sets X, and 
X2 such that C;{X) = C,(Yi) = C;(X2), where 
Cj{X) is the Newtonian inner (or interior) ca- 
pacity delined to be supj^ ^ x if X 0 
and 0 if X = 0. 

G. Relation to Hausdorff Measure 

There are many studies of relations between 
capacity and tHausdorff measure [ 1], Frost- 
man [6] introduced the notion of capacitary 
dimension and observed that it coincides with 
the Hausdorff dimension. The capacity of 
product sets has been evaluated from above 
and below [11]. Eor compact sets K cz R", K' 
c R”, their dimensions a, j?, and the dimension 
y of K X K', we have the relation a + ^ y ^ 
min(m + s, n + fi), where the equalities are 
attained by general tCantor sets. There are 
also works on the evaluation of capacities 
of general Cantor sets [10, 15]. If K is a con- 
tinuum of logarithmic capacity 1 in a plane, 
then its diameter d satisfies 2 ^ d ^ 4, and its 
area A satisfies A<n [S]. Consider the sum K 
= {zi + ... + z„\ZkeK^, ofcontinua 

Kj , . . . , in a plane. The logarithmic capacity 
of K is strictly greater than the sum of the 
logarithmic capacities of X,, , except 
when all are convex and mutually similar 
[14]. By various tsymmetrizations the log- 
arithmic capacity decreases in general (-^ 228 
Isoperimetric Problems; also [ 133). 
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H. Capacitability 

The Newtonian outer (or exterior) capacity 
C,(X) is defined by inf C,(G), where G ranges 
over an open set containing X. The inequality 
Cj(X)^ C,(X) holds, in general. When the 
equality holds, X is called capacitable. Cho- 
quet (1955) [2] proved that all '’'analytic sets 
and hence tBorel sets are capacitable but there 
exists an analytic set whose complement is not 
capacitable. He himself generalized his result 
on capacitability in the following way [4]: Let 
Q be an abstract space, (p a nondecreasing 
function defined on the family of all subsets of 
and jT some family of subsets of Q that is 
closed under the formation of finite unions 
and countable intersections. Assume that 
as in jf decreases to H and 
that (p{X„)](p{X) as X„]X. When cp(X) is equal 
to s\ip{(p{H) \ H B Jf , H czX},X is called 
{(p, Jf )-capacitable. Choquet detined jf - 
Suslin sets and showed that they are {(p, 4f)- 
capacitable. M. Kishi [9], Choquet [5], and B. 
Fuglede [7] investigated capacitability with 
respect to several kinds of capacity more gen- 
eral than Newtonian capacity. We can discuss 
capacitability with respect to quantities de- 
fined in connection with the ^Gauss varia- 
tional problem. 
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Cardinal Numbers 

A. Definition 

The general concept of cardinal number is an 
extension of that of natural number (Cantor 
[ 1]). When there exists a ■''one-to-one corre- 
spondence whose +domain is a set A and 
whose ■’range is a set B, this set B is said to be 
equipotent (or equipollent) to A, and this rela- 
tion is denoted by A-B. The relation ~ is 
an ’equivalence relation, and each equivalence 
class under this relation is said to be a cardinal 
number. The class of all sets equipotent to a set 
A is denoted by A (or A\) and is said to be the 
cardinal number (power, cardinality, or potency) 
of tbe set A. When A is a finite set, A is said to 
be finite, and when A is an infinite set, A is 
said to be infinite (or transfinite). When the 
cardinal number of a set A is m, A is also said 
to consist of m members (or m elements). In 
this sense, 0 and the natural numbers are 
considered to express finite card inal numbers. 
For example, 0= 0, 1 = {0}, 2 = {0,1}, etc. 
Examples of infinite cardinal numbers: A set A 
which is equipotent to the set N of all natural 
numbers is said to be countably infinite, and 
the cardinal number of the set N is denoted by 
a. A set A which is finite or countably infinite 
is said to be countable. The cardinal number of 
the set of all real numbers is denoted by c and 
is called the cardinal number of tbe continuum. 
Moreover, the cardinal number of the set of all 
real-valued functions whose domain is the 
interval [0, 1] is denoted by f. These three 
cardinal numbers are known to be distinct. 
Henceforth in this article, lower-case German 
letters denote cardinal numbers. For a defi- 
nition of cardinal numbers using the concept 
of ordinal numbers 312 Ordinal Numbers. 
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B. Ordering of Cardinal Numbers 

m ^ n or n m will mean that there exist sets 
A and B such that m = /I, n = 6, and /I B. 
A^B does not necessarily imply m^n. For 
example, the cardinal number of the set of all 
positive even numbers is also a. m > n and 
n > m imply m = n (Bernshtem’s theorem). 

Since the treflexive and ^transitive laws for the 
relation ^ between cardinal numbers are 
obvious, the relation is an tordering relation. 
The twell-ordering theorem imples that ^ is a 
^total ordering (comparability theorem for 
cardinal numbers), m > n means that m ^ n 
and nt ^ n. When B ^ m, B is said to be at 
most m. 



C. Sum, Product, and Power of Cardinal 
Numbers 

For cardinal numbers m and n, choose sets A 
and B so that m = n = B, and T fl B = 0, 
and put s = /4 U B. Then s is uniquely deter- 
mined by m and n. The s is said to be the sum 
of the cardinal numbers m and n and is de- 
noted by m -H n. If the sets A, B are chosen as 
described above, the cardinal numbers of the 
^Cartesian product A x B and of the set of 
functions A^ are called the product of m and n 
and the nth power of m, denoted by mn and 
m", respectively. These operations are also 
determined by m and n. For these three oper- 
ations, the following laws are valid: commuta- 
tive laws m -I- n = n -h m, mn = nm; associa- 
tive laws (m-i-n)-l-p = m-|-(n-|-p), (mn)p = 
m(np); distributive law p(m -I- n) = pm -I- pn; 
exponential laws m"^‘' = m"m^, m"'’ = (m")'^', 
(mn)'’ = m^n^’. In particular, if /4 =m, then 2‘" 
is the cardinal number of the +power set '’•P(/4) 
of A. 

Addition and multiplication of more than 
two cardinal numbers can be defined as fol- 
lows. Let A be any set, and suppose that to 
any element X of A there corresponds a unique 
rardinal number m^. Let be a set such that 
M I = rn;i, and fl M\ = 0 for / Then 

the cardinal number of the tdisjoint union 
is said to be the sum of all m^ and is 
denoted by 2;.tn^. The cardinal number of the 
Cartesian product is said to be the 

product of all m^(2e A) and is denoted by 
n, m^. The axiom of choice can be stated as 
follows: If nt; 9^0 for all AeA, then ri;im;7^0. 



D. The Continuum Hypothesis 

For a, c, and f defined as before, f = 2' > c = 2" > 
a. In general, 2"'>m holds for any cardinal 
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number m (Cantor). The hypothesis which 
asserts that for any m, there does not exist an 
n such that 2'" > n > m is called the gener- 
alized continuum hypothesis. In particular, this 
hypothesis restricted to the case where m = a 
is called the continuum hypothesis. After Can- 
tor stated this hypothesis {J. Reine Angew. 
Math., 84 (1878)), it remained an open ques- 
tion for many years. In particular. Cantor 
himself repeatedly tried to prove it, and W. 
Sierpihski pursued various related hypotheses. 
Finally, the continuum hypothesis and the 
generalized continuum hypothesis were proved 
to be independent of the axioms of set theory 
by K. Godel (1940) [3] and P. J. Cohen (1963) 
[4] (^ 33 Axiomatic Set Theory). 



E. Cardinality of Ordinal Numbers 

Lower-case Greek letters will stand here for 
^ordinal numbers. The cardinal number of 
1 <a} will be denoted by a, which is called 
the cardinality of the ordinal number a or the 
cardinal number corresponding to a. When a 
cardinal number m corresponds to some 
ordinal number, the minimum among ordinal 
numbers a with a = m is called the initial or- 
dinal number corresponding to m. An initial 
ordinal number corresponding to an infinite 
cardinal number is called a transfinite initial 
ordinal number. There exists a unique corre- 
spondence P^cop from the class of ordinal 
numbers onto the class of all transfinite initial 
ordinal numbers such that /?>y implies o)p> 
tOy. In particular, coq — oj, and an ordinal 
number ^ such that C<co^ is called a count- 
able ordinal number. (Op is called the pth 
transfinite initial ordinal number. The cardi- 
nality of cDp is denoted by (K is the Hebrew 
letter aleph). In particular, a is denoted by 
(aleph zero). We have if and only if 

P^y, and in this case -i- = 
i<p. The axiom of choice implies that every 
infinite cardinal number is an Hence, in 
this case, the continuum hypothesis can be 
formulated as 2^'<> = N] , and the generalized 
continuum hypothesis can be formulated as 
2*^/' = Kp+i for every ordinal number p. 



F. Finiteness and Infiniteness 

Dedekind [5] defined a set A to be infinite if A 
is equipotenl to a proper subset of itself, and 
to be finite otherwise. It is also possible to 
define finiteness and infiniteness of sets as 
follows: A set A is finite if there exists a ^well- 
ordering of A such that its Mual ordering is 
also a well-ordering, and A is infinite other- 
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wise. If a set is finite in the latter sense, then it 
is also finite in the sense of Dedekind. Under 
the axiom of choice, these two definitions can 
be shown to be equivalent. 
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Cartan, Elie 

Elie Cartan {April 9, 1869-May 6, 1951) was 
born at Dolomieu in the French province of 
Isere. He entered the Ecole Normale Supe- 
rieure in Paris in 1888 and graduated in 1891, 
having at the same time qualified in the agrege 
examination. Beginning his research immedi- 
ately, he completed his thesis on the structure 
of continuous transformation groups [2] in 
1894 at the age of 25. 

Cartan was a professor first at the Univer- 
sity of Montpellier, later at the University of 
Lyon, then the University of Nancy, and fi- 
nally in 1912 at the University of Paris. He 
freely used the tmoving coordinate system 
introduced by J. G. Darboux, and contri- 
buted to many areas, such as the theory of 
tLie groups, the theory of tpfaffian forms, the 
theory of tinvariant integrals, ttopology, tdif- 
ferential geometry (especially the geometry 
of tconnections), and theoretical physics. His 
doctoral thesis is still an object of interest 
among young researchers today, and the con- 
cept of connection introduced by him is funda- 
mental in the field of differential geometry. 
Henri Cartan (1904-) is his eldest son. 
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A. General Remarks 

Catastrophe theory was originally proposed 
by R. Thom [1,2] in late 1960. This theory 
provides certain mathematical models for the 
evolution of forms in nature, in particular in 
biology and the natural languages. In recent 
years, E. C. Zeeman and others have devel- 
oped and applied the theory to various fields, 
including the physical sciences, medicine, 
economics, and sociology; a detailed bibli- 
ography may be found in [3,9, 10]. 

B. Static Models 

The basic concept of the theory is the static 
model. This is a family of potential functions 
/„: A-^R, where V is a subset of R", containing 
a neighborhood of the origin, and the para- 
meter u lies in a neighborhood U of the origin 
in R". 

We regard R" as the internal space or state 
space, which is parametrized by the various 
variables that are relevant to the process under 
study, and R*" is the external space or control 
space, which can be either the physical space- 
time continuum in which the process under 
consideration takes place or a space of control 
parameters that govern the process. As a rule, 
we assume that the dimension r^4, although 
the dimension n can be arbitrary large. 

The static model is local in nature, and any 
one of the local minima of /„, called a local 
regime at mg U, is a candidate for the state 
of the model corresponding to the control 
point M. 

Mathematically, a static model is a germ of 
C®-functions /:R" x R''-+R at 0 that is an 
unfolding (r-dimensional extension family) of a 
germ of C®-functions =/ 1 R" x {0} : R"^R at 
0; further details are given in Section D. 
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C. Classification of Singularities 

Let m) be the vector space of tgerms of 
C“ -functions / : at 0, and let ^(n) <= 

(f(n, n) be the subset of invertible germs R”-> 

R" mapping 0 to 0. 

The germs r/, ^eS^n, m) are called right 
equivalent if there exists an hs^(n) such that 
r]oh — ^. Let S“{n) = S’{n, 1). Then it follows in 
the usual manner that S’in) is a local algebra 
with the unique maximal ideal J(^Jl{n) = 

{ 6 (f (n) I r/ (0) = 0} . A germ r\ e S’(n) such that 
rj{0) = Dri{0) — 0 is called a singularity. For any 
singularity ?/, we define the codimension of by 

codim r] — dimR(^/<df//r5xi>^(„)), 

where is the ideal of <f{n) gen- 

erated by dt]ldxi, ...,dr]/dx„ and x = (x,, 
denotes the coordinate system of R". 

The following result was proved by J. N. 
Mather [4]: Lfp to the addition of a nondegen- 
erate quadratic form in other variables, and up 
to multiplication by ±1, a singularity of codi- 
mension <4 and ^ 1 is right equivalent to one 
of the rj appearing in Thom’s list of the seven 
elementary catastrophes below. 

D. Unfoldings 

Let r] he a singularity. An r-unfolding of is a 
germ / e..#(n-|-r) such that /|R" x {0] = rj; this 
unfolding is denoted by (r, /). Let (r,/) and 
(s, g) be unfoldings of t], A morphism {(p, d>, S’) : 
{r, f)^{s, g) consists of (i) a germ (peS{n -I- r, 
n + s) such that ip | R" x {0} = identity, (ii) a 
germ O e S{r, s) such that n^o(p = (^on^, (hi) a 
germ £e,y#(r) such that f—go(p + F.on„ where 
7r,:R"xR’^^R'^ is the projection. In this case 
we say that the unfolding (r,/) is induced by 
((^, d>,e) from (s, y). A morphism is an isomor- 
phism if q> and d> are diffeomorphic germs. 

The addition of unfoldings (r,/) and {s,g) of 
r] is defined by (r,f) + {s,g) = {r + s,f + g — rj), 
where the last term on the right is given by 
if+g-ri)ix,u,v)= fix, u) + g{x, r) - tj(x). 

Thus if the constant unfolding (s, i/) of is 
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defined by rj{x, u) = tj(x), we have (r,/) -I- (s, g) = 
(r + s,f). 

An unfolding (r,f) of g is said to be versal if 
any unfolding of g is induced by (r, f) and a 
suitable morphism. A versal unfolding (r,f) 
with minimal r is said to be universal. The 
following facts have also been proved by 
Mather [4]; 

A singularity r/e..#(n) has a versal unfolding 
if and only if codim f/ is finite. Any two r- versal 
unfoldings of g are isomorphic. Every versal 
unfolding is isomorphic to (r,/)-|- constant, 
where r = codim and (r,f) is a universal 
unfolding defined as follows: If (hi, ...,/?,}<= 
J^{n) is a system of representatives for a basis 
of then the unfolding/ of 

g is defined by fix,u) — g{x) + bfx)ui + ...+ 

E. The Seven Elementary Catastrophes 

Let / and g be germs in S{n + r). We say that / 
and g are equivalent as r-unfoldings if there 
exist he^{r), a family of H^e^{n), where ue 
U ciR'', and an ee^{r) such that f{x,y) = 
g(H^{x),hiu)) + F.iu). 

We say that a static model (r,f) is stable if 
any small perturbation {r, g) of (r,/) in S(n + r) 
(with the Whitney C®-topology) is equivalent 
to (r,f). The following is the main result ob- 
served by Thom and proved by Mather and 
others (for references see the bibliography in 
[3]). Suppose that r^4. Then the set of stable 
static models (r,f) is an open dense subset of 
S(n + r), and up to the addition of a nondegen- 
erate quadratic form and multiplication by 
±1, any stable static model (r,f) is equivalent 
to one of the models with the standard poten- 
tials F, which are the universal unfoldings 
of singularities g in Table 1 (x,y denote inter- 
nal variables, and u, v, w, t denote external 
variables). 

The static models with these standard po- 
tentials are known as the elementary catas- 
trophes and can serve as qualitative models 
of various natural processes. 



Table 1. Thom’s List of the Seven Elementary Catastrophes 



r 


Singularity g 


Standard Potential F 


Name 


1 


x^ 


x^-1- ux 


Fold 


2 


x^ 


x"^+ ux^+ vx 


Cusp (Riemann-Hugo-Niot) 


3 


x^ 


x^-h UX^ + VX^ + WX 


Swallowtail (Dovetail) 


3 


x^+y^ 


-t- uxy + vx+wy 


Hyperbolic umbilic 


3 


x^ — xy^ 


x^ — xy^ + w(x^-t->’^) + vx + wy 


Elliptic umbilic 


4 




x^ + tx'* + ux^ + vx^ -1- wx 


Butterfly 


4 


x'^y+y'^ 


x^ y -h y'^ + ux^ + vy^ + wx + ty 


Parabolic umbilic 
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F. Catastrophe Sets 

A process for the static model / GS"{n + r) is a 
subset s of A X t/, where X and U are neigh- 
borhoods of the origin in R” and R*", respec- 
tively. If s is a process and ueU,we define s„ — 
sn(A' X {u}). We say that wet/ is a regular 
point for s if there exists a neighborhood V of 
w in (7 and a homeomorphism h:Xx V-*X x 
V such that n^oh = n^ on X x V and h{sf] 

X F)) = s„ X K A catastrophe point is a non- 
regular point in U. The set of all catastrophe 
points is called the catastrophe set. 

There are several conventions with regard to 
the definition of a process for the unfolding 
(r,/). One of these is the Maxwell convention, 
which requires that s„ be a least local regime at 
usU. The catastrophe set of the model (r,/) 
according to the Maxwell convention consists 
of those points u of the control space R'' where 
the least minimum of /„ is attained for at least 
two points or where this minimum is attained 
at a unique point but is not stable. 

Another is the perfect delay convention, 
which assigns to each path t in 17 a mapping 
(possibly discontinuous) m^:r^X x z such that 
— {m^{u),u) is a local regime, and 

remains continuous for a maximum interval 
on the path z. Consider the set 

A = { (x, u) e R” x R*” I f{x, u) is degenerate} 

and its image B = \{A), called the bifurcation 
set, under the projection ti^. R" x R'■^R^ Then 
the points of the bifurcation set B are impor- 
tant candidates for catastrophe points of the 
static model with respect to the perfect delay 
convention. 

For geometrical studies of the elementary 
catastrophes ^ [6-8]. The static models have 
been generalized to metabolic models, the 
structure of which largely remains an open 
question [2]. In this connection, the bibli- 
ographies of [3,9, 10] appear to be fairly 
complete. 
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Categories and Functors 

A. Categories 

Consider the family of all tgroups. Given two 
groups X and Y, denote the set of all thomo- 
morphisms of A to 7 by Hom(A, 7). If A, 7, 
and Z are groups and if f:X^Y and g:Y^ 

Z are homomorphisms, we can compose 
them to get a homomorphism gof-.X^Z. 

In general, suppose that we are given, as in 
this example, (1) a family iOl of mathematical 
objects, and (2) for every pair (A, 7) of objects 
in iOl, a set Hom(A, 7) whose elements are 
called morphisms; and let / e Horn (A, 7) 
and 06Hom(7, Z) determine a morphism 
go f eHom(A,Z), which is called their com- 
posite. A morphism /eHom(A, 7) is also 
written f:X^Y. Suppose further that these 
morphisms satisfy the following axioms: (1) if 
f : A-»7, g: Y-*Z, and h\Z^W are mor- 
phisms, then {hog)of —ho{gof); (2) for 
each object Ae9JZ there exists a morphism 
l;j^:A->A such that for any f:X^Y and 
g:Z^X we have f olx~f and 1 ^ 0 ,^ = ^; 

(3) Hom(A, 7) and Hom(A', 7') are disjoint 
unless A = A' and Y— 7'. Then we call the 
whole system (i.e., the family of objects 2R, the 
morphisms, and the composition of mor- 
phisms) a category. The elements in 9R are 
called the objects of the category. 

By axioms (1) and (2), the set Hom(A, A) is 
a semigroup (with respect to the composition 
of morphisms) which has l;j. as the identity 
element. Hence is determined uniquely by 
A. On the other hand, axiom (3) implies that 
a morphism / determines the objects A and 
7 such that / eHom(A, 7). From these facts 
we can give an alternative definition of cate- 
gory using only the morphisms and their 
composition. 

The totality of the objects (or morphisms) in 
a category ^ is denoted by Ob(^) (or Fl('^); 
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the notation FI comes from the French word 
fleche). The relation xeOb(‘^) is often ab- 
breviated to XG*^, while Hom(2f, F) is written 
Hom.j(2f, F) if necessary. A subcategory of a 
category is a category with Obf'^') <= 
ObF^), such that for X, Ye^' we have 
Hom.^.fA', F)c=Hom,^(A', F) and the compo- 
sition in is the restriction of ^ to If 
Hom,^.(A, Fl^Hom^^fA, F) for all X, Ye^', 
we say that is a full subcategory of 

We define the product category x <^^2 of 
two categories in the canonical way, using the 
pairs of objects and the pairs of morphisms. 

B. Examples of Categories 

(1) Taking all sets as the objects, all mappings 
as the morphisms, and the composition of 
mappings as the composition, we obtain a 
category called the category of sets, denoted by 
(Sets) (or (Ens) from the French ensemble). For 
the empty set 0 we make the convention that 
Hom(0, F) contains just one element for any 
F and that Hom(F, 0) is empty if4^/0. 

(2) As we have seen, taking all groups as the 
objects and the homomorphisms as the mor- 
phisms, we get the category of groups, written 
(Gr). If we limit the objects to ^Abelian groups, 
we get the category of Abelian groups (Ab) as a 
full subcategory of (Gr). 

(3) Fix a ring R. The left /^-modules and 
their R -linear mappings define the category of 
left R-modules, which we denote by The 
category of right R-modules, is defined 
similarly. When R is ^unitary, we usually 
limit the objects of and to t unitary 
modules. If R is commutative we can identify 

with When R — Z (the ring of ra- 

tional integers), can be identified with 
(Ab). When R is a field, is also called the 
category of linear spaces over R. 

(4) Taking rings as objects and homomor- 
phisms of rings as morphisms, we obtain the 
category of rings. The subcategory consisting 
of unitary commutative rings and unitary 
homomorphisms is called the category of 
commutative rings and is denoted by (Rings). 

(5) If we take ^differentiable manifolds as 
objects and differentiable mappings as mor- 
phisms, we obtain the category of differentiable 
manifolds. Similarly, for tanalytic manifolds 
and analytic mappings we obtain the category 
of analytic manifolds. 

(6) Taking topological spaces as objects and 
continuous mappings as morphisms, we get a 
category called the category of topological 
spaces and denoted by (Top). On the other 
hand, if we take the thomotopy classes of 
continuous mappings as morphisms and 
define their composition in the natural way, 
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we obtain another category, which is called the 
homotopy category of topological spaces. 

(7) Fix a ^preordered set 1. Taking the ele- 
ments of / as the objects and the pairs (x, y) of 
elements of / with x^y as the (unique) mor- 
phism from X to y, we get a category, in which 
we define the composite of the morphisms 
(x, y) and (y, z) to be (x, z). 

In examples (1) through (6), the totality of 
the objects Ob('(^) is not a set, but a tdass 
(^ 381 Sets G; for the logical foundation of 
category theory ^ [3,9]). 

C. Diagrams 

If a set of arrows { A^} and a set of points 
are given in such a way that each arrow has 
a unique initial point and a unique endpoint, 
then we say that {A^, Bp} is a diagram. (Usu- 
ally, we consider the case where each point Bp 
is the initial point or the endpoint of at least 
one A^ (Fig. 1).) Let ^ be a category and 
{A^,Bp} a diagram. If we associate a mor- 
phism /j in with each arrow A^ and an 
object ZpE^t^ with each point Bp so that 
/j,GHom(Zp,Zy) whenever A^ has the initial 
point Bp and the endpoint By, then we say that 
{ /^, Zp] is a diagram in the category ^ (Fig. 2). 
Suppose, furthermore, that the following con- 
dition is satisfied: For any pair of points Bp 
and By, and for any sequence of adjacent 
arrows A^^,..., starting at Bp and ending 

at By (i.e., the initial point of A^^ is Bp, the 
endpoint of A^. is the initial point of and 
the endpoint of A^^ is By), the composite 

o ... (eHom(6^, By)) depends 

only on Bp and By. Then the diagram in is 
said to be a commutative diagram. For exam- 
ple, commutativity of Fig. 2 is equivalent to 
J3 —/a ~f5- 




Fig. 1 Fig. 2 

D. Miscellaneous Definitions 

A morphism /: A ^ F in a category ^ is called 
an isomorphism (or equivalence) if there exists a 
morphism g: F->A such that f og=\y,gof — 
\x- In this case, g is determined uniquely by 
/ and is itself an isomorphism. We call g the 
inverse morphism of f Then the inverse of g 
is f. An isomorphism is sometimes written 
f:X^Y.Tv/o objects X and F are said to be 
isomorphic if there is an isomorphism X^Y, 
and then we write X=Y. The composite of 
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isomorphisms is again an isomorphism. In 
particular, an isomorphism X^X is an inver- 
tible element of the semigroup Hom(2f, X), 
and is called an automorphism of X. Examples 
of isomorphisms are bijections in (Sets), group 
isomorphisms in (Gr), the ^-isomorphisms in 
ring isomorphisms in the category of 
rings, tdilTeomorphisms in the category of 
differentiable manifolds, and homeomorphisms 
in (Top). 

A morphism /: A E is called a monomor- 
phism (or injection) if for any object Z and for 
any morphisms u, v:Z~*-X{u^v) we have 
fou^fov. Dually, f:X-*Y is called an epi- 
morphism (or surjection) if for any u,v:Y-^ 
Z{uj^v) we have uo/^t;o/In the category 
of sets the monomorphisms and the epimor- 
phisms coincide, respectively, with the injec- 
tions and the surjections as mappings (^381 
Sets). A monomorphism which is at the same 
time an epimorphism is called a bijection. An 
isomorphism is always a bijection, but the 
converse is false in some categories. 

Two monomorphisms /j and 

fi ■ 2^2^ A (with the same A) are said to 
be equivalent if there exist ; Ai->A2 and 
g 2 -.X 2 ^Xi such that /i=/2 0^1 and/2 = 

/, 0 02 (Fig. 3). An equivalence class with 
respect to this equivalence relation is called a 
subobject of A. Similarly, we define a quotient 
object of A as an equivalence class of 
epimorphisms A ^ A,. 




Fig. 3 

An object c of a category ^ is called a 
final object of ^ if for every object Y of 
Hom(T, c) contains one and only one element. 
Dually, an object e' is called an initial object 
(or cofinal object) if Hom(e', K) contains one 
and only one element for every Ye^.U e^ and 
ej are final objects, then there is a unique iso- 
morphism C] ^^2, and similarly for initial 
objects. A set with only one element is the 
final object in (Sets), and a space with only 
one point is the final object in (Top). In the 
category (Gr) (resp. (Ab)), the trivial group {1} 
(or {0}) is the final object and the initial object 
at the same time. In the category of commuta- 
tive rings, the zero ring {0} is the final object 
and the ring of rational integers Z is the initial 
object. 

E. Product and Coproduct 

Let A, and A2 be objects of a category We 
say that a triple (F, Pi , P 2 ) consisting of an 



object F and morphisms p, :F-^A, (i= 1,2) is 
the product (or direct product) of A, and A2 if 
for any pair of morphisms /• : A ^ A,- (i — 1 , 2), 
there exists a unique morphism /: A-+F with 
PiOf = fi (' = L2) (Fig. 4). If(F',p'i,p2) is an- 
other product of Aj and A2, then by virtue of 
this definition there is a unique morphism 
/:F-^F' such that p'^o f=Pi (i = l,2), and / is 
an isomorphism. The product is unique in this 
sense. The product (or any one of the prod- 
ucts) of Aj and A2 is denoted by A, x A2 or 

hyX,nX2. 




Fig. 4 

The product in the categories of sets, of 
groups, of rings, and of topological spaces 
coincides with the notion of '•'direct product in 
the respective systems. In a general category, 
the product does not always exist. Suppose the 
product A X A exists for an object A; then 
there is a unique morphism A^^: A^A x A such 
that 1 jy = p 1 o A;^- = P2 o which is called the 
diagonal morphism of A. Let/: A^Al (1 = 1,2) 
be morphisms and assume that the products 
(A, x A2,P|,P2X (2f'i X A'2 ,p'j,P2) exist. Then 
there is a unique morphism /: A^ x A2-+ 

X\ X X '2 satisfying p\o f =/op,- (i= 1,2). This 
/ is denoted by /i x/2. On the other hand, if 
g^'.X ^Xj{i= 1,2) are given, the unique mor- 
phism 0; A->Aj X Aj with p,O0 = 0,- (i= 1, 2) is 
denoted by (01,02)- We have (0i,02) = (0i x 
02) o Ajf if A X A exists. 

The dual notion of product is coproduct. 

We say that a triple (SJ 1 J 2 ) of an object S and 
morphisms /: A,“>S (1= 1,2) is the coproduct 
(or direct sum) of Aj and A2 if for any mor- 
phisms /■: A,-»A (i= 1,2) there exists a unique 
morphism /:S-> A with /o/-=/ (i= 1,2) 

(Fig. 5). The coproduct, like the product, is 
uniquely determined up to canonical iso- 
morphisms. It is denoted by Aj -f A2 or by 
Aj U Aj. The coproduct in (Gr) is the tfree 
product. In (Ab), or more generally in the 
product of two objects can be identified with 
the coproduct ( = direct sum) (^ 277 Modules 
F). The coproduct in the category of com- 
mutative rings is the •^tensor product over Z. 




Product and coproduct can also be defined 
for a family {A,},g; of objects. Namely, the 
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product of is an object P together with 

a family of morphisms Pi -P^X^ (iel) having 
the property that for any family of morphisms 
fi'.X-yXf (iel), there exists a unique morphism 
/ :X^P such that PiOf—fi (iel). The product 
is unique up to canonical isomorphisms, and 
similarly for the coproduct Sections F, L). 

F. Dual Category 

In the theory of categories we often encounter 
the dual treatment of notions and propo* 
sitions. To be precise, we may define the no- 
tion of the dual category of a category ^ as 
follows: The objects of are those of i.e., 
Oh(^°) ~ Ob(^}; for any objects X and Y we 
put Hom,,„(A:, T) = Horn,, ( T, X)- iff:X^Y 
and g: Y^Z in (i.e.,/ : Y^X and g\Z-^ 

Y in ^), then the composite go fin is de- 
fined to be f og in It is clear that then 
satisfies the axioms of a category. Quite gener- 
ally, given a proposition concerning objects 
and morphisms we can construct another 
proposition by reversing the directions of the 
morphisms, and we call the latter the dual 
proposition of the former. The dual propo- 
sition of a proposition in coincides with a 
proposition in For instance, a monomor- 
phism (epimorphism) in ^ is an epimorphism 
(monomorphism) in and the final (initial) 
object in is the initial (final) object in 
The product (coproduct) in ^ is the coproduct 
(product) in Although the notion of the 
dual category is defined quite formally, it is 
useful in describing relations between specific 
categories. The dual category of (Ab), for 
instance, is equivalent to the category of 
commutative compact topological groups 
(tPontryagin’s duality theorem). 

G. Categories over an Object 

Fix a category and an object Se^. A pair 
(X,f) of an object Xg^^ and a morphism / :X 
-^S is called an object over S or an S-object, 
and / is called its structure morphism. We 
often omit / and simply say “an 5-object X” if 
there is no danger of misunderstanding. If 
(X,/) and { Y,g) are 5-objects, a morphism 
h'.X^Y such that f = goh is called an 5- 
morphism from (X,f) to (Y,g). The category 
whose objects are the 5-objects and whose 
morphisms are the 5-morphisms is called the 
category of S-objects in and is denoted by 
'^75. It has (5, Is) as the final object. The prod- 
uct of two S-objects X and Y, taken in '^/S, is 
called the fiber product of X and Y over S (in 
^), and is denoted by X Xj T or Alls Y. The 
dual notion of the fiber product is called fiber 
sum (or amalgamated sum). Thus for two 
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morphisms f:S-*X and g:S-^Y, the fiber 
product of X and Y over 5 in ^°/S is the fiber 
sum of X and Y (with respect to 5); it is de- 
noted by X U, Y. 

Let ^ be the category of commutative rings 
and Ke'^. Then the family of A-objects in 
is precisely the family of commutative K- 
algebras. The fiber product A Xf^B in i.e., 

A B in is the tensor product A B of 
algebras. 

H. Functors 

Let ^ and be categories. A covariant functor 
F from ‘i? to is a rule which associates (1) 
with each object A in an object F(A) in 
and (2) with each morphism / : A ^ T in 
a morphism f(/):f(A)^f (T) such that 
F(g of)^F(g)o F(f), F(\^) = i A con- 
travariant functor is defined dually, by mod- 
ifying this definition to F(f):FiY)-fF{X), 
F(gof) = F{f)oF(g). Thus a contra variant 
functor from 'if to is the same as a covariant 
functor from the dual category to (or 
from to Functor is a general term for 
both covariant functors and contravariant 
functors, but some authors use the word ex- 
clusively in the sense of a covariant functor. 

A functor in several variables is defined to be 
a functor from the product category of the 
categories in which the variables take their 
values. 

A CO variant functor F:^~*^' is said to be 
faithful (fully faithful) if for any A, Ye^, the 
mapping Hom(A, T)-»Hom(F(A), F(T)) in- 
duced by F is injective (bijective), and simi- 
larly for contravariant functors. A faithful 
covariant functor F:<^-^<^' which maps dis- 
tinct objects of ^ to distinct objects of is 
called an embedding, and in this case can 
be identified with a subcategory of by F. 

A fully faithful covariant functor F\^-^^' is 
called an equivalence (between the categories) if 
it satisfies the condition that for any object A' 
of there exists an object A of such that 
F(A) = A'. In this case we can consider the 
two categories essentially the same. A con- 
travariant functor from ^ to which defines 
an equivalence from to is called an 
antiequivalence. 

I. Examples of Functors 

(1) Let be the category of groups (or rings). 
For any A let F(A) be the underlying set of 
A (i.e., the set obtained from A by “forgetting” 
its structure as a group or ring), and for any 
homomorphism / put F(f)—f Then we get 
a faithful covariant functor (often called the 
forgetful functor) F\^£ ^(Sets). 
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(2) Let be any category and fix an object 

X of Then we get a co variant functor hx'^ 
^{Sets) as follows: With each we associ- 
ate the set Hom(X, Y), and with each mor- 
phism /: Y->- Y' in the mapping / o (where 

/ o:Hom(X, Y)-+Hom(2f, Y') is defined by 

if o){g)~f og). Similarly we define a con- 
travariant functor ^(Sets) by h^{Y) = 
Hom(Y,A:) and h^if)^ of. 

(3) Let p -.A-i’Bhe a homomorphism of 
rings. With each left ^-module M associate 
the tscalar extension p*{M) = B M, and 
with each ^-homomorphism / associate the 
B-homomorphism p*{f) = 1b ® / Then we get 
a CO variant functor p * : 

(4) Let K be a ring. With each left B-module 

M associate its dual module M* = Hom^(M, 
R), and to each B-linear mapping / associate 
its tdual mapping f —of. Then we get a con- 
travariant functor and similarly for 

.M ^ . 

(5) For each differentiable manifold X let 

f (X) denote the commutative ring of the dif- 
ferentiable functions on X, and for each dif- 
ferentiable mapping Y let F{f) be the 

ring homomorphism of :F(Y)^F{X). Then F 
is a faithful contravariant functor. 

(6) Fix an Abelian group A. By associating 
with each topological space X the cohomology 
group H{X, A) and with each continuous map- 
ping f -.X^Y the homomorphism H{ Y, A)^ 
H{X, A) induced by f we obtain a contrava- 
riant functor from (Top) to (Ab). 

(7) Fix a topological space X, and let T{X) 
be the set of the open sets in X. Then T{X) is 
ordered by inclusion, so it is a category (^ 
Section B, no. 7). The contravariant functors 
from T{X) to (Ab) are precisely the tpre- 
sheaves of Abelian groups over X. We can use 
any category instead of (Ab) to define a pre- 
sheaf over X (-" 383 Sheaves). 



J. Natural Transformations 



Let ^ and be categories, and denote by 
Hom('^,'^') the collection of all covariant 
functors Let F, A nat- 

ural transformation (or functorial morphism) 
from F to G is a function which assigns to 
each object X of a morphism (p{X):F{X)->^ 
G(X) in such that for any morphism f:X~* 
Yin the equation G{f)o(p{X) = (p{Y)oF{f) 
holds; in other words, the accompanying dia- 
gram is commutative: 



A 

d 

Y 



-p(A') 

F{X) G{X) 






F{Y) 



<p(y) 



G(Y) 



A natural transformation between contrava- 
riant functors is defined similarly. For in- 
stance, let A and B be Abelian groups and let 
H'(- , A) and Hf -,B)he the contravariant 
functors of '•'cohomology viewed as functors 
(Top)->(Sets). Then the natural transform- 
ations between them are the tcohomology 
operations. 

Let (p:F-*G he a natural transformation, 
and suppose that <p(A): F(A)^G(A) is an 
isomorphism for every Xe^. Then the inverse 
transformation G^F of cp exists, and (p is 
called a natural equivalence (functorial isomor- 
phism or isomorphism) and is written cp-.F^yG. 

Suppose that Ob(^) is a set. Then the collec- 
tion Hom(F, G) of all natural transformations 
F->G is also a set, and hence we can consider 
Hom('^, ^') a category in which the objects are 
the covariant functors the morphisms 

are the natural transformations, and the com- 
position of morphisms is the natural one. Then 
Hom('^°,'^') is the category of contravariant 
functors from to In particular, the cate- 
gory Hom('^°,(Sets)) is sometimes denoted 
by 

Given a category a covariant (resp. con- 
travariant) functor F:'^->(Sets), and an object 
Xe^,we can define a canonical bijection 
Hom(/ij-, F)^F(A) (resp. Hom(/i'^, F)o:y 
F(X)) by (The functors hx 

and were defined in Section I.) The inverse 
mapping of assigns to ^eF(X) the natural 
transformation cp'.hx^F defined by <p{Y)u = 
F{u)^{Ye^). In particular, if we take F = 
hy(h^X we obtain a canonical bijection 
Uomihx, hy)o^Hom{Y, X) (Hom(/i^ 

Hom(A, Y)). It follows that there is a fully 
faithful contravariant (covariant) functor 
Horn (<^, (Sets)) ^ Hom(<i^ °, (Sets)) = ^) 

which associates hx(h^) with X e^. 



K. Adjoint Functors 

Let F’.'i ^ and F':'^'->'^ be co variant 
functors. Suppose that there is a rule which 
assigns to each pair of objects and 

a bijective mapping Ojv, A^ :Hom,^,(M, 
F'(M'))^Hom,^.(F(M), M') such that for any 
pair of morphisms in ^ and 

in the following diagram induced by the 
morphisms is commutative: 

Home(M, F'(M')) Home-(F(M ), M') 

i i 

Homg(A,F'(A')) Home-(F(A), A') 

Then we say that F is a left adjoint functor of 
F' and that F' is a right adjoint functor of F. 
We can regard Horn, ^.(F(M), M') as a functor 
from X (contravariant in the variable 




207 



52 M 

Categories and Functors 



Me^, CO variant in the variable M'e^') to the 
category of sets, and similarly for Hom.^(M, 
F'(M')). This commutativity of the diagram 
means that these two functors are isomorphic 
Section J). 

For instance, let A and B be rings and L a 
fixed B-/l-tbimodule. Let and 

F ' ; be the functors defined by 

F{M) = L®^M, F'(M') = HomB(L,M'), 

where the assignment for morphisms is defined 
in the natural way. Then F is the left adjoint of 
F' and F' is the right adjoint of F. In parti- 
cular, let p: /I ^5 be a homomorphism and 
consider the case L = B. Then F is the functor 
and F' is the functor 

A-yM, so that p* is the left adjoint of p^ (and 
p^ is the right adjoint of p*) 111 Modules, 

K, L; for more examples of adjoint functors — ^ 

[ 11 ]). 



L. Representation of Functors 

We begin by discussing an example. Let T be 
a set, and consider the following problem: Is 
it possible to find a group X and a mapping 
i :T->^X such that, for any group Y and for 
any mapping rp.T^Y, there exists a unique 
homomorphism u:X-*Y with uo^ = rjl The 
answer is yes; it is enough to take the Tree 
group X generated by T and the canonical 
injection ^:T^X (Fig. 6). On the other hand, 
let F{Y) be the set of all mappings T-^Y, and 
for each group homomorphism / : Y' 

define the mapping F{f):F{Y)^F{Y') by 
P{f )^=f ('?eF(Y)). Then we get a co- 

variant functor from the category ^ of groups 
to the category of sets, f :^->(Sets). We can 
now reformulate the condition on Xe'i^ and 
^ e F{X) as follows: 

For any Ye^£ and for any t]eF{Y), there 
exists a unique morphism u ; X -> Y such that 



Y 

Fig. 6 

Proceeding to the general case, let be an 
arbitrary category and let F ;'^->(Sets) be a 
functor. If there exist an object X of ^ and an 
element ^ of F(X) satisfying the condition just 
stated (with the modification u: Y-*X in the 
contravariant case), then we say that the pair 
(X, represents the functor F, or less specifi- 
cally, that X represents F, and we call ^ the 
canonical element of F(X). We also say that F 
is representable. The condition stated above is 
a formulation of the so-called universal map- 
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ping property. If (X', ^') also represents F, the 
unique morphism u:X^X' (or X'^X) with 
— is necessarily an isomorphism. 

When (X, represents F, the natural trans- 
formation (h^^F in the contrava- 

riant case) which corresponds to ^ by the 
canonical bijection d)^:Hom(fi;f,F):::::tF(X) is 
an isomorphism. Conversely, if there is a func- 
torial isomorphism (p:hx~^F (or h^-^F) for 
some Xe^, then the object X represents F, 
with the canonical element of F(X) the element 
which corresponds to (p by the canonical bijec- 
tion i.e., ^ = (p{X)\x- 

We have already seen the example of a free 
group; here we list a few more examples. (1) 

Let {X,.},.g; be a family of objects in a category 

For each Ye'^ we put F(Y) = n,-e/Hom( Y, 
X;), and for each morphism /: Y^ Y' we define 
the mapping F{f)-.F{Y')^F{Y) by F(/)(/.) = 
(fo f). Then we get a contravariant functor 
F:‘^^(Sets). A pair (X, i^) which represents 
F (where ^ e F(X) = II, e; Hom(X, XJ) is the 
product of {X;}. Thus representability of F is 
equivalent to the existence of the product of 
{Xj, and similarly for the coproduct. 

(2) Let R be a ring, M a right R-module, and 

N a left R-module. For each Abelian group Y 
let F(Y) denote the set of the R-balanced map- 
pings M X Y (^ 277 Modules J). Since a 
homomorphism f'.Y^Y' induces a natural 
mapping F(/);F(Y)->F(Y') by composition, 
we obtain a covariant functor F:(Ab)->(Sets). 
This functor is representable: the pair consist- 
ing of the tensor product M 0^ N and the 
canonical mapping M x 0 rX repre- 

sent it. 

(3) Let R be a commutative ring and S a 
subset of R. For each commutative ring Y, let 
F( Y) denote the set of homomorphisms R-*Y 
that map the elements of S to invertible ele- 
ments of Y. As in the preceding example, we 
obtain a covariant functor F: (Rings) (Sets). 
This functor is represented by the tring of 
quotients S ^R and the canonical homomor- 
phism R->5”'R. 



M. Groups in a Category 

Let be a category with a final object e, and 
assume that a finite product always exists in 
If an object Ge'if and morphisms cr.Gx.G-* 

G, )S:G-»G, e:e->G are given such that the 
the diagrams of Fig. 7 are commutative, then 
(G,a, /?,e) is called a group in (group object in 
or '^-group). 

If ^ is the category of sets, then a defines a 
law of composition in the set G, and the image 
of e by e is the identity element and j9(x) is the 
inverse of x, so that G is an ordinary group. If 
is the category of topological spaces (analy- 
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tic manifolds, algebraic varieties, tschemes) 
then G is a '•'topological group (•'Lie group, 
talgebraic group, '•'group scheme). 

We can also define the 'i^-group by lifting 
the group concept in (Sets) to the category 
by means of the functor h^. Namely, let G be 
an object of and suppose that for each 
the set /i^(y) = Hom(y, G) is equipped with a 
group structure and that for each morphism 
/ : Y' the induced mapping h^{Y')^h^{Y) 

is a group homomorphism. In other words, 
suppose that is a contravariant functor 
from ^ to the category of groups. Then the 
object G with the additional structure on is 
called a '^-group. This definition is equivalent 
to the one given above. 



GXGXG — GXG 

l,Xo| 

GXG ; — -G 





Fig. 7 



N. Additive Categories 

A category ^ is called an additive category if 
for each pair X, Ye^, the set of morphisms 
Hom(2f, 7) has the structure of an additive 
group such that (1) the composition of mor- 
phisms is distributive in both ways; ho{f +g) 

= hof +hog,{f +g)oh — f oh + goh-,{2) 
there exists an object 0' with Hom(0',0') = {0}; 
(3) the product (or the coproduct) of any two 
objects exists. Then the object 0' in (2) is a final 
and initial object, and is called the zero object. 
Both the product and the coproduct of any 
two objects exist and can be identified. The 
dual category of an additive category is also 
an additive category. A functor F from an 
additive category to another is called an addi- 
tive functor if F(f +g) = F(f) + F{g) holds 
for morphisms. In an additive category 
Hom(A, T) is an additive functor from to 
(Ab) in each variable. 

For any ring R, the category of left (or right) 
/^-modules is an additive category. The follow- 
ing definitions are generalizations of the corre- 
sponding concepts in the theory of modules. 
The kernel of a morphism /: A ->B is a pair 
consisting of an object A' and a monomor- 
phism i:A'^A with f oi — 0, such that any 
morphism u:X^A with /o u = 0 is divisible 
by i (that is, w = for some v:X-^A'). Dual- 
ly, the cokernel of / is a pair consisting of 
an object B' and an epimorphism 
with _;o/=0 which divides any morphism 
u:B-^X with uof =0. We write A' = Ker /, B' 



= Coker /The kernel of / B ^ Coker / is 
called the image of / and is denoted by Im/; 
the cokernel of i:Kerf-*A is called the 
coimage of / and is denoted by Coim f If all 
these exist, it follows from the definitions that 
there is a unique morphism Coim / ->Im / 
such that the composite of A->Coim/ ^Im / 
~*B is equala to / 

An additive category is called an Abelian 
category if it satisfies the following conditions: 

(1) every morphism has a kernel and a coker- 
nel, (2) for every morphism / the morphism 
Coim / ^Im/ just mentioned is an isomor- 
phism. The dual category of an Abelian cate- 
gory is also Abelian. The categories of Abel- 
ian groups, of K-modules, and of sheaves 
of C-modules on a tringed space {X, G) are 
important examples of Abelian categories. 
Many propositions which are valaid in (Ab) 
remain valid in any Abelian category. In parti- 
cular, the notion of an +exact sequence is de- 
fined in an Abelian category in the same way 
as in (Ab), and the fiber product and fiber sum 
of a finite number of objects always exist in an 
Abelian category. A functor between Abelian 
categories which carries exact sequences into 
exact sequences is called an exact functor; 

(such a functor is automatically additive). If 'if 
is a category of which Ob('^) is a set and if 
is an Abelian category, then Hom('^,'^') is an 
Abelian category. Given an Abelian category 
'if and a subcategory which satisfies certain 
conditions, one can construct an Abelian 
category which is called the quotient 
category (Serre’s theory of classes of Abelian 
groups; — [8]). 

If is an Abelian category of which Ob('if) 
is a set, there is an embedding of into the 
category of modules over some ring R by 
a fully faithful flat exact covariant functor (full 
embedding theorem, B. Mitchell, Amer. J. 
Math., 86 (1964)). This remarkable theorem 
enables us to extend results obtained for 
modules to the case of Abelian categories. 

The notions of category and functor were 
introduced in [7] and were applied first in 
topology and then in homological algebra and 
algebraic geometry (^ 200 Homological 
Algebra). 
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Cauchy, Augustin Louis 

The French mathematician Augustin Louis 
Cauchy (August 21, 1789-May 25, 1857) 
graduated from the Ecole Polytechnique in 
1807 and from the Ecole des Ponts et Chaus- 
sees in 1810, to become a civil engineer. In 
1816, his mathematical works were recognized, 
and he was appointed a member of the Aca- 
demie des Sciences while a professor at the 
Ecole Polytechnique. After the July revolution 
in 1 830, he refused to pledge loyalty to Louis- 
Philippe and fled to Turin; he later moved to 
Prague. He returned to France after the revo- 
lution of 1848 and became a professor at the 
LFniversity of Paris, where he remained until 
his death. He was a Catholic and a Royalist all 
his life. 

His scientific contributions were numer- 
ous and covered many fields. In algebra, he 
did pioneering work in determinants and in 
the theory of tgroups. He also made notable 
achievements in theoretical physics, optics, 
and the theory of elasticity. His main field 
was analysis. He was interested in making 
analysis rigorous by giving calculus a solid 
foundation in such works as Cours d’analyse 
de I’Ecole Polytechnique (1821). In his paper 
“Memoire sur les integrates definies prises 
entre les limites imaginaires” (1825), he proved 
the main theorem of the theory of functions of 
a complex variable. Another important work 
is his proof of the existence theorem for the 
solutions of differential equations in the cases 
of real and complex variables 
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Cayley Algebras 

Let Q be a tquaternion algebra over a field K 
of characteristic zero. A general Cayley algebra 
^ is a 2-dimensional Q-tmodule Q + Qe with 
the multiplication (<? -t- re) (s -I- te) = (qs + ylr) + 
{tq + rs)e, where q, r, s,teQ, y is a given ele- 
ment in K, and T, s are the ^conjugate quater- 
nions of t, s, respectively. The elements of 
are called Cayley numbers; is a nonassocia- 
tive, talternative algebra of dimension 8 over 
K (^ 231 Jordan Algebras). The map a = 
q + re-^d = q~re is an ^antiautomorphism 
of Define two maps by N [a) = aa — aa 

(norm of a) and T{a) — a-\-a (trace of a). Then 
every a in satisfies the equation — T(a)x + 
N{a) = 0. Furthermore, N{ab) = N{a)N{b) for 
a, b in The tquadratic form N{x) — T{xx)/2 
characterizes In particular, any two (non- 
associative) general Cayley algebras over the 
same field K which are not talternative fields 
are isomorphic. 

In order for ^ to be an alternative field, 
either of the following two conditions is neces- 
sary and sufficient; (i) N{a) — 0 implies a — 0; 

(ii) Q is a noncommutative division algebra 
and y cannot be expressed in the form — 

{a, f/,(eK). (For the mean- 

ing of fx with respect to Q 29 Associative 
Algebras D.) Every alternative field of finite 
dimension is a general Cayley algebra. 

In particular, when Q is the tquaternion 
field over the real number field with k — ix — 

— 1, the general Cayley algebra over Q with 
y — — 1 is called the Cayley algebra. When K 
is an talgebraic number field of finite degree, 
there are only a finite number of nonisomor- 
phic general Cayley algebras over K. 

The Lie algebra of all tderivations of a 
general Cayley algebra is a ^simple Lie 
algebra of type G 2 .It K is the real number 
field, the identity component of the group of 
all ^automorphisms of the Cayley algebra ^ is 
a compact simply connected ^^simple Lie group 
of type G 2 . The Cayley algebra is the unique 
alternative field over the real number field K. 
This last fact is important because of the fol- 
lowing proposition: In the theory of ^non- 
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Desarguesian projective planes, the field which 
gives rise to the coordinates is an alternative 
field. Let 2 P 2 be the set of all 3 x 3 tHermitian 
matrices A over the Cayley algebra such 
that trv4 = 1, A^ = A. Then we can define a 
structure of a projective plane on fiPj, which 
with this structure is called the Cayley projec- 
tive plane. Furthermore, let 3 be the set of all 
3x3 Hermitian matrices over with a multi- 
plication in 3 defined by A - A -\- 
AB). The identity component G of the group 
of all automorphisms of 3 is a compact simply 
connected simple Lie group of type F 4 . This 
group G acts on ifFj transitively, and 2 P 2 = 
G/Spin(9) (-^ 249 Lie Groups; Appendix A, 
Table 5.III). 
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Celestial Mechanics 

A. General Remarks 

The motions of planets, comets, the moon, and 
satellites in our solar system are the main 
topics in celestial mechanics (^ 392 Spherical 
Astronomy). However, studies in this subject 
can also include motions of fixed and binary 
stars in our galaxy, equilibrium figures of 
celestial bodies, and rotational motions of the 
earth and the moon. 

Celestial mechanics is usually based on 
tNewtonian mechanics, with the effects of 
tgeneral relativity sometimes taken into ac- 
count to determine corrections to the orbits of 
celestial bodies. Therefore the main task of 
celestial mechanics is to solve differential 
equations of motion based on Newtonian 
mechanics. However, since the equations for 
the problem of n > 2 bodies cannot be solved 
rigorously (-^ 420 Three-Body Problem), 
appropriate methods have been devised where- 
by we may obtain approximate solutions of 
the equations with accuracy comparable to 
that of observations. 

The two-body problem, which concerns the 
behavior of two celestial bodies regarded as 



points exerting mutual interactions, can be 
reduced to a one-body problem with reference 
to a central force, since integrals of motion of 
the center of gravity for the system exist. The 
tHamilton-Jacobi equation for the one-body 
problem is of ^separable type and can be 
solved completely. The orbit for the two-body 
problem is a tconic with one of its Toci at the 
center of gravity. The majority of celestial 
bodies in the solar system actually perform 
elliptic motions. tKepler’s orbital elements for 
elliptic motion are functions of the integration 
constants in the solution of the Hamilton- 
Jacobi equation and are determined by the 
initial conditions. 



B. Perturbations 

In studying the tn-body problem, we first solve 
certain two-body problems and then apply the 
method of ^perturbations, i.e., the method of 
variation of constants, in order to obtain solu- 
tions developed as tpower series of small 
parameters. The parameters are ratios of the 
masses of planets to that of the sun for plane- 
tary motions and the ratio of the geocentric 
lunar distance to the solar distance for lunar 
motion. 

Electronic computers have made it possible 
to compute planetary coordinates for long 
intervals of time by solving numerically dif- 
ferential equations of motion including all 
possible interactions. However, in discuss- 
ing the stability of the solar system, analytic 
methods are more effective, particularly the 
method of obtaining secular perturbations by 
eliminating short-periodic terms by canonical 
transformations. This is one of the averaging 
methods of solving differential equations. 
However, as the solutions obtained by means 
of perturbation methods are not always con- 
vergent, most important problems related 
to the stability of motion have not yet been 
solved rigorously. Secular perturbations for 
planetary motions can be derived by solving 
differential equations that are linearized by 
neglecting cubic powers of orbital eccentric- 
ities and inclinations to the ecliptic (^ 309 
Orbit Determination), which are small quan- 
tities. The teigenvalues of these linear differen- 
tial equations correspond to mean angular 
velocities of the perihelion and the ascending 
node. The equations for the eigenvalues are 
called secular equations. 

C. Artiflcial Satellites 

To discuss motions of artificial satellites close 
to the earth, the latter cannot be regarded as a 
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point or as a sphere but must be assumed to 
be an oblate spheroid, i.e., an tellipsoid of 
revolution. The effects of oblateness on the 
motion of satellites can be derived as pertur- 
bations of the theoretical elliptic motions ob- 
tained as the solutions of this two-body prob- 
lem under the assumption that the earth is 
spherical. Also, by utilizing a special potential 
very close to the geopotential, we can find a 
Hamilton-Jacobi equation of separable type 
which is solvable. This special potential ap- 
pears in the problem of two fixed centers with 
equal masses situated on an imaginary axis. 
When the geopotential is assumed to be axi- 
ally symmetric, the equations of motion for the 
satellites have two degrees of freedom; there- 
fore there appear two fundamental frequencies 
related to the special potential. When these 
two frequencies are equal, the problem is 
called a critical inclination problem and is 
important from the mathematical point of 
view. Theories for satellites can be applied to 
the motions of fixed stars in the galaxy. 

D. Equilibrium Figures 

There is a large literature concerning equilib- 
rium figures and the stabilities of celestial 
bodies assumed to consist of spinning fluids. 
The two-body problem with tidal interactions 
is particularly important; problems concerning 
the evolution of the earth-moon system are 
special cases of such a problem. 

The theory of rotation of the earth as it is 
affected by tprecession, tnutation, and latitude 
variations is also a part of celestial mechanics; 
for this theory, elastic theory and geophysics 
are applied. 

For the n-body problem ^ 420 Three-Body 
Problem. 
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Characteristic Classes 

A, General Remarks 

The theory of characteristic classes arose from 
the problem of whether or not there exists a 
ttangent r-frame field on a tdifferentiable 
manifold (E. Stiefel [5]) (-*' 105 Differentiable 
Manifolds). The importance of the character- 
istic class as a fundamental invariant of '^vec- 
tor bundle structure is now fully recognized 
114 Differential Topology, 147 Fiber 
Bundles, 427 Topology of Lie Groups and 
Homogeneous Spaces). 

B. Stiefel- Whitney Classes 

Let (^ = (£, B, R") be an n-dimensional real 
^vector bundle (called an R"-bundle) with 
tparacompact Hausdorff tbase space B, fiber 
R”, and torthogonal group 0{n) as the tstruc- 
ture group. Then the element w,((^) of the i- 
dimensional tcohomology group //'(B; Z 2 ) 

(i = 1, 2, . . . , n) of the base space B with coeffi- 
cient in Z 2 = Z/2Z, called the i-dimensional (or 
ith) Stiefel- Whitney class, and the element w((^) 
== 1 + vvi ((^) -f- . . . -t- w„((^) of the tcohomology 
ring //*(B; Z 2 ), called the total Stiefel- Whitney 
class, are defined as follows. First, we deal with 
the case n = 1. We call the infinite-dimensional 
real projective space the tinductive limit P”(R) 
= limP"(R) of the finite-dimensional real tpro- 
jective space P"(R). The nontrivial tfine bundle 
)>i over the infinite-dimensional real projective 
space P'^(R) is a universal R^bundle (tuniver- 
sal bundle for the orthogonal group 0(1)); 
therefore any R^ -bundle t^ = (£, B, R') is equiv- 
alent to an tinduced bundle from the universal 
bundle by a tclassifying mapping 
P-(R):^^/^* We define the 1 -dimensional 
universal Stiefel- Whitney class w,(y,) to be 
the generator of //*(P®(R); Z 2 ) and set w,(<^) = 
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/^*Wj(y,). For general n, we consider the prin- 
cipal 0(n)-bundle (P, B, 0{n)) tassociated with 
the given R"-bundle ^^{E, B, R"). Let Q„ be 
the subgroup of 0(n) consisting of all diag- 
onal matrices. Then the orbit space Y ~ P/Q„ 
is the base space of the ^'principal Q„-bundle 
Let p: Y-yB^ P/0(n) be the 
natural projection. Then the R"-bundle 
over Y induced by p is associated with rj and 
is equivalent to the ^Whitney sum of n line 
bundles (F. Hirzebruch [4]), = ©...© 

4. Moreover, the homomorphism p*:H*{B; 
Z 2 )-*H*{Y,Z 2 ) is injective (A. Borel [1]). 
Therefore we can uniquely define the total 
Stiefel-Whitney class w(<^) of the R"-bundle by 
the relation p*w((^) = w(^i)... w(<^„). The Stiefel- 
Whitney classes defined above are compatible 
with bundle mappings /:vv(/*<f)=/*w{^) (-- 
147 Fiber Bundles M). The Stiefel-Whitney 
class w,(y„)efP(B(,(„);Z2) (1 ^i^n) of the 
universal R"-bundle y„ over the tclassifying 
space is called the i-dimensional univer- 
sal Stiefel-Whitney class. For the Whitney sum 
of vector bundles, we have w{^ © r]) = w{^)w{t}). 

In order for an R"-bundle ^ to be torient- 
able, namely, for ^ to have an SOjuj-structure, 
it is necessary and sufficient that Wi(^) = 

0. For an oriented R"-bundle the Euler- 
Poincare class X„{^) is defined to be the tpri- 
mary obstruction X„{^)eH''{B-,Z) for con- 
structing a ■''cross section of the ''associated 
(«— l)-sphere bundle. In particular, the Euler- 
Poincare class of the universal bundle for 
SO(n) is called the universal Euler-Poincare 
class. X„{Q mod 2 is equal to w„((^). If n is odd, 
we have 2A„((^) = 0. 

C. Chern Classes 

We consider an n-dimensional complex vector 
bundle (o — {E,B, C") (called C"-bundle in the 
following) with a paracompact Hausdorff base 
space B, fiber C", and unitary group U {n) as 
the structure group. The cohomology class 
Ciiw) eH^‘(B; Z) (i = 1 , 2, . . . , n), called the 21- 
dimensional (or ith) Chern class, and the total 
Chern class c(cy)^ 1 -f c,(ct))-|- ... -|-c„(co)e 
H*{B,Z) are defined as follows. In the case 
n = 1 , let C"^ = lim C" (the inductive limit of 
the ■'complex Euclidean spaces C"), S'® be 
its unit sphere, and P*(C) be the infinite- 
dimensional complex projective space consist- 
ing of all complex lines through the orgin 0 of 
C®. Then the natural mapping S®-»-P®(C) 
defines a universal principal C(l)-bundle 
(S®, P®(C), 1/(1)). Let 7, be its associated 
universal C^bundle. Then we define the 1- 
dimensional universal Chern class Ci(y])e 
H^(P®(C); Z) to be the cohomology class that 
takes the value —1 on the cycle S^(:^P*(C)c 



P®(C)) with the natural orientation. Since a 
general C^bundle (^ = (£,B, C’) is induced 
from by a classifying mapping ffB^ 

P“(C), we set Cii ^)= When n> 1, let 
{P,B, U{n)) be the principal C(n)-bundle asso- 
ciated with the given C”-bundle ^ = (£, B, C”). 
Let T„ be the subgroup of U(n) consisting of 
all diagonal matrices (which is a ^maximal 
torus of U («)). Then the quotient space Y — 
PjT„ is the base space of the principal T„- 
bundle i 7 = (P,F/T„, T„). Let p: Y-yB = P/U{n) 
be the natural projection. Then the C"-bundle 
p*^ over Y is associated with rj and is equiva- 
lent to the twhitney sum of n complex line 
bundles: p* ^ ^ 1 © . . . © i^„. Moreover, 
p*:H*{B; Z)^H*{Y;Z) is a monomorphism 
(Borel [1], Hirzebruch [4]). Therefore we can 
uniquely define the total Chern class c{^) of the 
C"-bundle ^ by the relation p*c(i^) = c(^i) ... 
c((^„). The Chern classes, as defined above, 
are compatible with bundle mappings / (i.e., 
c{ f*^)=f*c{^)) (- 147 Fiber Bundles N). 

The Chern class Ci(y„)G//^‘(B[;(„);Z) (1 ^/^n) 
of the universal C"-bundle y„ over B^J^„^ is 
called the 2?-dimensional universal Chern class. 
Let <^, r] be complex vector bundles over B. 
Then we have c{^ @rj) = c{Q c{rj). By the nat- 
ural inclusion U(n)<= SO(2n), we can identify 
a C”-bundle a> with an oriented R^"-bundle 
cog. Then we have c,(m)mod2 = W2,(mR), 

W2i+\ (tt»R) = 0 (i - 0, 1 , ■ • • , «), c„(o)) - A'2 „(o;r). 

Examples. Let (S^”"^^P"(C), 1/(1)) be the tHopf 
bundle, and let y" be the associated complex 
line bundle. Then the classifying mapping of y" 
is the natural inclusion P"(C)->BC(1) = P“(C), 
and c{y1)— 1 —g„, where p„gH^(P"(C); Z) is the 
cohomology class dual to the homology class 
represented by the hyperplane P" '(C). On the 
other hand, for the complex line bundle = 
(P"“'(C)} determined by the ''divisor (-^ 72 
Complex Manifolds) P”“'(C)c;P"(C), we have 
c{^") = \ +g„, and y" are dual to each other. 

is called the canonical line bundle over the 
complex projective space. Moreover, the Whit- 
ney sum T © Si of the tangent bundle t(P"(C)) 
of the n-dimensional complex projective space 
P"(C) and the trivial C -bundle Sj is equivalent 
to the Whitney sum of (n+ 1) copies of 
Therefore we have c(t(P"(C)))=:c((^i © <^^ © 

•••©^r) = (l-f0j"*. 

D. Pontryagin Classes 

Utilizing the inclusion mapping 0(n)c= U (n) of 
Structure groups, we can associate a C"-bundle 
<^c = <^ © with an R"-bundle We 

define the 4j-dimensional (or ith) Pontryagin 
class of the R"-bundle ^ by p,(^) = ( — l)‘c2,(i^c) 
(e H^‘(B; Z)) (i = 1, 2, . . . , [n/2]) (Hirzebruch 
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[4]). In particular, the Pontryagin classes of 
the universal bundle for 0(n) are called the 
universal Pontryagin classes. We also define 
the total Pontryagin classes p(^)= 1 +Pi(<^) + 
••• + P[n/2](0- We have 2 c 2 .-+i(^c) = 0- For R"- 
bundles and t], p{^ 0 ri) = p{^)p{r]) modulo 2 - 
torsion elements. We have (mod2) = 

and for the oriented 

R^"-bundle <^. Moreover, for a complex vector 
bundle oj, we have (Wu [ 8 ]) 

2k 
i = 0 

where we put Cq(q})=\. 

All such classes as defined in Sections B-D 
are called characteristic classes. 



E. Other Definitions of Characteristic Classes 

Axiomatic Definition. (1) For a C”-bundle ^ 
over a paracompact Hausdorff base space B, 
Chern classes (i^O) are de- 

fined, and we have Cq (0 = 1 , Cid) -0{i> n). ( 2 ) 
For the total Chern class we 

have c(f*i^)— f*c(<^) for each bundle mapping 
f. (3) For the Whitney sum, we have c(^ © 17 ) = 
c(^) ■ c(tj). (4) Normalization condition; For 
the canonical line bundle we have 
+ g„ (-^ Section C). We can verify the existence 
and uniqueness of c,((^) satisfying these four 
conditions, so Chern classes can be defined 
axiomatically by these conditions (Hirzebruch 
[4]). We can similarly defined Stiefel-Whitney 
classes axiomatically. 

Definition in Terms of Obstruction Classes. 

When the base space is a tCW complex, we 
can define the Chern class of a C"-bundle ^ 
over B as follows; Let F„,„_,+i(C)= t/(n)//„_,+i 
X l/fq — 1) be the tcomplex Stiefel manifold of 
all orthonormal (« — q-j- l)-frames in C" with 
Hermitian metric. Then K„ „_^+,(C) is ^{2q-2)- 
connected, and its ( 2 ^ — l)-dimensional 
thomotopy group 7 T 2 <,-i(F„,„ ,+i(C)) = Z. Let 
be the tassociated bundle of ^ with fiber 
F„.„-,+i(C). Then the tprimary obstruction 
(e//^’(B;Z)) to constructing a Across section of 
coincides with the Chern class Anal- 
ogously, we can interpret w^{^] for an R"- 
bundle ^ as an obstruction class 147 Fiber 
Bundles). 

Definition in Terms of Schubert Varieties. 

We denote by C*" the subspace defined by 
Zfc+i =z ^+2 = ...=z„+^ = 0 of the space C"+'^ = 
{(zi, ...,z„+^)|ZieC,i= 1 , ...,n-t-N}, and fix 
the sequence of subspaces C^ <= C^ <^ .. . <=C"^^. 
The set of all complex n-planes X through 
the origin 0 in C"'^''' forms the ^complex 
Grassmann manifold M„+jy „(C). We denote 
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by E{y^) the set of all pairs [X, v), where 
XGM„+iy JC) and u is a vector in X. Then 
we can define a 2N-universal complex n- 
dimensional vector bundle y„^ with base space 
K +iv.n(^X with total space E(y^) and projec- 
tion (A,i;)^A. Let « = (&»(!), ...,co(n)) be a 
sequence of integers satisfying the condition 
O^co(l):^ ...^co{n)^N. Then the set of all 
n-planes X cz C”^^ through the origin O satis- 
fying dim(A n = i, dim(A fl = 

i— 1 , 1 = 1, 2, ...,n, forms a real (22"=, co(i))- 
dimensional open tcell. The set of all these 
open cells gives a ^cellular subdivision of 
Af„+iv,„(C) as a CW complex. The closure 
of is a cellular subcomplex of „(C), 
called a Schubert variety. This is a ^pseudo- 
manifold with canonical orientation and 
represents a (2E?=i co(i))-dimensional inte- 
gral cycle, called a Schubert cycle. We denote 
^<0 t>y (cu(l), ...,o)(n)). All these homology 
classes form the basis of the homology group 
The cocycle dual to the cycle 
(0, . . . , 0 1, . . . , 1 ) represents the Chern class 
n-q 

c,(y„^)€7/2'*(M„+^,„(C);Z). For the real Grass- 
mann manifold „(R), we can analogously 
define the universal Stiefel-Whitney classes. 

Thom’s Definition. Let ^ be an R"-bundle over 
B, B^ be its tThom space, and UgH''{B^; Zj) 
be the tfundamental class of B^. Let j.B-^B^ 
be the inclusion induced from the zero cross 
section and <p://'‘(B;Z 2 )^//'‘‘^"(B^;Z 2 ) be the 
tThom-Gysin isomorphism. Then we have j* U 
= w„(c), <p~^{Sq^U) = Wii^) (O^i^n), where 
Sq‘ is the ‘’’Steenrod square (R. Thom [6]). 

Definition in Terms of Differential Forms. 

Let B be a tdifferentiable manifold and ^ — 
{P^,B, U{n)) be a differentiable principal U{n)- 
bundle over B. Let Q = (Q^^) be the tcurvature 
form corresponding to the tconnection form 
^ i, 7 = 1, ■ . . , n, on P^. Then is a 
complex- valued 2-form, and D,-, = — Qy,-. For 
the matrix Q, we consider the differential form 

</^ = X'A. = det|/ + (27ry^)-iQ|, 

Q 

where / is the unit matrix, the multiplication 
in the determinant is the texterior product, 
and ij/q is the part of degree 2q in i/r. Then ij/ is 
defined as a real form independent of the con- 
nection CO. We have (hp^ = 0, and the coho- 
mology class of { — in //^®{B;R) is the 
Chern class with real coefficients (Borel 
and Hirzebruch [2], S. S. Chern [3]). 

Definition in Terms of Symmetric Polynomials. 

427 Topology of Lie Groups and Homoge- 
neous Spaces.) 
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F. Characteristic Classes of Manifolds 

For a differentiable (complex or almost com- 
plex) manifold M, the characteristic classes of 
its tangent bundle are called characteristic 
classes of the manifold M. We shall denote 
Stiefel-Whitney classes, Pontryagin classes, 
Euler-Poincare classes, and Chern classes of M 
by vv;(M), Pi{M), and c^{M\ respec- 

tively. These are invariants of differentiable 
structures, orientations, or (almost) complex 
structures of a manifold M if M is a differ- 
entiable, oriented differential, or (almost) com- 
plex manifold. By the Stiefel-Whitney num- 
bers of an n-dimensional manifold M, we 
mean the values of n-dimensional monomials 
of Stiefel-Whitney classes of M on the fun- 
damental homology class {{w^{My^W 2 {MY^ ... 
w„(My") [M] G Zj , where -f 2r2 + . . . + nr„ = 
n, Tj ^ 0. We can define integer-valued Pon- 
tryagin numbers and Chern numbers similarly. 
These numbers are called generally character- 
istic numbers of the given manifold. In partic- 
ular, X„{M) [M] = ;c(M) is the tEuler-Poincare 
characteristic. 

In the case of topological manifolds, we 
can define characteristic classes in the follow- 
ing sense. Let M be a closed n-dimensional 
topological manifold and X" the genera- 
tor of H”{M; Z2). By defining 2C(y"~‘) = 
(X'y”-‘)[M]GZ2 for X'g//‘(M;Z 2), Y^-'e 
//""'(M; Z2), we have an isomorphism 
/T(M;Z2)^Hom(/f"-‘(M,Z2);Z2). The ele- 
ment UigH‘{M; Z2), corresponding to the 
homomorphism 7" under this 

isomorphism is called the Wu class of M, 
where Sq‘ is the tSteenrod square. Moreover, 
we call Wj = 'Zj^oSq^~‘UieH^(M; Z2) the Stiefel- 
Whitney class of the topological manifold M. 
Then for any tdifferentiable structure we 
have Wj(M,^) = Wj. Therefore Stiefel-Whitney 
classes of differentiable manifolds are topolog- 
ical invariants (more precisely homotopy type 
invariants) (Thom [6], W. T. Wu [8]). J. W. 
Milnor [9] proved that Pontryagin classes of 
differentiable manifolds are not topological 
invariants. The image of P;(M) by the homo- 
morphism Z)^H^‘(M; Q) induced by 

the inclusion Z <= Q (the rational number field) 
is called the rational Pontryagin class. In 1966, 
S. P. Novikov [10] proved the topological 
invariance of the rational Pontryagin class 
(^ Section H). 

G. Index Theorem for Differentiable 
Manifolds 

Let M be an oriented closed manifold of di- 
mension 4k. Putting f(x,y) = x- y [M] for 
elements x, y of the 2/c-dimensional real coho- 



mology group R), we obtain a bilinear 

form on f/^'‘(M; R). The tsignature of the 
quadratic form /(x, x) (namely, (number of 
positive terms) — (number of negative terms) in 
its canonical form) is a topological invariant 
(homotopy type invariant) of the manifold M. 
We call it the index or the signature of the 
manifold M and denote it by t(M). If the di- 
mension of M is not divisible by 4, we define 
z{M) — 0. For the product of manifolds we 
have t{M x N) — t(M) ■ x{N). Also r(M) is an 
invariant of the tcobordism class of M (Thom 
[ 6 ]). 

The index t of a differentiable manifold 
gives a homomorphism of the tcobordism ring 
Q into the ring Z of integers. Hirzebruch inves- 
tigated the multiplicative property of t and 
gave its expression by means of Pontryagin 
numbers. Let P; be the ith telementary sym- 
metric function of indeterminates , . . . , 

Then a homogeneous part of the formal power 
series 

n 

tanh.,y^ 

of , . . . , is a symmetric polynomial of 

and therefore a polynomial of P; 
with rational coefficients. For we de- 
note the homogeneous part of degree k by 
Lfc(Pi, . .. , Pfc). Specifically, if Pj are the Pontrya- 
gin classes p;(M'^*‘) of a 4/c-dimensional closed 
differentiable manifold then Lt(Pi, , P^) 
is a 4k-dimensional cohomology class of 
Then we have the formula 

called the index theorem of differentiable mani- 
folds (or Hirzebruch index theorem). For exam- 
ple, Li =(l/3)Pi, L2 = (1/45)(7P2 -Pi^), and 
L3 = (1/945) (62P3 - 1 3P2 Pi + 2Pf ), ...(-^114 
Differential Topology). Later this index theo- 
rem was generalized to the tAtiyah-Singer 
index theorem (-^ 237 K-Theory). 

H, Combinatorial Pontryagin Classes 

Let K be an oriented n-dimensional compact 
thomology manifold, and let Z' be the bound- 
ary of an oriented (r-t- l)-simplex, namely, the 
combinatorial r-sphere. Let be a 

tpiecewise linear mapping. Then for almost all 
points y of 2'"“'^', /“^(y) is an oriented 4i- 
dimensional compact homology manifold, and 
its index r(/“Hy)) is independent of y. We 
denote this by x(/). The t(/) is an invariant of 
the homotopy class of f. Let <r be the funda- 
mental class of ^"“‘^'(2'"“'^'; Z). Then for 8i 
+ 2, there exists a unique cohomology class 
I- — l-{K)e H‘^'{K;Q) such that for any piece- 
wise linear mapping / : K ->2"““^', we have 
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(/;-/*ff)[X] = t(/). We can remove the restric- 
tion n ^ 8i 4- 2 if we take KxZ"' for K and 
define /j(K) to be /,-(K x Z’™) for sufficiently 
large m. If K is a tC^-triangulation of a dif- 
ferentiable manifold M, li{K) coincides with 
the class L,(Pi , . . . , P,) defined by Hirzebruch 
(R. Thom [7]; V. Rokhlin and A. Shvarts), 
where Pj — Pj{M) is the Pontryagin class of 
M. Since the variable P, can be expressed as 
a polynomial with rational coefficients of 
j ^ i, we define the combina- 
torial Pontryagin class p,(X) of a homology 
manifold K as the polynomial of lj{K) with 
rational coefficients. Therefore, if X is a C^- 
triangulation of a differentiable manifold M, 
we have p;(X) = p;(M) (modulo torsion ele- 
ments). The class li(K) and consequently p,(X) 
are important combinatorial invariants of X. 
These classes are topological invariants [10]. 
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A. Mathematics in the Chao, Han, and Tang 
Dynasties (3rd Century B.c.-10th Century a.d.) 

In ancient China, the art of divination, called 
yi, was used in government administration. 
This was a kind of calculation that used pieces 
called tse. The book embodying it, called the 
I-Ching, is still popularly used. It shows that 
“numbers” or mathematics was seriously uti- 
lized in China at that time. The multiplica- 
tion table for numbers up to nine (called the 
Pythagorean table in the West) was known in 
China from the legendary period. However, 
mathematics in the Greek sense, that is, math- 
ematics as a logically systematized science, was 
unknown in ancient China. 

Suanching-Shihshu, or the Ten Books on 
Arithmetic — namely, Choupi-Suanching, 
Chiuchang-Suanshu, Haitao-Suanching (edited 
by Liu Hui), Suntzu-Suanching, Wutsao- 
Suanching^ Hsiahouyang-Suanching, 

Chang chiu-Suanching, Wuching-Suanshu, 
Chiku-Suanching (edited by Wang Hsiao- 
Tong), and Shushu-Chiyi (edited by Hsu Yue) 
— came into being between the 2nd century 

B. c. and the 6th century A.D., from the Chao 
to the Han eras, with the exception of the 
Chiku-Suanching compiled in the Tang era. 
These are the only mathematical texts from 
this early period whose authors and times of 
publication are known. They were used in the 
civil service examination for selecting adminis- 
trators up to the beginning of the Sung era 
(960 A.D.). The most important among them is 
Chinchang-Suanshu, or the Book of Arithmetic, 
which contains nine chapters. It treats positive 
and negative fractions with laws of operations 
on signed numbers, equations, and the elemen- 
tary mathematical knowledge of daily life. The 
Chiku-Suanching contains a number of prob- 
lems reducible to equations of the 3rd and 

4th degrees. 

There were also two works called San Tung 
Shu (edited by Tong Chuan) and Chui Shu 
(edited by Tsu Chung-Chih), but no copies of 
them are extant. Later works, one from the Sui 
era (published in 636) and another from the 
Tang era, tell us that the latter contained the 
result 3.1415927>Ti>3.1415926 and the ap- 
proximate values 355/113 and 22/7 for k. 

In the 1st century a.d. Buddhism was intro- 
duced from India, and paper was invented. 




57 B 

Chinese Mathematics 



216 



However, despite the communication with 
India, neither the Indian numeration system, 
written calculation, nor the abacus was at that 
time widely used in China. The extraction of 
square or cube roots was done with calculat- 
ing rods. 

B. Mathematics in the Sung and Yuan 
Dynasties (10th-14th Centuries) 

In the Sung and Yuan periods contact was 
made with the Arab world. In the 13th cen- 
tury, a mechanical algebra utilizing calculating 
rods made remarkable progress; this can be 
attributed to Arab influence. Toward the 
end of the Sung era appeared the Shushu- 
Chiuchang by Ch’in Chiu-Shao and the Yiku- 
Yentan by Li Chih. The former gives a method 
like Horner’s for approximate solution of 
equations, and the latter gives the principle of 
tienyuan-shu, i.e. the mechanical algebra of 
this period. The principle of tienyuan-shu was 
further expounded in the Suanhsueh Chimeng 
(1295) and the Suyuan Yuchien (1303) by Shih 
Shih-Chieh, the Yanghui Suanfa by Yang Hui, 
and other works. These were introduced into 
Japan and they influenced the wasan (Japanese 
mathematics) of early times. Until recently, no 
further original mathematical ideas appeared 
in China. 

C. Mathematics after the Ming Era (15th 
Century) 

In this epoch, European renaissance civiliza- 
tion began to influence the Orient. In 1607, 
Matteo Ricci (1552-1610) translated Books I- 
IV of Euclid’s Elements into Chinese with the 
aid of Hsu Kuang-Chi. In 1592 Suanfa Tang- 
tsung by Ch’eng Ta-Wei appeared, which 
dealt with the use of the abacus. This book 
had great influence upon wasan. 

No development was seen in the indigenous 
mathematics of the Ching era, that is, after 
the 17th century, but science and technology 
were imported by Christian missionaries. This 
brought about calendar reform from the lunar 
to the solar method. On the other hand, new 
editions of classical works such as the Ten 
Books on Arithmetic began to appear in this 
period. Emperor Kang Hsi-Ti (1655-1722), 
who was in correspondence with Leibniz, 
asked Ferdinand Verbiest (renamed Nan Huai 
Jen in Chinese) to compile Shuli-Chingwen 
(Advanced mathematics), whose 53 chapters 
were completed in 1723. This book dealt with 
European-style algebra and trigonometry. In 
the latter half of the 19th century, Alexander 
Wylie translated a number of Western mathe- 



matical books into Chinese, including Books 
VII-XIII of Euclid’s Elements and some works 
on calculus. Many current Chinese and Japa- 
nese mathematical terms originated with this 
translation. 
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C°°-Functions and Quasi- 
Analytlc Functions 

A. General Remarks 

An example of a tC®-function is a Yeal ana- 
lytic function, which is defined to be a function 
that can be expressed as a power series that 
converges in a neighborhood of each point 
of the domain where the function is defined. 
Many examples, however, show that real 
analytic functions form a rather small subset 
of the C®-functions. Sometimes C“-functions 
(not real analytic functions) play essential roles 
in the development of theories of analysis (^ 
105 Differentiable Manifolds S). On the other 
hand, there is a subfamily of C*-functions 
having some remarkable properties in com- 
mon with the family of real analytic func- 
tions. This family is called the family of quasi- 
analytic functions. It has been an important 
object of study since the beginning of the 20th 
century. On the other hand, the Gevrey class 
functions, which are no longer quasi-analytic 
in general, also constitute an important subset 
of C®-functions. The first part of this article 
deals with C°®-functions, the second part with 
quasi-analytic functions, and the third part 
with Gevrey class functions. 

B. C®-Functions 

Let Q be an open set of the n-dimensional real 
Euclidean space R". A real-valued function 
/(xj , . . . , x„) defined on Q is called a function 
of class C“ on Q (or C”-function on Q) if 
/(xj, . . . ,x„) is continuously differentiable up to 
any order. The totality of C“-functions defined 
on D is denoted by C“(Q). It is an ^associative 
algebra over the real number field R. A contin- 
uous function / defined on some closed set F 
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of R" is called a C“-function on F if there exist 
an open neighborhood U of F and a function 
geC^iU) such that f —g\F. This definition is 
equivalent to the following (H. Whitney [6]): 
For any multi-index a=={a,, we can find 

a continuous function /*(x i , . . . , jcJ on F such 
that (i)/^(xi, ...,x„) = /(xj, ••.,x„) and (ii) for 
any positive integer r and for every multi- 
index a with IfxKr, 



lim 

y^x 



1 

||y-x|r-l*l 



/"W 



- Z 

|a-b^j<r 



i^-yf 



= 0 , 



where II || denotes the Euclidean norm of R". 



C. Local Theory of C® -Functions 

We shall now introduce an equivalence rela- 
tion ~ in C“(R"), defined as follows: 
of\U — g\U for some open neighborhood 
U of the origin. Let S‘„ denote the quotient set 
C®(R")/~, which naturally inherits the struc- 
ture of an associative algebra from C®(R"). An 
element of is called a germ of a C®-function 
at the origin. We denote the germ of /eC®(R") 
by / To we assign the formal Taylor 

expansion Z(D“/(0)/a!)x® around 0, where 
£)*/ means (5''''^ ■^''''/)/(5'''Xi ...d'''’X„) for a — 

(r j , . . . , r„). This assignment induces a homo- 
morphism T from to the tring of formal 
power series R[[xi, ...,x„]] of n variables. 

The homomorphism r is surjective but not 
injective. Put A„ = i“'(0)c:^„. A function / 
whose germ f belongs to A„ is called a flat 
function. The function (p{x) defined by (p{x) — 
exp( — 1/x^) when x^^^O and ^(0) = 0 is an 
example of a flat function on Rf A close study 
of the relationship between and R[[Xi, 
...,x„]] leads to the preparation theorem for 
C”-functions, which can be stated as follows: 
Let F(xi, ...,x„)eS’„ satisfy F(0, ...,0, x„) = 
ige<^i,gi0)^ 0). Then any fe S„ can 
be expressed as / = FQ + R, where Q€S‘„ and 
R = with (B. 

Malgrange [3]). 

Let /(xi, ...,x„) be a symmetric function in 
(x 1 , . . . , x„) of class C® . Then there exists a 
germ geS‘„ such that/(xj, ...,x„) = 3 (cti, 

<T„), where ctj , . . . , cr„ denote elementary sym- 
metric functions with respect to Xi , . . . , x„ (G. 
Glaeser, Malgrange). Let/etfi satisfy /(x) = 

/{ — x). Then there exists a germ geS’i such 
that 7(x) = g{x^) (H. Whitney). 



D. Global Results 

Case of n Variables. C®(Q) becomes a ^Frechet 
space when it is endowed with the topology of 



uniform convergence on compact sets for all 
partial derivatives. Let J and be two closed 
tideals of C®(Q). Then we have J = J, if and 
only if T^(J) = T^(Ji) for each xef2, where is 
the mapping from C® (Q) to the ring of the 
formal power series that assigns the formal 
Taylor series of / around x (Whitney). 

Case of One Variable. In the case of one vari- 
able, further information can be obtained from 
various points of view. In the following, / 
denotes a C®-function defined on the unit 
interval / — [0, 1]. If / satisfies /(!)= l,/'(0) 

= . . . = 0, then we have 

1 1 1 

1 'r~~ F • • • H ~j=^ < 

where m; = sup{l/^'^'(x)l|x6/} and where k 
is some constant independent of the choice 
of f and r (E. Borel). Similar kinds of inequal- 
ities were obtained by A. N. Kolmogorov, 

A. Gorny, and H. Cartan. Let A be an arbi- 
trary countable set of real numbers. If for any 
xe/ we can find an integer r(x) such that 

then such a function / is neces- 
sarily a polynomial. The interval / can be 
divided into three disjoint subsets: S[^\ S^/\ 
and Sf \ These are characterized as follows: 
For xESj^* the formal Taylor series t^(/) of / 
around x converges to / in some neighbor- 
hood of X. For xeSf\ t,^(/) diverges. And for 
xeSf\ r^{f) converges in some neighbor- 
hood of X but does not tend to f Then S'/’ is 
an open set and S/’ is a G^-set, while S/’ is 
an F„-set of the tfirst category. Conversely, let 
/ = Si Sj + S3 be any partition of / into an 
open set Sj, a G^-set S2, and an F„-set S3 of the 
first category. Then there is some / e C® (/) 
withS; = S/’ (i=l,2, 3) [5]. 



E. Relations between C “-Functions and Real 
Analytic Functions 

Let C^{I) be the set of real analytic functions 
on /. Then C"(7) is a subalgebra of C®(/). 
Applying the above result in the case of S, = 
0, we find a function / e C®(/) that admits 
no real analytic function coinciding with / 
in a subinterval of /. Actually, functions with 
such a property are distributed densely in 
C®(/). A necessary and sufficient condition for 
a function / eC®(/) to belong to C“(/) is that 
for suitable constants A and k, l/'"’(x)l ^ 
Ak"n], xeI,n — 0, 1, 2, ...,be valid (Pring- 
sheim’s theorem). If /'"’(x)^0 for all xe / and 
n = 0, 1,2, ...,then/ eC“{/) (S. N. Bernshtem). 
For any open set n (c R"), the set C“(D) of 
real analytic functions on Q is dense in C®(fi) 
(polynomial approximation theorem). This re- 
sult is true even when the topology of C®(Q) 
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is replaced by a stronger one (Whitney [6] ). 
Let /e C “ (Q) and e C ® (f2). Then we can 
find ^6 C”(Q) satisfying f —g(p if and only if 
for any xeQ, ) is divisible by r^{(p) in the 
ring of formal power series (S. Lojasiewicz, 
Malgrange [3]). 



F. Quasi-Analytic Functions 

The investigation of quasi-analytic functions 
began with the attempt to obtain an intrinsic 
characterization of analytic functions. Borel 
defined monogenic functions as functions 
differentiable on their domains of definition, 
which can be any subset of the complex plane, 
not necessarily assumed to be open (-^ 198 
Holomorphic Functions Q). Similar to com- 
plex analytic functions, monogenic functions 
are uniquely determined by their values on 
any curve. While quasi-analyticity can be 
defined by such properties, it is customary to 
approach quasi-analytic functions from an- 
other aspect, that is, the behavior of higher 
derivatives of C”-functions. 

Generally, a subset B of C'®(7) is called a set 
of quasi-analytic functions if the mapping : 
B^R[[x]] defined in Section D is injective at 
each point xeI. The functions belonging to B 
are called quasi-analytic (with respect to B). 
Here, an important problem is to characterize 
a set of quasi-analytic functions B by specializ- 
ing the image of t^{B). 

Now let {M„} be a sequence of positive 
numbers. Let C(M„) be the subset of C*(/) 
consisting of / such that 

|/<"’(x)KAfc"M„, xel, n = 0,1,2,..., 

where A — A{f) and k = k{f) are constant. 

Then Pringsheim’s theorem simply asserts that 
C(n!) = C“(7). 

In 1912, J. Hadamard raised the problem of 
determining the condition that the sequence 
{M„} should satisfy so that C(M„) becomes a 
set of quasi-analytic functions [9]. A. Denjoy 
showed that if 

M„ — (nlog^ nlog^n ...log'’n)", 
where 

log* n==logn, log^n = log(logP“* n), 

p = 2,3,..., 

then C{M„) is a set of quasi-analytic functions 
[10]. Later he derived an improved result that 
the condition oo is sufficient. T. 

Carleman first gave a necessary and sufficient 
condition for C(M„) to be a set of quasi- 
analytic functions, and later A. Ostrowski and 
T. Bang gave another version of the same 
condition [11-13]. The condition states essen- 



tially the following: A necessary and sufficient 
condition for C(M„) to be a family of quasi- 
analytic functions in the interval (a, b) is given 
by either (i) 2^“* = -t-oo, where ^„ = inf*^„ 
(Carleman), or (ii) J“(log T{r})/(r^)dr— oo, 
where T(r) = sup„j, , {r^jMJ (Ostrowski, Bang). 
S. Mandelbrojt and T. Bang also gave another 
condition [13, 14]. (The simplest proof of these 
results is found in [13] or [15], where the 
proof follows Bang’s idea.) 

Related to the above theorem, we also have 
the following: Let |M„} be a sequence of posi- 
tive numbers with S(M„/M„+J< oo. For <x>0 
we can find / eC{M„) defined on (— oo, oo) such 
that /(O) > 0, /*"*( + a) = 0. Moreover, for 0 < 
a.<ji there exists / e C(M„) such that /(O) > 0, 
n = 0,l,2,...) [16]. 

Suppose that we are given an interval 7 and 
increasing sequences {v„} and {M„} of positive 
numbers. Then we have the problem of finding 
suitable conditions on {v„} and {M„} under 
which the mapping / ^{/‘''"’(xq)} gives an 
injective mapping from C(M„) to the sequences 
above. When {v„} and {M„} satisfy the above 
conditions, then a function belonging to C(M„) 
is called quasi-analytic (v„) in the generalized 
sense. The study of the inclusion relation be- 
tween two families C{M„) and C{M'„) also 
deserves attention. In [15] the relation be- 
tween C(M„) and C(n!) = C"(7) is discussed in 
detail. There are many open problems con- 
cerning the relationship between C(M„) and 
C(M'„) in general. 

Quasi-analytic functions are closely related 
to problems in various branches of analysis, in 
particular the theories of complex analytic 
functions, Fourier series, Fourier integrals, 
Dirichlet series, and asymptotic expansions 
[8,15,17]. 



G. Gevrey Class Functions 

This class of functions has its origin in the 
study of tparabolic equations. A C°° -function 
/(x) defined in a domain of R” is called of 
Gevrey class s (1 < oo) if for every eom- 

pact set K in that domain there exist positive 
constants and such that 

max|7)Y(x)|^AKCWar 

for all multi-indices a(|a| = ai-l-...-i-a„,a! = 
a 1 ! Xj ! • . . a„ !). A typical example is 

^ fexp(-l/x) for x>0, 

“jo for x^O, 

which is of Gevrey class 2. Let us denote the 
function space of a Gevrey class s by Evi- 
dently if s<s'. Let/(x)ey'** and 

F(y)ey‘*’; then xi— >f(/(x))ey‘**. In particular. 
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the sum and the product of two functions in 
belong also to Furthermore, the tim- 
plicit function theorem holds in this class. 
Unlike the class of analytic functions, we can 
use this class of functions of class s ( > 1) for a 
tpartition of unity, and several problems in 
partial differential equations can be treated 
within this function space. Gevrey class func- 
tions are discussed in a complete form in [20] 
(- also [19]). 
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Class Field Theory 

A. History 

The notion of a class field was first introduced 
by D. Hilbert (1898). Let k be an talgebraic 
number field and K a tGalois extension of k. 
Hilbert called such a field K a class field over k 
(or K/k was called a class field) when the fol- 
lowing property was satisfied: A tprime ideal 
p of /c of absolute degree 1 (i.e., a prime ideal 
whose tabsolute norm is a prime number) is 
decomposed in K as the product of prime 
ideals of K of absolute degree 1 if and only if 
p is a '^'principal ideal. (Such a field K is now 
said to be an absolute class field over k in 
order to distinguish it from a class field defined 
later more generally by T. Takagi as explained 
below.) Hilbert conjectured the following theo- 
rems (1) (4) together with the principal ideal 
theorem (— ^ Section D), and proved them in 
some special cases. (1) For any algebraic num- 
ber field k there exists one and only one class 
field K/k. (2) A class field K/k is an tAbelian 
extension whose tGalois group is isomorphic 
to the tideal class group of k. Hence the degree 
n = \_K:k'] is equal to the tclass number h of k. 
(3) The trelative different of a class field K/k is 
the principal order; thus K/k is an tunramified 
extension. (4) Let p be a prime ideal of k, and 
let / be the smallest positive integer such that 
p^ is a principal ideal. Then p is decomposed 
in the class field K//c as p = 'ip i ^2 • • • 

^Kiki%) = V^,f9 = n. 

Hilbert was led to these conjectures by the 
analog to the theory of talgebraic functions in 
one variable. Theorems (l)-(4) were proved by 
P. Furtwangler {Math. Ann., 63 (1907)), but 
these results were subsumed under the class 
field theory of Takagi, who generalized the 
notion of class field and proved that every 
Abelian extension of k is a class field over k (J. 
Coll. Sci. Imp. Univ. Tokyo, (9) 41 (1920)). Since 
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then, the arithmetic of Abelian extensions of k 
has developed through this theory. In Takagi’s 
paper, L. Kronecker’s problem concerning 
Abelian extensions of an imaginary quadratic 
field (— ^ 73 Complex Multiplication) was 
solved simultaneously; this had been an open 
problem since the 19th century. Later, E. Artin 
proved the general law of reciprocity {Abh. 
Math. Sem. Univ. Hamburg, 5 (1927)) which put 
class field theory into its complete form. The 
original proof by Takagi was rather com- 
plicated, and H. Hasse, Artin, J. Herbrand, C. 
Chevalley, and others tried to simplify it. In 
particular, Chevalley introduced the notion 
of tideles and gave a purely arithmetic proof. 
On the other hand, attempts are also being 
made to generalize this theory to non-Abelian 
extensions. We mention here the results of 
G. Shimura [15] and Y. Ihara [16]. Today, 
class field theory is considered one of the 
most beautiful theories in mathematics. 



B. Definition of a Class Field 

Let k be an algebraic number field. For the 
definition of a general class field over k, we 
need a generalization of the ideal class group 
of k 14 Algebraic Number Fields H). Let m 
be an tintegral divisor of k, and let 3(Ttt) be the 
multiplicative group of all fractional ideals of 
k which are Relatively prime to m. For the 
rest of this article, we mean by an ideal of k 
a fractional ideal of k. Denote by S{m) the 
Ray modulo m. Let H(m) be an tideal group 
modulo m, that is, a subgroup of 3(ttt) con- 
taining S(m). A Galois extension X of k is said 
to be a class field over k for the ideal group 
H{m) if the following property is satisfied: A 
prime ideal p of k of absolute degree I which is 
relatively prime to m is decomposed in K as 
the product of prime ideals of K of absolute 
degree 1 if and only if p belongs to H(m). The 
absolute class field of Hilbert is the case where 
m = (l) and H{m) is the group of all principal 
ideals of k. 

A class field K/k for an ideal group H is 
uniquely determined by H (uniqueness theo- 
rem). The tconductor f of // is said to be the 
conductor of the class field for H. The ideal 
group H corresponding to the class field K/k is 
determined by K as follows: if(f)/S(f) is the 
union of all cosets C of 3(f) modulo S(f) such 
that C contains a Relative norm of 

some ideal 91 of K which is relatively prime to 
f. In general, let K/k be a Galois extension and 
m be an integral divisor of k. Let H(m) be the 
union of all cosets C of 3(^Tt) modulo 5(m) such 
that C contains a relative norm ^^^^(91) of 
some ideal 91 of K which is relatively prime to 



m. Then H{m) is a multiplicative subgroup of 
3(m), and the index k = (3(m)://(m)) is not 
greater than the degree n = [K :k]. We have 
h = n if and only if K/k is the class field for 
H. Hence a class field K/k can be defined as 
a Galois extension of k such that h = n for a 
suitable integral divisor m of k. 



C. Fundamental Theorems in Class Field 
Theory 

(1) Main theorem: Any Abelian extension K/k 
is a class field over k for a suitable ideal group 
H. 

(2) Existence theorem: For any ideal group 
H{m) there exists one and only one class field 
for Him). 

(3) Composition theorem: Let K^ and K 2 be 
class fields for H^^ and H 2 , respectively. Then 
the composite field iCj K 2 is the class field over 
k for Hi n/l 2 . Consequently, K^:dK 2 if and 
only if Hi czHj. 

(4) Isomorphism theorem: The Galois group 
of a class field K/k for H(m) is isomorphic to 
3(m)/H(m). In particular, every class field 
K/k is an Abelian extension of k. 

(5) Decomposition theorem: Let f be the 
conductor of the class field for H. If p is a 
prime ideal of k relatively prime to f and / is 
the smallest positive integer with p^eH, then 
p is decomposed in /C/k as p == '']3i ^^2 • ■ ■ ^4^9 ^ 

(6) Conductor-ramiflcation theorem: Let f be 

the conductor of a class field K/k. Then f is not 
divisible by any prime divisor that is unrami- 
fied for K/k, and f is divisible by every prime 
divisor that ramifies for K/k. Let f = nfp, 
fp = p‘’. Then coincides with the tp-conduc- 
tor of K/k, and the exponent c can be explicitly 
expressed by the order of the tramification 
groups and the tramification numbers of p for 
K/k (^ 14 Algebraic Number Fields P). 

(7) Let p be a prime ideal of k that rami- 
fies for K/k. Let H^ be the ideal group of k 
such that (i) the conductor of Hp is relatively 
prime to p and (ii) Hp is the minimal ideal 
group of k containing H with property (i). 

Let n — \_K\k'], e~iH^:H), and p^eH^, where 
p‘'(d< f)$H^. Then p is decomposed in K/k as 
p = . ?!,)', N^4%) = p'.n = efg. 

(8) Translation theorem: Let K/k be the class 
field for an ideal group H(m), and let Q be an 
arbitrary finite extension of k. Then KQ./Q. is 
the class field for H*, where H* is the ideal 
group of Q consisting of all ideals b of with 
Njj/fc(b)eH(m). In particular, the conductor of 
KQ/Q is a divisor of the conductor of K/k. 

(9) Artin’s general law of reciprocity; Let K/k 
be the class field for an idea group H with the 
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conductor f. We denote the '’’Artin symbol of 
an ideal a of k that is relatively prime to f by 




Let a mapping <1) from 3(f) to the Galois 
group G of K/k be defined by <D(a) = (K/a) for 
a G 3(f)- Then <D induces the isomorphism 
3(f)/ff(f)^G. Namely, the isomorphism men- 
tioned in (4) is explicitly given by the Artin 
symbol. Also, the ideal group //(f) is character- 
ized as the set of all ideals a such that a g 3(f) 
and {K/a) = 1. From this theorem we can prove 
all the known laws of reciprocity for power- 
residue and norm-residue symbols (^14 
Algebraic Number Fields O, Q, R). 

From the general results of class field theory 
we can systematically derive all the known 
theorems concerning the arithmetic of qua- 
dratic fields, cyclotomic fields, and Kummer 
extensions. 

D. Principal Ideal Theorem 

Let K/k be an absolute class field. Then the 
extension of any ideal of fc to /C is a principal 
ideal of K. This theorem is called the principal 
ideal theorem. It was conjectured by Hilbert, 
formulated by Artin as a theorem of group 
theory, and proved by Furtwangler (Abh. 

Math. Sent. Univ. Hamburg, 7 (1930)). Later a 
simple proof was given by S. lyanaga {Abh. 
Math. Sem. Univ. Hamburg, 10 (1934)). This 
theorem was also generalized to the following 
general principal ideal theorem (lyanaga, 
Japan. J. Math., 1 (1930)): Let K/k be the class 
field for the ray S(f), and let f = where X* is 
the relative different of K/k. Then the exten- 
sion to K of any ideal of k that is relatively 
prime to f belongs to S(3)- Put 3 = ri''j3‘'. 

Then v is equal to the ramification number 
r,.-t- 1 (— 14 Algebraic Number Fields K). For 
an absolute class field K/k, let the extension of 
an ideal a to K be (0(a)). Then we can choose 
0(a)6/v such that 0(a)0(fa)'’'"*0(nb)~’ Gk, 
where cr(a) = (/C/a) is the Artin symbol for a 
(T. Tannaka, Ann. Math., 67 (1958)). This 
result can also be generalized for the class 
field for S(f). 

E. Theory of Genera 

Let K/k be a Galois extension and let H{m) be 
an ideal group of k. The set of all ideals 91 of K 
relatively prime to m such that N^/t(9I) be- 
longs to H{m) forms an ideal group of K. This 
ideal group is said to be the principal genus for 
H. Each coset of 3(>n) modulo the principal 
genus for H is said to be a genus for H. In 
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particular, let K/k be a cyclic extension with 
the conductor f, and let //(f) be the ideal group 
of k generated by Nj^j^iA) {AeK) and S(f). 

Then the principal genus for //(f) is the ideal 
group formed by the ideal classes of K of 
the form C‘ where cr is a generator of the 
Galois group of K/k (-^ 347 Quadratic Fields 
F). In general, let K/k be an Abelian extension, 
and let f be the conductor of K/k. Then for the 
ideal 3 = 11^'’ of X defined in Section D, 
A'^/i(S(m3)) = S'(mf) for an arbitrary integral 
ideal m of k. In particular, let K/k be a cyclic 
extension, and let // = S(mf). Then the prin- 
cipal genus for H is the ideal group consist- 
ing of all cosets of 3(Tn<^) modulo S(m3) of 
the form (Herbrand; lyanaga, J. Reine 
Angew. Math., 171 (1934)). 

F. Class Field Tower Problem and 
Construction Problem 

Furtwangler considered the following prob- 
lem: Let k be a given algebraic number field, 
k = ko <= k J c k 2 . . . be the sequence of fields 
such that kj is the absolute class field over k,-_i, 
and be the union of all the k;. Is X^ a 
finite extension of k? The answer is yes if and 
only if k„ is of class number 1 for some n. This 
problem is called the class field tower problem. 
Artin remarked that if for every algebraic 
number field F of degree n we have the inequal- 
ity I Dp\ > {n/4)^''^{n"/n\)^ > {ne^/4)"/{27me^‘^'') 
for the tdiscriminant Dp, then X^/k is always 
finite [1, p. 46], E. S. Golod and I. R. Shafa- 
revich (1964) solved the class field tower prob- 
lem negatively; they proved that X^/k is 
infinite if k,- (/ — 1,2, ...) is the maximal unrami- 
fied Abelian p-extension of kj_i for a fixed 
prime number p and if the inequality y ^ 3 + 
2^ p + 2 holds, where y is the minimal num- 
ber of generators of the p-component of the 
ideal class group of k and p is the rank of the 
unit group of k. (We call an extension X/k a p- 
extension if the degree [X : k] is a power of a 
prime number p.) For example, the class field 
tower X^/k is actually infinite if k is an imagi- 
nary quadratic field (p = 1 ) and y > 7 for p = 2, 
for example, k = Q{,J — 2-S-l \ 1 • 13- 17 - 19). 

Construction problem. Let k be a given alge- 
braic number field and G a finite group. The 
construction problem asks us whether there 
exists a Galois extension X/k such that its 
Galois group Gal(X/k) is isomorphic to G. If G 
is Abelian the problem can be solved affirma- 
tively by using class field theory. This problem 
was also solved affirmatively for p-groups by 
A. Scholtz and H. Reinhardt in 1937, and for 
general solvable groups by Shafarevich in 1954 
{Izv. Akad. Nauk SSSR, ser. mat. 18). 
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G. Class Field Theory for Algebraic Function 
Fields and Local Class Field Theory 

F. K. Schmidt developed an analog of class 
field theory for Abelian extensions over an 
algebraic function field in one variable with 
finite coefficient field (1930; ^ [8]). An arith- 
metic proof was given by M. Moriya (1938). 

An analog of class field theory for local fields 
with finite residue-class fields, called local class 
field theory (^ 257 Local Fields) was first 
developed by Hasse, and later Chevalley gave 
an algebraic derivation (1933). 



H. Cohomology of Groups and Class Field 
Theory 

For the purpose of simplifying the proof of the 
main theorems in class field theory, the theory 
of tGalois cohomology was developed by T. 
Nakayama, G. Hochschild, A. Weil, Artin, J. 
Tate, and others. In particular, Artin and Tate 
[9] constructed class field theory on the basis 
of the cohomology theory of finite groups as fol- 
lows: Let G be a finite group, A a tG-module 
(or a multiplicative commutative group with 
the operator domain G), and i?"(G, A) the nth 
tcohomology group (n = 0, + 1 , + 2, . . . ) of 
G with coefficients in A (^ 200 Homolog- 
ical Algebra N). Then we have H^(G, A)^ 
A^/Nq{A), where A® is the set of all G-invariant 
elements in A and Nq{A) is the set of all ele- 
ments of the form NQ{a) = '^„^Q(ja (aeA). We 
can consider Z a G-module by defining cm = n 
(neZ, creG). Let A, B, C be G-modules such 
that a G-bilinear mapping (A, 5)->C is de- 
fined. Then we can define the tcup product 
(a,j8)^a— )? (ae/7''(G, A), /?e//"(G,B), a— /?G 
//'■^*(G, C)) for r, seZ with the usual prop- 
erties. Let A be a G-module and H a sub- 
group of G. Then the trestriction homomor- 
phism Rg^^:H"(G, A)-yH"{H,A) and the 
tinjection homomorphism Inj„^c:H"(H, A)-> 
/?"(G, A) are defined for neZ. If // is a 
normal subgroup of G, then the tinflation 
homomorphism Inf(G///)/G ^ H"(G///, A")-» 

H"(G, A) can be defined for n > 1 (-^ 200 
Homological Algebra M). 

Let k be an algebraic number field, and let 
X be a Galois extension of k of degree n with 
the Galois group G — G{K/k). The multiplica- 
tive group K"" ^K — {0}, the tidele group of 
K, and the idele class group Q of K are multi- 
plicative commutative groups with G as their 
operator domain. The fundamental formulas 
in Galois cohomology for class field theory are 

hhg,q)-o, (1) 

//'(G,Q))^Z/nZ. (2) 



It is possible to realize the isomorphism of 
(2) by the invariant mw 
(mod Z) I r = 0, 1, . . . , n — 1 } in such a way that 
the following properties hold, where the canon- 
ical cohomology class for Kjk is the element 
of Q) such that invjf/fc<^K/k = (f/«) 
(mod Z): (i) for /c c / c X, G = G(K //c), H — 
G(X//), the relation R^SQ/u^K/k — ^kh holds; 

(ii) if l/k is also a Galois extension with F — 
G{ljk), then we have ln^p|Qi^l|k — ^K|k (^ = 

[X : /]); (iii) for a cyclic extension K/kwe 
have invK^fc^xM = Ip«nv,((^j^/Jp (modZ), where 
p runs over all prime divisors of k and inVp is 
the invariant in the local theory (^ 257 Local 
Fields E). By these properties, the canonical 
cohomology class is uniquely determined. 
After these preliminaries we can state Tate’s 
theorem, from which the fundamental theo- 
rems in class field theory follow. 

Tate’s theorem. Let K/k be a Galois exten- 
sion with the Galois group G. Then we have the 
isomorphism 0„:H""^(G, Z)^//"(G, Q) {n = 

0, ±1, ± 2, . .. ) that is given explicitly by 
^«(a) = ^K//c^a. where ^K/k^H^G^Cp) is the 
canonical cohomology class for K/k {Ann. 
Math., (2) 56 (1952)). 

Corollary 1. Since /?"^(G, Z) = G/[G, G] and 
H^{G, Cji)^CJNf^ik{C^}, we have the isomor- 
phism Oo : G/[G, G] ^ Ck/NKikiCii). Let /(x, a) 

(x, (tgG) be a 2-tcocycle belonging to 
Then by the explicit expression for the cup 
product we obtain the isomorphism 

d>o:(T|^mod[G, G])-> 

n /(t, cr)~^ (mod . 

teG / 

This is an analog of the result in local theory 
that was proved earlier by T. Nakayama and 
Y. Akizuki {Math. Ann., 112 (1936)). 

Corollary 2. For an Abelian extension K/k 
we have the isomorphism : G = Cfc/Ax/fc(Cx). 
<I>Q ‘ has the property of being the norm- 
residue symbol for C^, and from this iso- 
morphism we can prove immediately Artin’s 
law of reciprocity. Thus we can prove the main 
theorems in class field theory by cohomology- 
theoretic methods [9]. 

We can also see, by generalizing this iso- 
morphism to infinite Abelian extensions, that 
the Galois group of the maximal Abelian 
extension of k over the ground field k with 
tKrull topology is algebraically and topologi- 
cally isomorphic to Q/£>^, where is the 
connected component of the unit element in 
Cfc. The structures of and CJO^ were ex- 
plicitly determined by Artin and T. Kubota, 
respectively (^ 6 Adeles and Ideles D). 

If we assume the fundamental formulas (1) 
and (2) stated above and several other simple 
assumptions as axioms for an infinite ex- 
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tension of a fixed ground field, we can de- 
velop the results stated in this section purely 
cohomology-theoretically. Such a system is 
called a class formation (Artin; ^ [9]). In ad- 
dition to the cases of algebraic number fields, 
algebraic function fields in one variable with 
finite coefficient fields, and local fields with 
finite residue-class fields, which we have men- 
tioned already, we also know several other 
cases for which analogies of class field theory 
are valid. These analogies can be explained 
systematically by using class formation the- 
ory (Y. Kawada, Duke Math. J., 22 (1955)). 
Examples are (1) the theory of unramified 
Abelian extensions of an algebraic function 
field in one variable with algebraically closed 
constant field of characteristic 0 (Tate and 
Kawada, Amer. J. Math., 11 (1955)); (2) the 
theory of Kummer extensions over a field k 
such that (i) the characteristic of k is 0, (ii) k 
contains all the roots of unity, and (iii) for any 
Galois extension K/k, A^/^(K) = k; (3) the 
theory of Abelian p-extensions of a field of 
characteristic p (E. Witt, J. Reine Angew. 
Math., 176 (1963); I. Satake and Kawada, J. 
Fac. Sci. Univ. Tokyo, 1 (1955)); (4) the theory 
of unramified Abelian p-extensions of an 
algebraic function field in one variable with 
algebraically closed constant field of char- 
acteristic p (Hasse and Witt, Monatsh. Math., 
43 (1936), H. L. Schmid, I. R. Shafarevich, 
Kawada, T. Tamagawa); and (5) the theory 
of Abelian extensions of a local field with 
algebraically closed residue-class fields (J.-P. 
Serre, Bull. Soc. Math. France, 89 (1961)). 

An analogy of class field theory for infinite 
Abelian extensions was considered by Her- 
brand, Moriya, M. Mori, and Kawada. 
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Classical Groups 

A. Introduction 

The general linear groups, unitary groups, 
orthogonal groups, symplectic groups, etc., 
that are described below are all called classical 
groups (^13 Algebraic Groups, 151 Finite 
Groups, 248 Lie Algebras, 249 Lie Groups). 

B. General Linear Groups 

Let F be a tlinear space of dimension n over a 
Afield K, and let GL{V) denote the set of all 
^linear mappings of V onto V (hence they are 
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all bijections). Then GL{ V) is a group under 
the composition of mappings. This group is 
called the general linear group (or full linear 
group) on V. Let e^, ...,e„he a basis of V over 
K, and let (aj) be the matrix associated with an 
element A of GL(V): Aei — Y^jOilej. Then the 
mapping A~*{oc‘j) is an isomorphism of GL{V} 
onto the multiplicative group GL(n, K) of all 
nxn tinvertible matrices over K. We can 
thus identify the group GL(V) with GL(n, K). 
GL{n, K) is called the general linear group of 
degree n over K. Consider the homomorphism 
(|.4| is the determinant of A) of GL{ V) 
onto the multiplicative group K* = K — {0}. Its 
kernel SL{V) is a normal subgroup of GL(V) 
and is called the special linear group (or uni- 
modular group) on V. The subgroup SL{n, K) 
^{A\Ae GL(n, K), | /4 1 - 1 } of GL(n, K) corre- 
sponds to SL(K) under the above isomorphism 
GL{V)^GL{n,K). SL{n,K) is called the special 
linear group of degree n over K. Unless n~2 
and K is the tfinite field F 2 = GF{2), SL{n, K) is 
the ^commutator subgroup of GL{n, K). The 
tcenter 3 of GL{n, K) coincides with the set 
of all scalar matrices olI {cteK*), and the 
center 30 of SL(n, K) is a finite group given by 
3 nSL(«, X)-{a/|aeX,a” = l}. 

Now let P{V) be the tprojective space of 
dimension n — \ obtained from a linear space 
V of dimension n. Namely, P{V) is the set 
of all linear subspaces of dimension 1. Then 
there exists a natural homomorphism (p of 
GL{V) into the group of all projective trans- 
formations of P{V), and the tkernel of (p 
coincides with the center 3 of GL{V). Hence 
(p{GL{V))^GL{V)/i. This group is written as 
PGL{V) and is called the projective general 
linear group on P(V). Similarly, PGL{n, K) 

= GL{n, K)h is called the projective general 
linear group of degree n over K. The quotient 
group SL{n, K)/^q of SL(n, K) by the center 3 q 
is called the projective special linear group and 
is written as PSL{n,K) or LF{n, K) (linear 
fractional group). 

The groups GL{n, K), SL{n, K), etc., are also 
written as GL„{K), SL„{K\ etc. In particular, 
when K is the tfinite field F^, these groups 
are denoted by GL{n,q), SL{n;q), PGL{n,q), 
PSL{n,q), LF{n,q). 

Simplicity of PSL{n, k). When n = 2 and K — 

F 2 , PSL(2, 2)^ S 3 (the tsymmetric group of 
degree 3). When n = 2 and K = ¥^, PSL{2,3)^ 
214 , (the alternating group of degree 4). Except 
for these cases, the group PSL{n, K) {n ^ 2) is 
a noncommutative Nimple group (^151 
Finite Groups I). 

Suppose that K is the finite field F,, and let 
a(n,q), p{n,q), y(n,q), S{n,q) denote the orders 
of GL{n,q), SL{n,q\ PGL{n,q), PSL(n,q), res- 



pectively. Then we have 

oc(n, ^) = (^"-l)(^"-q)... (?”-<?"“'), 

[i{n, q) = y(n, q) - a(n, q)/(q - 1 ), 

3{n, q) = y{n, q)/d, 

where d — (n,q — 1) (the greatest common divi- 
sor of n and q — \). 

C. Properties as Lie Groups 

If the ground field K is the field R of real num- 
bers (the field C of complex numbers), the 
above groups are all +Lie groups (tcomplex Lie 
groups). In particular, SL{n, C) is a ^simply 
connected, tsimple, and tsemisimple complex 
Lie group of type /4„_, , and PSL{n, C) is the 
^adjoint group of the complex simple Lie 
algebra of type A„_^. 

D. Determination of the Rational 
Representations of GL(V) 

In Sections D and E, the field K is assumed to 
be of characteristic 0. Let p be a homomor- 
phism of GL{V) = GL{n, K) into GL(m, K) (p : 

A = = Then if each is a rational 

function (or polynomial or analytic function) 
in (a J , flC 2 , • ■ ■ , «n ) over K, p is called a rational 
representation (or polynomial or analytic repre- 
sentation) of degree m of GL{V). (We suppose 
that K is R or C when we consider analytic 
functions.) For example, every rational repre- 
sentation of degree 1 can be expressed as /I 
lAl^" {e is an integer). In particular, if K is the 
field C of complex numbers, every analytic 
representation of GL(n,C) is a rational repre- 
sentation. Since GL(n, C) is the tcomplexifica- 
tion of the tunitary group U{n), there exists a 
one-to-one correspondence between the com- 
plex analytic representations of GL(«, C) and 
the continuous representations of U (n); this 
correspondence preserves equivalence, irre- 
ducibility, ■'"tensor product, and direct sum 
of the representations (^ 249 Lie Groups). 
Hence, determining the rational representa- 
tions of GL{n, C) is equivalent to determining 
the continuous representations of U (n). In the 
general case, the rational representations of 
GL{V) = GL{n, K) are all completely reducible. 
For any rational representation p of GL{V), 
there exists a natural number e such that the 
representation p' : A-*\Af p(A) is a polynomial 
representation. Hence in order to determine 
the rational representations of GL(F), it is 
sufficient to determine the irreducible poly- 
nomial representations of GL{V), which, as 
described below, can be obtained by decom- 
posing the representations on the Tensor space 
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K™ = K (X) . . . (X) F of degree m (m copies of V) 

(m = 1, 2, . . . ). For /I e GL(F), define D^{A)e 
GL(F'") as the tensor product 

D^{A) = A® ...® A {m copies of A). 

Namely, for t;i, F, we have 

D^{A){vi ® ■■■®vJ^Avi® ...®Av„. 

The mapping GL{V)BA—yD„{A)eGL{V"') is a 
polynomial representation of degree n"‘ of 
GL(F). Now let filF"*) be the tassociative 
algebra of all linear mappings of F™ into F™ 
(ttotal matrix algebra), and let 21 be the sub- 
algebra of generated by {D^(A)\Ae 
GL(F)}. Next, for an element o of the symmet- 
ric group S„, of degree m, define GL(F'") 
by ® = ...®v„ i^„y 

Then the mapping is a representation 
of on F™. Thus we obtain a representation 
T of the tgroup ring K[S„,] of S„, over K on 
F'":K[6J^i>(F"'). Set i(K[SJ) = 23. Then 
21 and © are tcommutors of each other in 
£( F"*), i.e., 21 - { a: e fi( F"*) 1 XS = BX (for all 
Be^)},^ = {Xe2{V"')\AX = XA (for all 
/4e2I)}. 

Now for a right ideal r of ©, let r(F'") be the 
subspace of F™ composed of all the finite sums 
of the form ^Bx (Bgv,xg F'"). Then the fol- 
lowing statements hold: 

(1) r{V") is invariant under 2f; hence it is a 
subspace of F"" invariant under GL(F). Con- 
versely, for any subspace V of F"* invariant 
under GL(F), there exists a unique right ideal r 
of© such that U = r(F'"). 

(2) Let r , , be right ideals of ©, and put 
t/j ==r,(F'"), G 2 = r 2 (F'"). Then tj =t 2 (as right 
S„,-modules) if and only if = IJj (as repre- 
sentation spaces of GL(F)). 

(3) The mapping r->r(F'") is a lattice iso- 

morphism of the ^lattice of right ideals of © 
onto the lattice of GL( F)-invariant subspaces 
of F'". Hence if r = r, +t 2 (direct sum), then 
U = + U 2 (direct sum). Also, r(F“) gives an 

irreducible representation of GL(F) if and only 
if r is a minimal right ideal of ©. 

Since the algebra K[S„,] is a tsemisimple 
algebra, © can be considered as a two-sided 
ideal of /C[S„], Hence a minimal right ideal r 
of © is also a minimal right ideal of K[S„], 
and the fidempotent element £ which generates 
r is a tprimitive idempotent of From 

the theory of symmetric groups (— ^ 362 Repre- 
sentations H) the primitive idempotents of 
/C[Sm] are all given (up to isomorphism) by 
tYoung’s diagrams T(/i , . . . ,/^) (/i > . ■ . > 

f^>0, m= fi + . . . + fk)An this setting, we have 

(4) Let e = £(/i , . . . ,/J be the primitive 
idempotent determined by Young’s diagram 
T(/i , . . . ,/J. Then eK [S^] c= © if and only if 
k^n. In this case, put £X [S„,] = r, r ( F™) = 



£(F'")= F'"(T(/i, ...,/J) and denote the 
irreducible representation of GL(F) on 
F-l T(/i , . . . ,/J) by /I ^ D( A; , . . . ,/,). We call 
(/i, the signature of this irreducible 
representation. 

(5) The representation D{A;f^, ...,/t) is an 

irreducible polynomial representation of 
GL(F). Furthermore, for any irreducible poly- 
nomial representation p of GL(F), there exists 
a unique D{A;fi, equivalent to p. For ex- 
ample, if /c= 1, then fi = m, £ = (m!)“^X„e 5 „ci', 
and F'"(T(m)) is the space of tsymmetric ten- 
sors of degree m. If /^ — ... 1, then k = m, 

e = (m!) F S^es„(sgna)a, and F'"(T(1, , 1)) 

is the space of talternating tensors of degree m. 

(6) Let x(A;/i, ...,/J be the tcharacter of the 
irreducible representation D{A; fi, ...,/jj). Then 









ph 



„rt-l 

^n-l ^n-2 



where £ ^ , . . . , e„ are the teigenvalues of /I and 
F=/i+(”-l). / 2=/2 + («-2), (set 

= . . . = = 0). Hence the degree d of 

D( 2 l;/j, ... ,/J is expressed as 



d = D(F,...,/„)/D(n-l,...,l,0), 



where D{x ^ , . . . , x„) = ~ ^j)- 

(7) In particular, denote the character 
of D{A,m) by p„ — p„{A). Then they satisfy 
\I -zA\~' -Po + PiZ + p 2 z^ + ... and 






Pfi P/1+1 



P/2-1 



P /2 



P/l+(«-l) 

P/2+("-2) 



P/„-(»-l) P/„-(n-2) ••• Pfn 



where we put /*+! =/t +2 = •■■ =/« = 0. P-i = 
p 2 = ... — 0. This matrix is simply written as 
Ip, ... , p,|, with the convention that in 
each row, we set /, = /j -|-(n — 1), = 

/n-l + F K^fn- 



E. Determination of the Rational Represent- 
ations of 5L(F) 

The rational representations of SL{V) are 
completely reducible. By restricting any 
irreducible representation D(A;/, , ...,/„) 

(/, ... ^/„^0) of GL(F) to SL(F), we 

get an irreducible rational representation 
D(A;/i, ...,/„) of SL(F). Furthermore, any 
irreducible rational representation of SL{V) 
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can be obtained in this way. D{A;fi ,/J 
and are equivalent representa- 

tions of SL(V) if and only if / 



F. Unitary Groups 

The set t/(n) of all n x n tunitary matrices with 
complex elements is a group under multipli- 
cation (— " 269 Matrices). This group U{n) is 
called the unitary group (or unitary transfor- 
mation group) of degree n. The subset of U{n) 
consisting of all matrices of determinant 1 is a 
normal subgroup of U (n). This group is called 
the special unitary group and is denoted by 
SU{n). 

U{n) and SU{n) are subgroups of GL(n, C) 
and SL{n, C), respectively, and can be obtained 
from these groups through the tunitary restric- 
tion. Hence they are both compact, connected 
Lie groups; in particular, S(/(l) is composed 
only of the identity and (/(I) is the multiplica- 
tive group of all complex numbers of absolute 
value 1. The center 3 of U{n) is the set of all 
diagonal matrices XI(/.eC,\A\= 1), and we 
have 

3^67(1), 3 -Sl/(n)-C/(n), 

U{n)/SU{n)^U{\). 

Moreover, for n ^ 2, SU{n) is a simple, semi- 
simple, and simply connected Lie group, which 
gives one of four infinite series of simple com- 
pact Lie groups. 

U{n)/i is denoted by PU{n) and is called the 
projective unitary group. We have the relations 
PU{n) ^ SU{n)ji n SV{n), 3 H SU{n) ^ Z/nZ. 
Hence PU{n) is locally isomorphic to SU(n). 



G. Irreducible Representations of U (n) 

Restricting the irreducible representation 
D{A;fi, of GL{n, C) on SU{n), we obtain 
a continuous irreducible representation of 
SU{n), and conversely, all continuous irreduc- 
ible representations of SU(n) are obtained in 
this manner. Similarly, any continuous irre- 
ducible representations of U(n) are given by 
A-*\A\^D{A;fi, ...,/*), where e is an integer. 
Since both U (n) and SU{n) are compact, any 
continuous representation of these groups 
can be decomposed into a direct sum of the 
irreducible representations mentioned above 
(-^ 69 Compact Groups). 

The representation theory of U{n) and SU{n) 
is important as the most typical and concrete 
example of the representation theory of gen- 
eral compact Lie groups 69 Compact 
Groups, 248 Lie Algebras, 249 Lie Groups). 



H. Unitary Groups over General Fields 

A unitary matrix and the unitary group can 
also be defined over some fields other than the 
field C of complex numbers. Namely, let P be 
a field and K a quadratic extension field of P; 
for an element ^ of K, let ^ be the tconjugate 
of over P. Then a matrix of degree n with 
entries in K is called a unitary matrix of K 
(relative to P) if it leaves invariant the tHer- 
mitian form (^1 + (^2 ^2 + • • ■ + The multi- 

plicative group consisting of all unitary mat- 
rices is called the unitary group over K (relative 
to P) and is denoted by U {n, K, P); its sub- 
group consisting of all unitary matrices of 
determinant 1 is called the special unitary 
group over K and is denoted by SU{n, K, P). 
The quotient group of SU(n, K, P) by its 
subgroup consisting of all /I/ (A"= 1, |/l| = 1) is 
called the projective special unitary group over 
K and is denoted by PSU{n, K, P). In partic- 
ular, when K and P are the finite fields F,2 
andF,(^ = p'"), U{n,K,P),SUin,K,P), 

PSU{n, K, P) are written simply as U{n, q), 

S U {n, q), PSU{n, q). Then for n ^ 3, each 
PSU{n,q) is a noncommutative simple group, 
except for P5L(3, 2) (^ 151 Finite Groups I). 

I. Orthogonal Groups 

The set of all torthogonal matrices of degree n 
(with real entries) forms a group under multi- 
plication. This group 0{n) is called the ortho- 
gonal group (or orthogonal transformation 
group) of degree n. The subset of 0{n] consist- 
ing of all orthogonal matrices of determinant 1 
forms a normal subgroup of 0{n) of index 2. 
This group SO{n) (also denoted by O^) is 
called the rotation group (special orthogonal 
group or proper orthogonal group) of degree n. 
Geometrically, 0{n) is the set of all orthogonal 
transformations leaving a point in Euclidean 
space of dimension n fixed, and SO(n) is com- 
posed of all rotations around the point. 

Both 0{n) and SO{n) are compact Lie 
groups, and SO{n) coincides with the con- 
nected component of 0(n) which contains the 
identity. For n = 3 or n ^ 5, each SO{n) is a 
simple and semisimple Lie group. Following 
the theory of Lie algebras, we divide the set 
of all SO(n) (n^3 but n^4) into two classes 
according as n is even or odd, and we thus get 
two of the four infinite series of simple and 
semisimple compact Lie groups (for SO (4), for 
example, see [1]). 

Although SO(n) (n^3) is a connected Lie 
group, it is not simply connected. The simply 
connected Lie group which is locally isomor- 
phic to SO(n) is called the spinor group and is 
denoted by Spin{n). SO{n) is isomorphic to the 
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quotient group of Spin{n) by a normal sub- 
group of order 2. Let 3 be the center of Spin{n). 
Then 3 ^ Z/2Z for odd n, 3 ^ Zj4Z for n = 2 
(mod 4), and 3 ^ (Z/2Z) © (Z/2Z) for n = 0 
(mod4)(-^ 61 Clifford Algebras). 

The group 0(n, C) of all tcomplex ortho- 
gonal matrices is called the complex ortho- 
gonal group, and the group SO{n,C) of all 
matrices in 0{n, C) of determinant 1 is called 
the complex special orthogonal group. SO{n, C) 
(n^3, «t^ 4) is a simple and semisimple com- 
plex Lie group. 

J. Irreducible Representations of Orthogonal 
Groups 

In the same way as for GL{n, K), the irreduc- 
ible representations of 0{n) can be obtained 
by decomposing the tensor product D^{A) 

— v4 ® ® A of m copies of an orthogonal 

matrix A using Young’s diagram. Namely, con- 
sider the Young’s diagram T(/i,/2, ...,/fc) 
such that the sum of the lengths of the first 
column and of the second column is not 
greater than n, and call it an 0{n) diagram. 
Then to any 0{n) diagram T = T(/, , . . . ,/J, 
there corresponds an absolutely irreducible 
representation D®(/l;/i,/2, and the 

representations D°(A;f^,f 2 , ... ,/t) correspond- 
ing to two distinct 0(n) diagrams are mutually 
inequivalent. D„(A) can be decomposed into 
the direct sum of those representations D^(A; 

such that /=/i + ... + /t takes the 
values m, in — 2, m — 4, ... . F urthermore, any 
continuous irreducible representation of 0(n) 
is equivalent to a D®(/4;/,,/2, obtained 
from some 0(n) diagram T = T(/j,/2, ... ,/fc). 

In general, two 0{n) diagrams T and T' are 
called mutually associated diagrams if the sum 
of the lengths of their first columns is equal to 
n and if the lengths of each column other than 
the first one coincide. In particular, if T= 
T(/i,/ 2, ...,/J and 2k = n, then T is said to 
be self-associated. The set of all 0{n) diagrams 
can be divided into pairs of mutually asso- 
ciated T, T' (and self-associated T = T'). 
Suppose that we are given mutually asso- 
ciated diagrams T and T' and that the length 
k of the first column of T= T(/, ,/2, ...,/J is 
not greater than n/2. Then the character Xri^) 
of D“(A;/i,/ 2, ... ,/J corresponding to T and 
the character yj-.(A) of the irreducible repre- 
sentation corresponding to T' are given by 

Pt-(v-2)~Pl~(v + 2)< •■•■<Pl~Pl-2v\^ 

xAA) = \A\xriA), r = [n/2], 

where p^ and ...| have the 

same meaning as in the formula for the char- 
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acters of irreducible representations of 
GL{n, K). 

The irreducible representations of SO{n) can 
be obtained immediately from those of 0{n). 
Namely, if T= T(/j,/2, ...,/^) is not self- 
associated, Z)°(A;/i,/ 2, is irreducible as 
a representation of SO{n), and the represen- 
tations of SO(n) derived from T and the asso- 
ciated T' coincide. If T is self-associated, 

D°{A', /i ,/a, • . . ,/k) can be decomposed into 
two irreducible representations of SO(n) of the 
same degree over the field of complex num- 
bers. Furthermore, the irreducible represen- 
tations of SO{n) obtained in this way from 
different pairs of associated diagrams are 
mutually inequivalent, while any continuous 
irreducible representation of SO(n) is equiva- 
lent to one of these representations. For the 
representations of S0(3) (the rotation group of 
degree 3) ^ 353 Racah Algebra. 

Since SO{n) is isomorphic to the quotient 
group of Spin(n) by a normal subgroup N of 
order 2, a continuous representation of Spin{n) 
which is not the identity representation on N 
can be considered as a double-valued repre- 
sentation of SO(n). This representation is called 
the spin representation and is important in the 
field of applied mathematics. 

The orthogonal group 0{n) consists of all 
nxn real matrices which leave invariant the 
quadratic form + + while the group of 

all n X n real matrices which leave invariant the 
quadratic form , -t- . . . -I- — — of 

tsignature (r, n — r) is called the Lorentz group 
of signature (r, n — r). The case for n = 4 and r — 
3 is used in special relativity (^ 359 Relativ- 
ity). Let Go be the connected component of the 
identity of the Lorentz group of signature 
(3, 1). Then Gq is called the proper Lorentz 
group. For G — (gij)eG, we have |<t|= ±1 and 
044 ^ 1 or 044 ^ — I. Moreover, we have Gq 
= {(t| |(t| = 1, 044 ^ 1 }, G/Go ^(Z/2Z) © (Z/2Z) 
(tfour group), and Go = S'L(2, C)/{±/}. 

K. Orthogonal Groups over General Fields 

Orthogonal groups can also be defined over 
other general fields than the field of real num- 
bers as follows: Fix a tquadratic form 
= S" j=i (I ay I #0) over a field K. Then a 

linear transformation of i^,(i = 1, 2, . . . , «) over 
K which leaves Q invariant is called an ortho- 
gonal transformation with respect to Q. The set 
of all orthogonal transformations forms a 
group. This group is denoted by 0{n, K, Q) or 
simply 0{Q) and is called the orthogonal (trans- 
formation) group over K with respect to Q. In 
particular, the normal subgroup of all trans- 
formations in 0(n, K,Q) of determinant 1 is 
called the special orthogonal group over K with 
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respect to Q and is denoted by SO(n, K, Q) (or 
simply SO(g)). 0{n) and SO{n) are special cases 
of 0{n, K, Q) and SO{n, K, Q), where K is the 
field R of real numbers and Q{^, i^) is the unit 
quadratic form + (^2 + ••• + <^n • 

Let Q{n, K, Q) be the tcommutator subgroup 
of 0{n, K, Q). Then this subgroup coincides 
with the commutator subgroup of SO{n, K, Q). 
If K is of characteristic ^2, and ifn^S and 
the tindex v of Q > 1, then Q{n, K, Q)/^ (3 is the 
center of K, Q)) is a simple group, where 3 
= {/} or 3 = I +/} (L. Dickson, J. Dieudonne). 
Suppose that is a finite field (of character- 
istic #2). Then we have v = m if n = 2m+ 1, and 
v = m or m—\ ifn = 2m. Hence v ^ 2 if n ^ 5. If 
v = 0 and X = R, we have D(n, R, Q)-SO(n) 
and, as mentioned before, SO(n)/^ is simple for 
fi > 5. The same proposition also holds when 
K is an ^algebraic number field (M. Kneser, 
1956). If K is of characteristic 2, then 0(n, K, Q) 
^ SO{n, X, g), 3 = {/}, and Q(n, X, Q) is a sim- 
ple group in many cases (Dieudonne [5]). For 
the case where X is a finite field (Dickson) 

151 Finite Groups I. 



L. Symplectic Groups 

Let (^ 1 , 1 ^ 2 ) and be two sets 

of variables, and suppose that the same linear 
transformation A over a field X acts on them 
(from the left). If A leaves the ^bilinear form 
— ^ 2 i^ 2 i-\) invariant, this linear 
transformation (or the corresponding matrix) 

A is called a symplectic transformation (sym- 
piectic matrix) of degree 2n. The set of all sym- 
plectic transformations (or matrices) of degree 
2n over X forms a group denoted by Sp{n, X) 
and called the symplectic group (symplectic 
transformation group, complex group, or 
Abelian linear group) over X. 

Any matrix in Sp{n, X) is always of determi- 
nant 1, and the center 3 of Sp{n, X) consists of 
/ and — /. The quotient group PSp{n,K) of 
Sp{n, X) by 3 is called the projective symplectic 
group over X. Except for the three cases n=\, 

X = F 2 ; n= 1 , X-F 3 ; and n = 2 , X = F 2 , the 
group PSp{n, X) (n^ 1) is always simple. 

Properties of Symplectic Groups as Lie Groups. 

When X is the field C of complex numbers or 
the field R of real numbers, Sp(n, X) is a Lie 
group. The intersection of the complex sym- 
plectic group Sp{n,C) and the unitary group 
U ( 2 «), namely, the unitary restriction of 
Spin, C), is denoted by Sp{n) and is called the 
unitary symplectic group (or simply symplectic 
group). Spin, C) is a simple and semisimple 
complex Lie group, and both Spin, R) and 
Spin) are simple and semisimple Lie groups. 
Moreover, Spin) is compact and simply con- 



nected and gives one of the four series of 
simple, semisimple and compact Lie groups 
(^ 249 Lie Groups). 

Let H" be the linear space of dimension n 
over the tquaternion field H. Define the inner 
product of two elements x = (xj , . . . , xj and y 
= {yi,---,yn) in H" by (x,y) = Xiyi -h ... -|-x„y„ 
(y,- is the ^conjugate quaternion of y,), and 
consider the group of all linear transforma- 
tions which leave this inner product invar- 
iant. Then this group is isomorphic to Spin). 
Spin) is thus compared with the orthogonal 
group 0 («), which leaves invariant the inner 
product of a linear space over the field R of 
real numbers and with the unitary group G(n), 
which has the same property over the field C 
of complex numbers (C. Chevalley [4,ch. 1]). 



M. Irreducible Representations of Symplectic 
Groups 

In the same way as for GL(n, X), the represen- 
tation D„,iA) = A® ...® A (tensor product of 
m copies of A) of Sp(n, C) can be decomposed 
into irreducible components using Young’s 
diagram. Namely, for any Young’s diagram 
T—Tifi,f 2 , ■•■,fk) ik^n) such that the num- 
ber k of rows is not greater than n, an irreduci- 
ble representation D^iA;f^, ...,/J of Spin, C) 
is determined. These D*(/l;/, , ...,f^) are mutu- 
ally inequivalent, and D„iA) can be decom- 
posed into the direct sum of representations 
D^iA;fi,f 2 ,...,A) such that/=/i + ... +/^ is 

equal to any of the values m, m — 2, m — 4, 

The character of D*(^;/i,/ 2 , ...,/J is given by 

Xr(A) = |p,_„+i , Pi-„+2 + Pi n, + Pi-2«+2 1, 

where p,- and |p,_„+,,P (_„+2 •••I have the 
same meaning as in the formula for the char- 
acters of the irreducible representations of 
GLin, K). 

For the matrices A in Spin), D%A;f^,f 2 , ..., 
/J gives rise to a continuous irreducible 
representation of Spin). Furthermore, any con- 
tinuous irreducible representation of Spin) is 
equivalent to a representation D\A; /i , /2 , • ■ ■ , 
/J corresponding to some diagram T. 



N. Relations among Various Classical Groups 

There are some isomorphisms (homomor- 
phisms) among the classical groups mentioned 
above. For general fields X [1,5]. For finite 
fields X 151 Finite Groups I. For X = R or 
C, the following isomorphisms hold: S'0(3) = 
SU i2)/{ ± /}, SU i2) ^ Sp(l), SOi5) ^ Spi2)/ 
{±/}, SO(6)^SO(4)/{±/}(- 248 Lie Alge- 
bras, 249 Lie Groups). 
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O. Classical Groups over Noncommutative 
Fields 

Let F be a right linear space over a non- 
commutative field K. Then the set of all linear 
transformations of V forms a group under the 
multiplication defined by the composition of 
mappings. This group GL{V) is called the 
general linear group on F. It is isomorphic to 
the multiplicative group of all n x n invertible 
matrices with entries in K. The eommutator 
subgroups SL(F) and SL{n, K) of GL(F) and 
GL{n, K), respectively, are ealled the special 
linear group of degree n on F and over K, 
respectively. Now, suppose that an element A 
of GL{ V) leaves each element of a subspace U 
of dimension n—\ of F fixed. Choose an ele- 
ment X of F which does not belong to U, and 
set Ax = x(x (mod U);aeK depends not only on 
A but also on the choice of x. However, the 
conj ugate class d = { AaA ' * j A e K * } of a in the 
multiplicative group K* of K is determined 
only by /I. In particular, if d = { 1 } and A^I, 
then A is called a transvection. For a ^matrix 
unit Eij, 5y(a) — 1 + ocEij is a transvection if 
iVj and a^O. SL{V) coincides with the sub- 
group of GL(F) generated by all transvections. 
This fact also holds when K is a commutative 
field, except when n = 2 and X = F2. In this 
case, transvections generate the whole GL(2, 2), 
which is isomorphic to the symmetric group 
S3 of degree 3 and does not coincide with 
the commutator subgroup. The center 3 of 
GL{n, K) consists of all scalar matrices corre- 
sponding to nonzero elements in the center of 
K. Let C be the commutator subgroup of the 
multiplicative group K* of K. Then for n>2, 
GL{n;K)/SL{n, K) is isomorphic to K*/C. This 
isomorphism can be obtained by appropriately 
defining, for AeGL{n, K), an element del A of 
K*/C which is called the determinant of A 
[6,9]. The center Jq of SL{n,K) is {a/|a"eC}. 
The quotient group PSL(n,K) = SL(n, K)/^q is 
called the projective special linear group of 
degree n over K. If X is a noncommutative 
field, then PSL{n, K) (n^2) is always a simple 
group [5, 8]. 

Next, let K be any field (commutative or 
noncommutative), and let F be a right linear 
space of dimension n over K. Consider a Her- 
mitian form /(x,y) (-^ 256 Linear Spaces) on 
F relative to an involution J of K. If for a 
fixed element £ in the center of K we have 
/(x, y) = £/'(y,x), then / is called an £-Hermitian 
form. For the rest of this article, / is assumed 
to be an £-Hermitian form on F. Let VF be a 
subspace of F. If f(x,y) = 0 for any x,yeW, 
then W is called a totally isotropic subspace. 
The largest dimension m of the totally iso- 
tropic subspaces of F is called the index of / 
We always have 2m^n. lif{Ax,Ay)^f{x,y) 
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for any x,ye F, then A is called a unitary trans- 
formation relative to/ 

The set U{n,K,f) of all unitary transforma- 
tions relative to / forms a subgroup of GL(F). 
This group is called the unitary group relative 
to / Also, the group SU{n, K,f) — U (n, K,f) H 
SL{n, K) is called the special unitary group. 
When J—l and £= 1, a unitary transformation 
(unitary group) is called an orthogonal trans- 
formation (orthogonal group), and U(n,K,f) is 
written as 0(n, K,f). Also, when 7=1 and 
£ = — 1, a unitary transformation (unitary 
group) is called a symplectic transformation 
(symplectic group), and U{n, K,f) is written as 
Sp(n, K). In fact, in these cases, for arbitrary 
choice of / the corresponding groups are 
mutually isomorphic. 

An £-Hermitian form / is called an £-trace 
form if for any xeV, there exists an ocg K 
which satisfies /(x,x) = a-f£a'^. If 7 = 1, £ = — 1 
(hence K is commutative) or £ = 1 and K is of 
characteristic j^2, then any £-Hermitian form 
is an £-trace form. If / is an £-trace form, a 
linear mapping B of any subspace VF of F into 
F such that for any x,yeW, f(Bx,By)=f{x,y) 
can be extended to an element A of the unitary 
group U{n,K,f) relative to / (Witt’s theorem). 
In particular, U(n, K,f) acts transitively on the 
maximal totally isotropic subspaces, and their 
dimensions are equal to the index m of / Now, 
let P be a Pythagorean ordered field (an or- 
dered field which contains square roots of 
any positive element). If K = P and 7 = 1, or if 
K — P(^ — 1), or if the noncommutative field 
is a ^quaternion algebra over P and 7 is the 
operation of +eonjugation of K, then for two 
Hermitian forms / /', their unitary groups 
U{n,K,f) and U(n\K,f') are isomorphic if 
and only if n = n', and the indices of / and /' 
are equal. In this case, U {n, K,f) can be writ- 
ten as U (n, m, K), where m is the index of / If 
the field K is a tquaternion algebra over P and 
/ is an tanti-Hermitian form, there exists an 
orthogonal basis (e,) of F such that /(e,-, = j 

(quaternion unit), Hence, in this case, 

the, unitary group U{n,K, f) relative to / is 
determined only by n and K. 

Suppose that we are given an £-trace form / 
over a general field K whose index m is not 
equal to 0. We exclude the case where 7 = 1 
and £ = 1. Then the unitary group U(n,K,f) 
contains transvections. Let T{n,K,f) denote 
the subgroup of U(n, K,f) generated by trans- 
vections which are unitary transformations. If 
m^2, then T(n,K,f) is the commutator sub- 
group of U {n, K,f). The center W„ of T(«, K,f) 
coincides with the intersection of T{n,K,f), 
and the center 3 of GL(n, K). If n> 3 and K 
contains more than 25 elements, then the quo- 
tient group T{n, K,f)/W„ is a simple group [6]. 
Also, if K is commutative and n^2, 1, 
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J/ 1, then T{n,K,f) = SU{n, K,f), except for 
the case where n = 3, K = F4.. 

If K is the field R of real numbers, the field 
C of complex numbers, or the quaternion field 
H, then GL{n, K), SL{n, K), and U{n, K,f) are 
all Lie groups. In particular, SL{n, K) and 
V («, K,f) are simple Lie groups except in the 
following three cases: (1) n = 1, X = R or C; (2) 
n = 2, X = R, J = 1, 1; (3) n = 4, K = R or C, 

J = 1, 1, m = 2. In cases (1) and (2) they are 

commutative groups, and in case (3) they are 
locally direct sums of two noncommutative 
simple groups. 

Suppose that X — H. Since H contains C as 
a subfield, a vector space V of dimension n 
over H has the structure of a vector space of 
dimension 2n over C. From this fact, GL(n, H) 
can be considered as a subgroup of GL{2n, C) 
in a natural way. 

Each of the complex classical simple groups 
G — SL(n, C), SO(n, C), Sp(n, C) has the struc- 
ture of an talgebraic group defined over R 
(^13 Algebraic Groups). The real forms of 

G, i.e., the algebraic subgroups of G whose 
scalar extension to C is G, can be realized as 
SL{n, K), U(n, K,f) corresponding to X — R, C, 

H. Namely, a real form of a complex classical 
group G is conjugate in G to one of the follow- 
ing groups: (i) The real forms of SL{n, C): 

SL{n, R) (type AI); SL{k, H) only for n = 2k 
(type All); and the special unitary group 

S U (n, m, C), 0 ^ m < [«/2], relative to a Her- 
mitian form / of index m (type AIII). (ii) The 
real forms of SO(2n + 1, C): the proper ortho- 
gonal group SO (2n -h 1 , m, R), 0 ^ m n, relative 
to a quadratic form of index m on a space of 
dimension 2« 4- 1 (type BI and BII). (iii) The 
real forms of SO(2n, C): SO{2n, m,R),0^m^n 
(type DI and DII); and U (n, H, f) relative to an 
anti-Hermitian form / on H (type Dill), (iv) 
The real forms of Sp{n, C): Sp(«,R) (type Cl); 
the unitary group U {2n, m, H), 0 ^ ^ n, rela- 
tive to a Hermitian form / of index m on H 
(type CII); and Sp{n) corresponds to the special 
case m — 0. The quotient groups of these real 
forms by their centers can all be realized as the 
groups of automorphisms of semisimple alge- 
bras with involutions J which commute with 
J {A. Weil [10]). 
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Clifford Algebras 

A. Definitions and Basic Properties 

Let V be an ^-dimensional tlinear space over a 
field X, and let Q be a tquadratic form on V. 
Denote the ttensor algebra over V by T(F), the 
tensor multiplication by ®. Let I{Q) be the 
two-sided ideal of T{ V) generated by the ele- 
ments X 0 X — Q{x) • 1 (xe V). The resulting 
^quotient associative algebra T{V)/I{Q) is then 
denoted by C{Q) and is called the Clifford 
algebra of the quadratic form Q. The elements 
of C{Q) are called Clifford numbers. 

The composite of two canonical mappings 
t: F->-T(F), (r:T{V)-yC{Q) is a linear injection 
aoz'.V~^C{Q). Hence we can regard F as a 
linear subspace of C{Q) via got. Then C(Q) 
is an associative algebra over X generated 
by 1 and V. Furthermore, x^ — Q{x)- 1 for 
every x in V. 

Indeed, C{Q) is the universal associative 
algebra with these properties. That is, let A be 
any associative algebra with a unity element, 
and let /: be a linear mapping such that 

/(x)^ = Q{X) ■ 1 for every x in F. Then / can be 
extended uniquely to an algebra homomor- 
phism f\C{Q)^A with /(!)== 1. Furthermore, 
let O be the tsymmetric bilinear form associ- 
ated with Q-.^{x,y) = Qix + y)-Q{x)-Q{y), 
x,yeV. Then xy -h yx = 9>(x, y) • 1 for every 
X, y in F. C{Q) is of dimension 2" over X. If 
Cl , . . . , is a basis of F, then 

l,Ci, CiC,(i <;■),..., Cl C2...c„ 

form a basis of C(Q). In particular, if {cj is an 
orthogonal basis relative to Q, we have 

e.ej=-€jei, cf = 0(Ci)‘l; 

(j=l,...,n, i^j. (1) 
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In this case, C(Q) can be defined as an as- 
sociative algebra (with a unity element) gen- 
erated by the {C;} together with the defining 
relations (1). In particular, for Q — 0, C{Q) is 
the texterior algebra (tGrassmann algebra) 
over V. 



B. The Principal Automorphism and the 
Principal Antiautomorphism of C{Q) 

There exists a unique automorphism a of the 
algebra C{Q) such that a(x)= — x for every x in 
V. This automorphism a is called the principal 
automorphism of C{Q), and we have = 1. 
Also, there exists a unique antiautomorphism 
^ of the algebra C{Q) such that ^{x) — x for 
every x in V. This antiautomorphism jS is 
called the principal antiautomorphism of C{Q), 
and we have 1. 

For the rest of this article we assume that 
the tdiscriminant of Q is We also assume 
for the sake of simplicity that the characteristic 
ofKis ^2. Let C+ • 1 -f 
+ ,..,and C~ =C-{Q)^V+V^ + V^ + .... 
Then C{Q) is the direct sum of the linear 
subspaces C^{Q) and C~(Q). Furthermore, 

C + C+cC+, C+C“cC',C-C+cC-,and 
C“C“c:C + .Thus C(0 = C+ + C‘ has the 
structure of a tgraded algebra with the index 
group { ±1}, and is a subalgebra of C(Q). 
The elements of C^{Q), C~{Q) are called even 
elements and odd elements, respectively. We 
have dim C^(0 = dime (Q) = 2" L 

C. The Structure of C(Q} and C^{Q) 

C{Q) and C^{Q) are both tseparable, tsemi- 
simple associative algebras over K. Suppose n 
is even: n = 2r. Then C{Q) is a tsimple algebra 
with K as its center; the center Z of C^(Q) is 2- 
dimensional over K. Let e be an ortho- 
gonal basis of V. Then 1 and 2 = 2''ci ... form 
a basis of Z, and we have 

z^=^2^^{-iyQ{e,)...Q{e„) = i-iyD, 

where D is the tdiscriminant of d> relative to 
the basis {e,}. Thus if ( — 1)''D has a square root 
in K, Z ^ K @ K (direct sum), and so C (Q) is 
decomposed into the direct sum of two simple 
algebras. If ( — 1)'’J5 does not have a square 
root in K, then Z is a field and C'^{Q) is a 
simple algebra. In particular, if the index of Q 
(i.e., the dimension of a maximal ttotally sin- 
gular subspace of F(^ 348 Quadratic Forms)) 
is r, C{Q) is isomorphic to the iotal matrix 
algebra of degree 2' over K, and C*{Q) is 
isomorphic to the direct sum of two copies of 
the total matrix algebra of degree 2''“^ over K. 

Now suppose that n is odd: n = 2r+l. Then 
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C (Q) is a simple algebra with K as its center. 
(In particular, if Q is of index r, then C^(Q) is 
isomorphic to the total matrix algebra of 
degree 2’' over K.) The center Z of C(Q) is 2- 
dimensional over K, and we have C(Q) = 
lfe„...,e„ is an orthogonal 
basis of V, then 1 and z = . . . e„ form a basis 

of Z. Putting z' = 2'"^^z, we have z'^ = 2( — 1)''D, 
where D is the discriminant of O relative to 
{ej. Thus if 2( — VyD has a square root in K, 
C{Q) is the direct sum of two 2^’'-dimensional 
simple algebras. If 2( — I)'’D has no square root 
in K, then C{Q) is a simple algebra. 

D. The Clifford Group 

Let G be the set of all invertible elements s in 
C{Q) such that sFs“* = K Then G forms a 
group relative to the multiplication of C(Q). 
This group G is called the Clifford group 
of the quadratic form Q. The subgroup G^ 

~ G n (Q) is called the special Clifford 

group. The linear transformation (p(s):x~> 
sxs“' of V induced by seG belongs to the 
torthogonal group 0(Q) of V relative to Q. 
Moreover, the mapping s-*(p(s) is a homomor- 
phism from G into 0(Q). Thus <p is a trepre- 
sentation of G on V. This representation cp is 
called the vector representation of G. The tker- 
nel of (p consists of invertible elements in the 
center Z of C(Q). If xeGD F, then Q(x)#0 and 
— (p(x) is the reflection mapping of F relative 
to the hyperplane orthogonal to x. If n = dim F 
is odd, (p(G) — (p(G'^) = SO(Q). If n is even, cp{G) 
= 0(0, <piG^) = SO{Q). 

Exploiting the principal antiautomorphism 

of C(0, we obtain a homomorphism N.G^ 
-yK* (the multiplicative group of K) defined 
by N{s) = li{s)s (seG'^), and N{s) is called the 
spinorial norm of seG"^. The normal subgroup 
of G"^ defined as the kernel of N is denoted by 
Gq and is called the reduced Clifford group (of 
Q). The subgroup (p{Gq) of SO{Q) is denoted 
by Oq (Q) and is called the reduced orthogonal 
group. 

In particular, when the ground field K is the 
real number field R, Oq (O coincides with the 
tidentity component of the tLorentz group 
0(Q). Furthermore, if Q is definite, Oq (0 = 
SO(n), so that the identity component Spin{n) 
of Go is a tsimply connected ^covering group 
of SO(n) via the covering homomorphism <p 
(with each point in SO{n) covered twice). The 
group Spinin) is called the spinor group of 
degree n. 

E. Spin Representations 

In this section we assume that the ground field 
K is the complex number field C and that n 
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= dim V^3. Then we have (Q) = SO{n,C), 

so Gq is a simply connected covering group of 
SO(n, C) via the covering homomorphism (p. In 
this section we denote Gq by Spin{n,C) and 
call it the complex spinor group of degree n. 
Spinin,C) is the tcomplexification (-^ 249 Lie 
Groups) of the compact Lie group Spin{n) and 
is a complex analytic subgroup of the tcom- 
plex Lie group C{Q)* consisting of all inver- 
tible elements of C{Q). With the bracket 
operation [x,y] = xy — yx, C{Q) becomes the 
tLie algebra of C{Q)*. Furthermore, the Lie 
subalgebra of C{Q) associated with the com- 
plex analytic subgroup Spin{n, C) is given by 
'Li<jCeiej, where Cj, is an orthogonal 
basis of V. The spin representations of the 
group Spin{n, C) are defined as follows: 

(1) When n is odd: n = 2r+ 1. Since C^{Q) is 
isomorphic to a total matrix algebra of degree 
2'' over C, C^{Q) has a unique (up to equiva- 
lence) ^irreducible representation p, which is of 
degree 2''. The restriction of p on Spin{n,C) (on 
Spin{n)) defines an irreducible representation p 
of degree T of Spin{n, C) (of Spin(n)); p is called 
the spin representation of the group Spin{n, C) 
(of Spin{n)). The elements in the representation' 
space of p are called spinors. Thus we can say 
that a spinor is a quantity with 2'’ components 
which obey the transformation law according 
to the spin representation (^ 258 Lorentz 
Group). This representation p defines a rep- 
resentation of the Lie algebra 5o(n, C) of 
Spin{n, C) (note that so(n, C) is a ^complex 
simple Lie algebra of type B,). This represen- 
tation of so(n, C) is also called the spin repre- 
sentation of so(n, C). Note that p is not well 
defined on SO(n,C) or on SO{n); p is of valence 
2 on SO(n,C) or on SO(n). 

(2) When n is even: n = 2r. Since C{Q) is 
isomorphic to a total matrix algebra of degree 
2' over C, C{Q) has a unique (up to equiva- 
lence) irreducible representation p, which is of 
degree 2^ The restriction of p on Spin{n, C) (on 
Spin(n)) defines a representation p of degree 2' 
of Spin{n, C) (of Spin{n)); p is called the spin 
representation of the group Spin{n, C) (of 
Spin{n)). This representation p is, however, not 
irreducible; p is decomposed into the direct 
sum of two irreducible representations p+ and 
p_ . They are not equivalent to each other, and 
both are of degree 2'~^. By taking a suitable 
minimal left ideal L of C{Q) as the representa- 
tion space of the representation p, we obtain 
the representation spaces L^, L~ of p^, p^, 
respectively, by putting — LDC^ {Q) and 

L =Lnc (0. The representation p+ (or p ) 
is called the half-spin representation of the 
group Spin{n,C) or of the group Spin{n). The 
elements in the representation space of p (or 
p~) are called half-spinors. Again, p^ and p~ 
are not well defined on SO(n,C) or on SO(n). 



They are of valence 2 on these groups. The 
representations of the Lie algebra of the Lie 
group SO(n,C) (note that this Lie algebra is a 
tcomplex simple Lie algebra of type D^) asso- 
ciated with p* , p~ are also called half-spin 
representations of this Lie algebra. 
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A. Cluster Sets of Functions Meromorphic in 
an Arbitrary Domain 

Let D be an arbitrary tdomain in the complex 
z-plane, F its boundary, and E a ttotally dis- 
connected closed set contained in F. Let w = 
f{z) be a single-valued tmeromorphic func- 
tion defined in D. Then for each point Zq in F, 
we can define the following sets related to the 
mapping w—f{z) in the complex w-plane (or 
on the complex w-sphere C). 

The Cluster Set. A value tx is called a cluster 
value of f{z) at Zq if there exists a sequence of 
points {z„} such that 

z„eD, z„^Zo, f(zj-ya. 

The totality Co(/,Zq) of all the cluster values of 
/(z) at Zq is called the cluster set of /(z) at Zq 
or, more precisely, the interior cluster set. It is 
a nonempty, closed, but not necessarily con- 
nected set. 

The Boundary Cluster Set. The set of all values 
a such that there exist a sequence of points 
{C„} of r — {zq} (resp. F — {zq} — E) and a se- 
quence of points {w„} in the complex w-plane 
satisfying 

C„^Zq, w„eCoif,C„), vv„->a 

is called the boundary cluster set of /(z) at Zq 
and is denoted by Cr(/ Zq) (resp. Cp_£(/, Zq)). 
These are closed sets, and 

Cr-fi(/ Zq) <= Crif, Zq) "= Coil Zo)- 

If ZqG F — £ or Zq is an isolated point of E, then 
Cp_£(/, Zq) - Crif, Zq). Furthermore, C^if, Zq) 
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(resp. Cy-^if, Zq)) is empty if and only if Zq is 
an isolated boundary point (resp. Zq is an 
exterior point of F — E). 

Range of Values. The set of values a such that 

z„eD, z„^Zo, /(z„) = a 

is called the range of values of f(z) at Zq and is 
denoted by R[y{f,ZQ). In other words, /?o(/,Zo) 
is the set of values a assumed by /(z) infinitely 
often in any neighborhood of Zq and is a tG^- 
set. 

The Asymptotic Set. Let Zq be an taccessible 
boundary point of D. If f{z) converges to a 
value a as z tends to Zq along a simple arc in D 
terminating at Zq, then a is called an asymp- 
totic value of / at Zq. The totality A^{f,Z(y) 
of asymptotic values of / at Zq is called the 
asymptotic set of / at Zq. If Zq is an inacces- 
sible boundary point, we let zq) be the 
empty set. 

B. Iversen-Beurling-Kunugui Theorems 

Suppose now that E is empty, and put 

Q = Co(/,Zo)-Cr(/,Zo). 

tiversen’s theorems in the case where D is the 
unit disk |z|< 1 and Zq is a point on |z| = 1 are 
generalized as follows. 

{ 1 ) Q - Ro(/, ^o) ^ ^ d(/ ^o) (K. Noshiro, 1936). 

(2) First Beurling-Kunugui theorem: If Zq is 
not an isolated boundary point, then Q is an 
open set. 

(3) Second Beurling-Kunugui theorem: Sup- 
pose that the open set D is not empty. Then 
f{z) assumes every value belonging to each 
component Q„ of infinitely often, with two 
possible exceptions belonging to 0„, that is, 

Q„ n (C — Roif, ^o)) consists of at most two 
values (an extension of t Picard’s theorem on 
an isolated essential singularity). 

Next suppose that E is not empty and of 
tlogarithmic capacity zero, and put 0 = 

then 

either a is an asymptotic value of /(z) at Zq or 
there exists a sequence of points (n = 

1,2,...) converging to Zq such that a is an 
asymptotic value of /(z) at each C„ (Noshiro, 
1937). Furthermore, if Zq is contained in the 
closure of F — £, then Q is an open set (which 
may be empty), and Q — Ro(/>^o) is at most of 
logarithmic capacity zero (M. Tsuji, 1943). If E 
is contained in a single component Fq of the 
boundary F, Zq is contained in the closure of F 
— £, and Q is nonempty, then /(z) assumes 
every value belonging to each component Q„ 
of Q infinitely often, with two possible excep- 
tions belonging to Q„ (Noshiro, 1950). In par- 
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ticular, if / is bounded in a neighborhood of 
Zq, the number of such exceptional values is at 
most 1. This is still true if each point of £ is 
contained in a boundary component that is a 
tcontinuum containing at least two points and 
Q is not empty (M. Herve, 1955). However, 
this conclusion does not hold if we remove the 
hypothesis on the set £ (K. Matsumoto, 1960). 

C. Cluster Sets of Functions Meromorphic in 
the Unit Disk 

Let D be the unit disk (|z|< 1}, ZQ — e‘^° be a 
fixed point on the unit circumference F, A be 
an open arc of F containing Zq, and £ be a set 
of tlinear measure zero such that ZqgEcA. 
With every e'^eA — E we associate an arbi- 
trary simple arc in D terminating at e‘^ and 
the curvilinear cluster set {f, e‘% defined as 
the set of all values a such that z„eAg, z„^e’^, 
/(z„)->a. We put 

cf-j(/,z„)= n M.. 

r>0 

where denotes the closure of the union 

e‘®) for all in the intersection of A — £ 
with |z — ZqI < r. 

By using the above cluster set C*_£(/, Zq) 
instead of the boundary cluster set Cr_£(/,Zo), 
we obtain results similar to those in the pre- 
ceding section (M. Ohtsuka, 1950; Noshiro, 
1955). Many interesting results have been 
obtained by F. Bagemihl and W. Seidel, E. F. 
Collingwood, O. Lehto and K. I. Virtanen, 
and others concerning the cluster sets of func- 
tions meromorphic in the unit disk. They 
studied functions of Seidel’s class U, normal 
meromorphic functions, and other functions 
where the class U is the totality of regular 
bounded functions in the unit disk possessing 
almost everywhere on |z| = 1 radial limits of 
the constant modulus 1; a nonconstant mero- 
morphic function /(z) in |zj < 1 is normal if 
the family {/(T(z))} is tnormal in the sense 
of Montel, where T(z) is an arbitrary con- 
formal mapping of | z|< 1 onto itself. 

D. The More General Case 

The definitions of cluster sets are also available 
for arbitrary functions for which neither ana- 
lyticity nor continuity is assumed. If there exist 
two simple arcs Aj and A 2 in the unit disk D 
terminating at a point z = e‘® such that 

CA^(/,^"')nQ^(/,e^«) = 0, 

then z — is called an ambiguous point of f 
Bagemihl proved the following: The set of 
ambiguous points of an arbitrary complex- 
valued function defined in the unit disk D is at 
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most countable [5]. Under the same hypoth- 
esis, the set of points e'^ such that 
Cr(/,c'®) is at most countable {Collingwood, 
1960). This result shows the importance of 
introducing the cluster set C^_£(/,Zq) that was 
previously mentioned. 

E. History 

The theory of cluster sets originated from the 
tvalue distribution theory of analytic functions 
in the neighborhood of their essential singular- 
ities. The first systematic results were those 
of F. Iversen and W. Gross, obtained about 
1920. Subsequent significant contributions 
were made by Seidel, J. L. Doob, M. L. Cart- 
wright, A. Beurling, and others. Since 1940, 
some important results have been obtained 
by K. Kunugui, S. Trie, Y. Toki, Y. Tumura, 

S. Kametani, Tsuji, Noshiro, and other Japa- 
nese mathematicians. Many results have been 
extended to tpseudoanalytic functions. As can 
be seen from the Bagemihl ambiguous point 
theorem, some properties of cluster sets are 
not intrinsic to analytic mappings [2, 5]. On 
the other hand, it seems to be an interesting 
problem to extend the theory of cluster sets to 
the case of analytic mappings between open 
Riemann surfaces. 
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Coding Theory 

A. General Notions of Coding Theory 

When we wish to store, to search for, or to 
send information in the presence of noise 
efficiently and with the least error, we can 



apply various bounds to the efficiency (-"213 
Information theory). Sophisticated coding 
operations are required in order to achieve 
efficiencies as close as possible to the bounds. 
In the abstract sense, the information is gener- 
ally understood to be a choice of an element 
from a finite set X. For implementation, we 
take a set Koi q elements called alphabets. 
Each element of K is called a letter. We con- 
sider the direct product K", i.e., the set of all 
sequences of n letters. An injection from X 
into K" is called encoding. The sequence ^{x) 
for xe Y is called a code word, and the image 
\j/{X) (a collection of code words) is called a 
code. Such a code is also called a block code. 
The noise is represented by a mapping cu : X" 

-> X", but usually it is restricted to a certain 
subset of the set of such mappings with 
special properties. For example, we usually 
assume that the sequences xeX" and tu(x) are 
different only at less than d letters, where d is a 
preassigned constant. The inverse mapping <p, 
i.e., a mapping (p:K"^X satisfying (potj/ix) — x 
for all X G Y, is called the decoding of ij/. A code 
\j/ satisfying the property 0 }O\{/{x)^\j/{X] for all 
xe Y and for all cue is called error-detecting 
with respect to the noise Q. If \f/ has the decod- 
ing (p satisfying <p o cu o i/c(x) — x for all x e Y 
and for all cueD, is called error-correcting 
with respect to the noise O. To discuss such 
properties, we can assume Y = i/^(Y) ( c X") 
without restricting the generality, so hereafter 
we assume this condition. Also, we take q = 2 
unless explicitly stated otherwise. Because 
in many communication systems q~2is 
commonly adopted, and the generalization 
for other prime powers q can be obtained 
naturally. 

B. Bounds for the Size of Codes 

Let x = (xi,...,x„), y = (yi, ...,y„)eX". The 
Hamming distance d(x,y) = dn(x,y) between 
the elements x and y is the total number 
of unequal bits (x.-^y,). We put d„,in = 
min{dH(x,y)|x,ye Y,x^y}. When e — 
max{dn(x, CL»(x))|cueQ, xeY} is less than 
the errors due to Q can be detected, and if 
^ 2c -I- 1, the errors ean be corrected. The 
maximal integer t satisfying d^i„^2t+ 1 is 
often called the error-correcting capability. 
Several important relations are known among 
t, n (the length of the code), and \X\ (the 
size of the code word) as follows. Hamming 
bound: |Y| ^$27E'=i(i). A code satisfying the 
equality here is called a perfect code [6]. Plot- 
kin bound: Yl/(2(|Yi — 1)). On the 

other hand, if the Varsharmov-Gilbert-Sacks 
bound 2"^*'>2;f=o”^(”T') is satisfied, there 
exists an (n, k)-linear code (Section C). 
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C. Linear Codes 

Let K = GF{2) = {0, 1 }, K" be an n-dimensional 
vector space over K, and X be a /c-dimensional 
linear subspace. Then X is called a group code 
or (n, /c)-linear code. In the present case, we can 
take a suitable basis of K for which there exists 
a kx{n — k) matrix P over K such that every 
vector X = (x 1 , . . . , x„) representing a code word 
is expressed as x = {zP, z) by a suitable vector z, 
and conversely, every vector of this form is a 
code word, i.e., a vector \ eX is a code word if 
and only if 

{x,,...,x„^t) = (x„-fe+i,...,x„)P = 0. (*) 

Introducing the matrix = — P^J, where 

/„ is the unit matrix of order n — k and ^ 
indicates the transpose, the condition (*) is 
equivalent to xH^ — 0. Therefore, H is called 
the parity check matrix, x„ x„ are 
called the information bits, and Xj , . . . , x„_j 
are called the check bits. If x is deformed to 
y = (ux by a noise co, i.e., the original signal x 
is transmitted as signal y, we call s = yH^ — 

{y — \)H^ the syndrome of the transmitted 
signal y. If we have an algorithm to determine 
the error vector e = y — x from the syndrome s, 
we have a decoding with error-correcting 
property. For the linear code, it is evident that 
dn,in = min {the number of nonzero elements of 
x|x 

The Hamming code is given by n = 2'" — 1 (m 
being an integer '^2), k = n — m, where the (/,))- 
component (i = 1, 1, ... ,n) of the 

parity check matrix H is given by the ith bit of 
the number j — 1 expressed in the binary (2- 
adic) number system. The check bits are Xj 
(y = 2®, 2S ...,2'"“^), and all other bits are the 
information bits. This code has the error-cor- 
recting property with respect to the noise oj 
for which the nonzero component of e = cox — 

X is at most 1. In fact, from the syndrome s = 
(sj, ... ,sj, we compute j = Si -I- 2 s 2 -I- . . . -I- 
2'"“'s„ and put e-= 1, e^ = 0 for all if j — 0, 

we can put e = 0. The Hamming code is a per- 
fect code. 



D. Cyclic Codes 

The cyclic code is a special case of a linear 
code. This is the object of one of the most 
important applications of the theory of finite 
fields. By using this theory, we can actually 
construct the codes with high for error- 
correcting capability in which the encoding 
and the decoding operations are performed 
algebraically. 

An («, k)-linear code X over GF(2) is called a 
cyclic code if x = (xi,X 2 , ... ,x„)e A implies (x„, 

X],X2,...,X„_i), {X„_1,X„,X,,...,X„_2),...,(X2, 
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...,x„,Xi)eA'. To each xeX, we can associate 
a polynomial of one variable x((J) = Xi -|-X 2 i^-I- 
... + x„(^"“’ over GF(2). Let QxiQ be the poly- 
nomial of the least degree n — k among the 
polynomials corresponding to xG2f. gxi^) is 
called the generator of X, because the property 
xgX and the property that x(<^) is divisible by 
gxi^) are equivalent. gxiO also divides <^"--1. 
Let the quotient be hxi^) = (C—'^)/gxii)- Then 
xeX is equivalent to x((^)/zj^-(i^) = 0 (mod((^” — 
1)) hxiO is called the parity check polynomial. 
The BCH code (Bose-Chaudhuri-Hocquenghem 
code) defined below is a typical example of a 
cyclic code. 

Let ot be an element of order n in Gf (2”) 

(the finite field with 2"' elements), and let g,((^) 
be the polynomial of the lowest degree in 
GF(2) [i^] for which a' is a root (i = 1, 3, . . . , 

2f — 1). The BCH code is given by the least 
common multiple g(i^) of the polynomials 
gi(^), 93(0> The BCH code has at 

least n — mt information bits and satisfies 

+ and algebraic decoding methods 
are known [3, 5]. The Hamming code is a 
BCH code when t= 1, n — 2"'— 1. 



E. Other Codes 

Other important types of codes include con- 
volutional codes (not block codes) for correct- 
ing errors in consecutive digits (hurst errors) 
and Goppa codes (an extension of the BCH 
class). For correcting burst errors, special 
cyclic codes or their mixtures are also used. 
This field of research is closely connected to 
information theory, algebra, and various ap- 
plications of combinatorial analysis, such as 
texperimental design [3]. 
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Cohomology Operations 

A. General Remarks 

The notion of cohomology operations was 
introduced by L. S. Pontryagin and N. E. 
Steenrod in order to solve homotopy classifi- 
cation problems 202 Homotopy Theory). 
Since then, numerous works have proved the 
importance of cohomology operations as 
applied to thomotopy theory, tdifferential 
topology, and other branches of topology. In 
fact, the use of cohomology operations is 
indispensable in studying problems related to 
thomotopy groups, tcharacteristic classes of 
manifolds, etc. 

We denote by the symbol H*(X;A) = 

Z H"{X; A) the ^singular cohomology ring of 
a topological space X with coefficients in an 
tAbelian group A. 

B. Primary Cohomology Operations 

A (primary) cohomology operation (or simply 
an operation) 9 is a tnatural transformation 

<?:]!«'*( ;B.) 

A n 

between the cohomology functors defined on 
the tcategory of topological spaces and con- 
tinuous mappings. That is, is a family of 
mappings satisfying the following conditions: 

(1) For each space X, <p defines a mapping 

A // 

that is not necessarily additive. 

(2) For each mapping /: T, the commu- 

tativity f*oq> = (pof* holds in the diagram 

A p, ^ 

Y\H‘^(Y;A)^Y[^"'^iY-, B). 

We list here two trivial examples. 

(I) Addition of cohomology groups deter- 
mines an operation cp : H'{X ; A) x H\X; A)-» 
H‘(X; A). 

(II) The tcup product determines an 
operation 

(p-.H''iX;A,)x H‘^(X; A 2 )^H‘^^‘^{X;A, ® A^) 
denoted by 
q){cc, (i) — a^p. 

The composite of two cohomology opera- 
tions is defined in the obvious way. Among 
cohomology operations the most important 
ones are operations of one variable. A co- 



homology operation of type (A, /; B, m) is a 
natural transformation 

(p:H\ ;A)^H'”{ ;B). 

C(T, /; B, m) denotes the Abelian group consist- 
ing of all such operations. 

Denote by H"'{A, /; B) the mth tcohomology 
group of an +Eilenberg-MacLane space K{A, 1), 
and let ueH\A, /; A) be the Tundamental class 
of K{A, /). If A is a CW complex, by assigning 
f*u to each f:X-yK{A,l), we obtain a one- 
to-one correspondence between the set of the 
homotopy classes n{X,K(A, /)) and the co- 
homology group H\X; A) (-- 70 Complexes 
F). Hence by utilizing condition (2), it can 
be shown that the value of q> on //'(A; A) is 
uniquely determined by its operation on 
H‘{A, 1; A). Thus the assignment cp-xpu defines 
the isomorphism 0(A, /; B, m) = H"'{A, 1; B). 

Here we should remark that the isomorphism 
n{K{A, 1), K{B,m))^H"'{A, 1; B) defines a one- 
to-one correspondence 0(A, /; B, m)^n{K{A, /), 
K{B, m)). In some cases H"'{A, /; B) vanishes, 
for example, when 0<m<l; I — 1 <m, A = Z; 
l~2, m = 2m' + 1, A = Z; l — 2l’<m = 2m'+l, 

A = Z B = Q;/ = 2r+l<m, A = Z, B = Q; A 
is finite, B — Q; etc. 

The following four types of operations to- 
gether with the two above are called elemen- 
tary operations: 

(III) Homomorphisms induced by a coeffi- 
cient homomorphism; There are homomor- 
phisms r]^ : H\X; A)->H'(A; B) induced by a 
homomorphism t]:A^B. 

(IV) Bockstein (cohomology) operation: This 
operation is given by the tconnecting homo- 
morphisms 3* :H‘{X;C)-^H‘'^^{X;A) asso- 
ciated with a short exact sequence 

C-^0 of coefficient groups (-^ 200 Homo- 
logical Algebra). For example, the coefficient 
sequence 0^ZAZ->Z„— >-0 (Z„ = Z/nZ) defines 
a Bockstein operation (or Bockstein homomor- 
phism), which is usually denoted by (l/n)(5 or 

A„. 

(V) Steenrod (or reduced) square operations 

Sq' {i^0):Sq‘ are sequences of operations 
defined by the following five axioms [2, 5]: 

(VI) For each pair of integers i^O and /^O, 

Sq':H‘{X;Z2)^H‘^^{X-Z2) 

is a natural transformation of functors that is a 
homomorphism. 

(V2) Sq^^l. 

(V3) If degx = i, then Sq‘x = x^ x (cup 
product). 

(V4) If degx<i, then Sq'x — 0. 

(V5) (Cartan formula) 

Sq‘(xwy)= ^ Sq^x^Sq'~jy. 

j = 0 




237 



These five axioms imply the following two 
formulas: 

(V6) Sq^ is the Bockstein operation fij of the 
coefficient sequence 0 ->Z 2 ^Z^~>’Z 2 ^ 0 . 

(V7) (Adem relations) If 0 < / < 2j, then 

Sq‘oSqJ^f^(^ '‘oSg^ 

k=o\i-2k ) 

The binomial coefficient is taken mod 2. 

We can extend the definition of the Sg' so 
that they operate on the relative cohomology 
groups. Now (VI), (V2), and (V5) imply: 

(V8) If b : H‘{ Y; Z 2 )^H‘+^{X, Y; Z 2 ) is the 
tcoboundary homomorphism, then boSq' = 
Sq'oS. 

(V') Steenrod pth power operations (i^O): 
Let p be an odd prime. Then is a sequence 
of operations defined by the following five 
axioms [5]. 

(VT) For each pair of integers i^O and /^O, 

: H'(X; Zp)-^//'+2‘'P“*>(A; Z^) 

is a natural transformation that is a homo- 
morphism. 

(V'2) ^^=1. 

(V'3) If dcgx = 2/c, then ^^x = x’’. 

(V'4) If degx<2/c, then ^*‘x = 0. 

(V'5) (Cartan’s formula) 

i 

^‘{x^y)= ^ 
j-o 

These axioms imply the Adem relations for 
and the Bockstein homomorphism Pp asso- 
ciated with the coefficient sequence O^Z^-^ 
Zp2^Z^^0 Appendix A, Table 6.II). 

We can extend the definition of to the 
relative cohomology groups too, and we ob- 
tain bo0^^ = ^^' ob. 

(VI) Pontryagin pth power operations 

Let p be a prime, is a system of operations 
satisfying the following five conditions [3]: 

(VI I ) For each pair of integers / ^ 0 and 

h^\, 

^p-.H\X-Zpk)->H^\X-Zp,..) 

is a natural transformation. 

(VI2) U q:Zph*\-*Zph is a homomorphism 
defined by >7(1)= 1, then we have q^o*'^pX — x’’. 

(VI3) If p:Zph^Zph+\ is a homomorphism 
defined by p(l) = p, then we have 

= ^~^pX + ^jPpy+ (J^^jp^jp^ix' ^y”-). 

(VI4) ^p(xwy)=‘:PpXw^^y. 

(VI 5) If p > 2 and deg x = 2k -I- 1 (odd), then 

^)3pX-0. 

Let A, B be finitely generated Abelian 
groups. Then the computation of H*{A, 1) 
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shows that each element of 0(A, /; B, m) can be 
written as the composite of a finite number 
of the operations of the types (I)-(VI) (H. 
Cartan). 

Here we show some examples. Let u be the 
fundamental class of X(Z2,2). Then the ele- 
ments of H^{Z 2 , 2 ; Z4) ^ Z2, //3(Z2, 2; Z4)^ 

Z2, H^(Z2, 2; Z4)^ Z4, and H^{Z 2 , 2; Z4)^ 
Z4-I-Z2 correspond to the cohomology opera- 
tions 22^, Ar]^ o(5/2), p'!) 32> and pq^o{b/4)o 
'’)S2 + 22^ o Sq^ oSq^, respectively, where 2, p 
denote integers satisfying 1, 0^ 

p^3 and >/:Z->-Z4, 2:Z2->Z4 stand for the 
homomorphisms defined by >;(1)= 1 , 2(1) = 2, 
respectively. 

Suppose that we are given sets of integers /,• 
and rrij. We define the stable (primary) coho- 
mology operation q> with respect to these sets 
of integers /,• and ttij as a system of natural 
transformations 

-^Bp) 

A ft 

satisfying the following condition, for all 
integers n^O; 

(3) Let S:H^^\SX; A)^H‘{X-, A) denote the 
■•■suspension. Then the eommutativity S o 9 — 
(poS holds in the diagram 

^ IS '^15 

n H”^‘^^HSX; A)^f] (5A^; B). 

I ft 

This condition is equivalent to the commuta- 
tivity with the coboundary homomorphisms. 

For example, the cohomology operations 

{-\rPp:H’'iX;Zp)^H"^HX-,Zp) 

define a stable cohomology operation p. Sq\ 

'')S‘ are also examples of stable cohomology 
operations. 

A stable cohomology operation cp of type 
(A, B) and of degree q is p sequence of coho- 
mology operations of type {A,n; B,n + q) de- 
fined for all integers n^O. 5I(A, B), denotes the 
Abelian group eonsisting of all stable coho- 
mology operations of type {A, B) and of degree 
q. When A^B, iH(A) = i:^"Lo5I(A, A)^ is a 
^graded algebra, where multiplication of two 
operations is given by their composition. Let p 
be a prime. Then 91 (Z^) is called the Steenrod 
algebra modp and is denoted by 9t(p). 91(2) is 
the augmented graded algebra over Z2 gen- 
erated by Sq' subject to the Adem relations. 
Suppose that we are given a sequence of non- 
negative integers / = (/,, j2, ... , i^)- We call / 
an admissible sequence if ij_, ^ 2i^ holds for 
2^s:$/c. We write Sq‘ -Sq^' oSq'^o ...oSq‘\ If 
I is an admissible sequence, we say that Sq' is 
an admissible monomial. The admissible mono- 
mials form an additive basis for 91(2), which 
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has the structure of a tHopf algebra whose 
tcomultiplication (2) ^'21 (2)® 31(2) is 
given by ijjSq‘ = J^j=QSq^ ®Sq‘~K The dual 
space 3l(2)* = Homz^(3l(2), Z 2 ) gives a Hopf 
algebra that is the polynomial algebra gen- 
erated by of degree 2 ‘ — 1 , where is the dual 
element of ^Sq^' ^...Sq^ with respect 
to the additive basis. The comultiplication 
(f>* : 3t(2)* ->3I(2)* ® 31(2)* is given by 

3I(p) has properties similar to 31(2) (p is an 
odd prime). In particular, 3I(p) is a tHopf 
algebra generated by and jS subject to the 
Adem relations with comultiplication ij/ given 
+ and ijj^' = ® 

An additive basis of 3I(p) is given by 
{jgco^'i where ^, = 0 or 1 , 
pim+i -t-fim (m> 1) and 1. Such a sequence 
/ = (£o, I'l , £ 1 , . . . , (fc, £fc) is also called an admis- 
sible sequence. Denote the dual element of 
and 

and Tj, respectively; then the dual algebra 
3I(p)* is isomorphic to the tensor product of 
the polynomial ring over Zp generated by , 
(^ 2 , and the exterior algebra over Zp gen- 
erated by Tj, T 2 , The comultiplication (p* is 

given by (p*{^,) - 2 j=o ® and (p*(T;) = 
ti ® 1 -1- S}=o ii-j ® '^j [5] (-^ Appendix A, 
Table 6 .ITI). 



C. Secondary Cohomology Operations 

Here we restrict our attention to a special 
type of operation treated by J. F. Adams [4] 
that has been proved to be powerful in 
applications. 

For a specified prime p, we write H'*'{X) — 
H‘(X;Zp). We can regard H^{X) as a 
■•■graded left module over 3I(p). Now let Q 
(s = 0 , 1 ) be a pair of deft free modules over 
3I(p), and let d:Ci->Co be a graded homomor- 
phism over 3I(p). Suppose that we are given an 
element zeCi such that dz — Q. For future pur- 
poses, Co and Cj are assumed to have bases 
{co,^} and {ci,p} with degCo.;i = /;i and degCj.^ 
= mp, respectively, in terms of which d has the 
representation dci_p = ZAn^,A^o,;.- Then z is 
expressed in the form z — 'Z^bpCi p. 

We say that <F is a stable secondary coho- 
mology operation associated with the pair (d, z) 
if it satisfies the following four axioms: 

(1) Let £>"(d, X) be the module consisting of 
homomorphisms e.Cq-*H^{X) over 3t(p) of 
degree n such that £d = 0. Putting £(co,a) = ^/ 1 > 
we can assume that D"(d, A) = {£ = IIa^;iG 
n, H^^HX) 1 E, = 0} . Let Q"(z, X) be 
the submodule of H'^{X) consisting of ele- 
ments of the form <^(z), where ^ is an 3I(p)- 
homomorphism:C, ->//'^(A) of degree n—l. 



In other words, Q"(z, A) = Z^h^//'’+'"-“^(A). 
Then for each n ^ 0 and each X, is a 
mapping 

(i>\D''id,X)^H^iX)/Q''iz,X). 

(2) For each mapping f:X^Y, the commu- 
tativity /*od>=(ho/* holds in the diagram 

D"(d,X)^//+(X)/e"(z,X) 

t/* xr 

D"(d, Y)^H^{Y)/Q"(z, F). 

(3) Let S:H"^^{SX)^H''iX) denote the 
suspension. Then the commutativity S o (b = 

0 o S holds in the diagram 

D%d,X)^H^{X)/Q’'{z,X) 

£>" + 1 (d, SX)^H + {SX)/Q''^^ (z, SX). 

(4) Let i \ Y^X be an injection such that 
i*o£ = 0. That is, — We can then find 
homomorphisms (over 3I(p))p:Co->//^(X, 7) 
of degree n and C : Cj ^(7) of degree n — 1 
such that the following diagram commutes: 

H^iY)CH^{X)CH^{X, 7)^f/ + (7)t//+(X) 

I TC 

I ^ ^ ^ ^ ^ 

Then for any such pair (i/, () we have i* o d)£ = 
Czmod(*Q"(z, X), where <5* denotes the homo- 
morphism defined by ^*y = (— 

For each pair (d,z), there is at least one 
associated operation <1>. Existence is proved by 
means of a tPostnikov system. Let d> and <I>' be 
two operations associated with the same (d, z). 
Then they differ by a stable primary operation 
in the sense that there is an element (peC^ldC^ 
such that 0'£ = 0£ - 1 - ^£ mod Q{z, X), where 
= nioddCi means ^£ = Sa<3a^a- 

For example, define the action of 31(2) on 
Z 2 by the rule a • v = 0 if deg a > 0 and 1 • v = v 
for each veZ^. We consider a minimal projec- 
tive resolution 

O^Z 2 <^Co^Ci^C 2 ^... 

(-^ 200 Homological Algebra). First we take 
Co = 31(2) and define £(1)= 1. Next we take Cj 
to be free over 31(2) with generators c,-, where 
0^1 and dyCi — Sq^'. Furthermore, we take C 2 
to be free over 31(2) with generators Cij with 
1 and 

d2{CiJ^Sq^'cj+ X Kj.kCk 

Osik<j 

with h,- j ^g3I(2). Here the d^Ci — Sq^' form 
a minimal set of generators of 31(2), and 
the equations 0 = dyd 2 Cij = Sq^‘ oSq^^ + 
Tio^k<jbi,i.k^^^'‘ form a minimal set of gen- 
erators over 31(2) of the Adem relations. Let 
Cl ( 7 ) be the submodule over 31(2) generated 
by with 0 k ^ j in Cj , and let d 1 (j) be the 
restriction of d, on Ci(j). We write z,_j = d 2 C,_j. 
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Let j denote an operation associated with 
{di{j),Zi j). Then <1)( ^ is defined on the sub- 
module D"j{X) consisting of elements xe 
H^iX) such that Sq^'‘x ^Oior O^k^j and 
takes values in modulo the 

submodule 



QljiX) = Sq^‘H"^^^-^X) 

OKk<j 

For example, (Fq o^Ker^ 2 ^Cokj 82 is the 
generalized Bockstein homomorphism defined 
by 6j2^ and (bi j is the operation discovered 
by J. Adem that appears as the ttertiary ob- 
struction of S" for n^2. If k^3, we have the 
relation 

Z 



mod E “i.jMJX) 



j#i + l 



These formulas can be applied to prove the 
nonexistence of an element with tHopf- 
invariant 1 in n 2 „-i(S'‘) unless n= 1,2,4 , 8 
(Adams [4]). Analogous formulas also hold 
for 'il(p) with p>2. We have no satisfactory 
theories concerning cohomology operations of 
orders higher than 2. For cohomology oper- 
ations in generalized cohomology theory ^ 
237 K-Theory. 



References 



[1] N. E. Steenrod, Products of cocycles and 
extension of mappings, Ann. Math., (2) 48 
(1947), 290-320. 

[2] J.-P. Serre, Cohomologie modulo 2 des 
complexes d’Eilenberg-MacLane, Comment. 
Math. Helv., 27 (1953), 198-232. 

[3] N. E. Steenrod and P. E. Thomas, Coho- 
mology operations derived from cyclic groups. 
Comment. Math. Helv., 32 (1957), 129 152. 

[4] J. F. Adams, On the non-existence of ele- 
ments of Hopf invariant one, Ann. Math., (2) 
72(1960), 20-104. 

[5] N. E. Steenrod, Cohomology operations, 
Ann. Math. Studies, Princeton Univ. Press, 
1962. 

[ 6 ] J. F. Adams, On the structure and applica- 
tions of the Steenrod algebra, Comment. 

Math. Helv., 32 (1958), 180-214. 

[7] R. E. Mosher and M. C. Tangora, Coho- 
mology operations and applications in homo- 
topy theory, Harper, 1968. 

[ 8 ] E. H. Spanier, Algebraic topology, 
McGraw-Hill, 1966. 

[9] G. W. Whitehead, Elements of homotopy 
theory, Springer, 1978. 



65 (IX.17) 

Combinatorial Manifolds 

A. Introduction 

First of all, we explain some notions in geo- 
metric combinatorial topology, also called PL 
(piecewise linear) topology, in order to distin- 
guish between the usual topological view- 
point and the combinatorial (or PL) one. By 
an n-dimensional simplicial complex K we 
mean an n-dimensional, tlocally finite and 
trectilinear simplicial complex in the Euclidean 
N-space R''^, i.e., an n-dimensional ^Euclidean 
simplicial complex (^ 70 Complexes). The 
subspace P — \K\ of R^ covered by all sim- 
plexes of K is called an n-dimensional poly- 
hedron, and K is referred to as a simplicial 
division or simply a division of P. A poly- 
hedron covered by a tsubcomplex of a division 
of P is called a subpolyhedron of P. A poly- 
hedron contained in a polyhedron P is a sub- 
polyhedron of P if and only if it is a closed 
subset of P. An open subset of a polyhedron P 
is always a polyhedron but not necessarily a 
subpolyhedron of P [13]. For arbitrary divi- 
sions Xj and K 2 of a polyhedron there is a 
common ^subdivision of X, and X 2 . Thus a 
property of a simplicial complex X which is 
invariant under subdivision is called a com- 
binatorial property of X or of a polyhedron 
|X|. Two polyhedra P and Q are said to be 
combinatorially equivalent if they have tisomor- 
phic divisions X and L, respectively. Then a 
homeomorphism / :P->Q which is induced 
from an isomorphism /:X->L is called a 
combinatorial equivalence. Since a combina- 
torial equivalence / :P^Q is an isomorphism 
in the category of polyhedra and PL (tpiece- 
wise linear) mappings, (PL maps) which is 
called the PL category, it is also called a PL 
homeomorphism or a PL isomorphism, and P 
and Q are said to be PL homeomorphic or PL 
isomorphic. A ttopological space X which 
is homeomorphic with an n-dimensional 
polyhedron P is said to be triangulable and 
is referred to as an n-dimensional topological 
polyhedron. Then a division X of P, or a 
homeomorphism V.P^X, is called a triangula- 
tion of X. A division X of P is a triangulation 
of P. However, there is a compact polyhedron 
of dimension n (^5) which has a triangulation 
X not combinatorially equivalent to its divi- 
sion ( ^ 70 Complexes; for triangulation prob- 
lems on manifolds ^ Section C below). The 
notion of a simplicial complex or cell complex 
had been introduced as a tool to describe 
topological invariants of triangulable mani- 
folds (H. Poincare [1]). The notion of an n- 
dimensional combinatorial manifold (i.e., a 
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polyhedron locally combinatorially equivalent 
to an n-dimensional simplex) was established 
as an important geometric object in com- 
binatorial topology by means of J. H. C. 
Whitehead’s theory of regular neighborhoods 
[2, 1939, 1940]. The 1960s saw remarkable 
results in the study of combinatorial manifolds 
together with the topological study of dif- 
ferentiable manifolds, which gave rise to con- 
temporary combinatorial or PL topology 
[3,4]. The fundamental conjecture (Haupt- 
vermutung) for combinatorial manifolds and 
the combinatorial triangulation problem of 
manifolds were negatively solved by R. C. 

Kirby and L. C. Siebenmann [5, 1969]. In the 
1970s, important results about general trian- 
gulations were obtained by T. A. Chapman [6] 
R. D. Edwards, and J. W. Cannon [7], 

B. Pseudomanifolds and Homology Manifolds 

In this section we explain some topological 
properties of triangulable manifolds which can 
be defined by means of incidence relations 
among their triangulations. A tpolyhedron M 
= I K I is called an n-dimensional pseudomani- 
fold if K satisfies the following three con- 
ditions: (i) Every simplex of K is either an n- 
simplex or a face of an n-simplex; (ii) each 
(n — l)-simplex is a face of at most two n- 
simplexes; (iii) for any two n-simplexes cr, x of 
K, there exists a finite sequence of n-simplexes 
a — OQ,o^,...,a^ — x such that <T; and (7,+i have 
an (n — l)-face in common. 

Consider the set S of (n— l)-simplexes of K, 
each of which is a face of only one n-simplex. 
Then the set of all ere S together with their 
faces forms a subcomplex L of K. The poly- 
hedron |L| of L is called the boundary of the 
pseudomanifold M = \K\ and is denoted by 
dM as well as by dK = L. The boundary 
of a pseudomanifold is not necessarily a 
pseudomanifold. 

Let |K[ be an n-dimensional polyhedron (or 
tcell complex). A point p in [K] is called a 
regular point of | K | if it has a neighborhood in 
|K| homeomorphic to an n-dimensional sim- 
plex; otherwise, p is called a singular point. A 
pseudomanifold without singular points is a 
topological manifold, called a triangulated 
manifold. An n-dimensional polyhedron |K| is 
a pseudomanifold if and only if the set of all 
regular points in | K 1 is dense and connected 
and the set of all the singular points is of di- 
mension less than n — 1. Thus a connected 
triangulated manifold is a pseudomanifold 
whose boundary coincides with the boundary 
as a topological manifold. 

Let Tj, T 2 be oriented n-simplexes of an n- 
dimensional pseudomanifold M — \K\. Suppose 



that they have an (n— l)-face a in common. If 
the tincidence numbers of these simplexes with 
a satisfy the relation [tj , c] = — [^ 2 , 0 '], then Tj 
and T 2 are said to be coherently oriented. We 
call M — \K\ with all its u-simplexes oriented 
an oriented pseudomanifold if any two n- 
simplexes x^ and T 2 are coherently oriented 
whenever they have an {n— l)-face in common. 
For an oriented pseudomanifold, the formal 
sum of all its oriented n-simplexes forms an 
integral cycle of the pair (M, dM), which is 
called the fundamental cycle of (M, dM). Its 
homology class, or fundamental class, gener- 
ates the homology group H„{M,dM;Z) — Z. 
When M = |K| is nonorientable, the funda- 
mental class with coefficient Z 2 (Z 2 = Z/2Z) is 
similarly defined. A pseudomanifold is orient- 
able if and only if the topological manifold 
consisting of all its regular points is orientable. 
If for each point p of a polyhedron M — \K\ 
the local homology groups are /f;{p) = 0 (iV 
n) and W„(p) = Z or 0, we call M = |K| an n- 
dimensional homology manifold, where //,(p) — 
Hi(M, M — p\ Z). If M is connected, it is also 
a pseudomanifold. The set dM of points p of 
M = \K\ with H„{p) = 0is called the boundary 
of M and is an (n — l)-dimensional homology 
manifold without boundary. 

Let M = |X| be an n-dimensional homology 
manifold. Given simplexes <r,x oi K,o<x 
means that <r is a face of x and a^^x. By defi- 
nition, a vertex of the tbarycentric subdivision 
Sd K of K is the barycenter of a simplex of K, 
and a /c-simplex t of Sd K is defined to be 
|dod, ...dfcl, where ao,(Tj, are simplexes 
of K such that (Tq < ctj < . . . < (t^ and d; is the 
barycenter of cr,-. In the following we denote 
SdK by K'. For a q-simplex a of K, we de- 
note by K((t) the union of all (n — ql-simplexes 
jddi ... d„_g| of K' such that < ... 

Then K(cr) is an (n — (j)-dimensional homology 
manifold and pseudomanifold contained in M. 
K{a) is called the (n — < 5 f)-dual cell or simply the 
dual of o. Let be the subcomplex of K' 
consisting of simplexes of K' contained in 
K(a). Thus we have |K„| = |K(cr)|. We denote 
by (X(<t))' the union of simplexes of which 
do not intersect with o. The set of dual cells 
K* = {K((T)lc7eK} of simplexes of K is called 
the dual complex of K; and K* is also called 
the dual subdivision of |K|. K* satisfies the 
following conditions: 

(i) 1^1- Each K{a) is the tjoin 
<t*(K((t))‘ of the barycenter d and (K{(t))'. 

(ii) (t<t<^^(/C((t))’ =)K(t) {a,xeK), {K{a)) = 

(iii) If K{a)f]K{x)¥^0, then there exist sim- 
plexes of K with <7 and x as faces, and for the 
least simplex p of K with a and x as faces, it 
holds that K{a) fl K(r) = X(p). 

The boundary d(K{a)) of the (n-q)- 
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dimensional homology manifold K{(j) satisfies 
^(K((T)) = (K(ff))‘U((aX)((7)), where {dK){(7) 
denotes the dual cell of cr in dK, and we under- 
stand that {dK)(t7)^0 if K*l}{dK)* is 

called the dual complex of (K, dK) and is de- 
noted by {K,8K)*. For an r-simplex cr (O^r^ 

« — I), it holds that 

HiiiKia))', (5K(cr))'; Z) = 0, i^n-r-\. 

Let \K*\‘‘ be the union of i-dual cells of 
(K, dK)* - X* U (dK)* such that i ^ q. Let 
C,(X*) = //,(|X*|V|K*r’), and let d^:C,[K*) 
-^Cg-i{K*) be the tconnecting homomorphism 

in the treduced homology exact sequence 
of(|X*|MX*|‘'“MX*|« 2)(-201 Homol- 
ogy Theory F). Then C^{K*) is a free Abe- 
lian group generated by the q-dual cells 
of (X, ^X)*, and C{K*) = { C,(X*), d,} is a 
tchain complex. It follows from the property 
of H^{{K(o)) ,{dK{a))' ; Z) as above that 
H,(C(X*))-//,(iX|) as in the case of a +CW 
complex. 

Now suppose that X is oriented. For a q- 
simplex <T = |floai we give an torientation 
[ao,fli, ...,aj. Furthermore, we choose a,+i, 
a^+ 2 , ...,a„so that laQ,Ui, ...,a„] is the orien- 
tation of |ao«i induced from the orien- 
tation of X, and we give X(cr) the orientation 
determined by [d, (Tj , . . . , where <T; = 
lapCJi We define c- X(<r)= 1 for a and 

X {a) thus oriented, and x ■ K{(t) — 0 if r 
Let c = Z e C,(X) and c* = Z ^ijK{aj). Then 
we define c • c* £ Z by c • c* = Z,-. j ' K (cr,)). 
The integer c • c* is called the intersection 
number of c and c*. For homology classes 
[z]€//q(X) and [z*]6f/„_^(X*), the intersec- 
tion number z • z* is independent of the choice 
of representing cycles, and thus we define the 
intersection number [z] ■ [z*] —z-z*. 

When X is compact, the isomorphism of 
the Poincare-Lefschetz duality theorem 
H^(X,dX ;R)^ H''~‘’{X;R) is given in terms of 
intersection numbers. 

C. Combinatorial Manifolds and PL Manifolds 

A polyhedron which is PL homeomorphic 
with a fc-dimensional simplex or its boundary 
is called a PL /c-ball or a PL (fc-l)-sphere and is 
denoted by B'‘, S'‘~\ respectively. A simplicial 
complex X or a polyhedron M — |X| is called 
an n-dimensional combinatorial manifold if 
the ■'’star of each vertex in X is a PL /c-ball. If 
M is an n-dimensional combinatorial mani- 
fold, then so is an open subset of M. An n- 
dimensional PL manifold is a topological 
manifold X with a distinguished maximal 
coordinate system ti — |(C/i,lz,)}, called a PL 
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structure on X, which consists of homeomor- 
phisms Vi from open subsets f7, onto 

open subsets of an n-dimensional simplex 
such that hjOh0 : hi{Ui R Uj)~*hj{Ui R Uj) are 
PL homeomorphisms. For a PL manifold 
{X,n) there is a triangulation t:M-*X from a 
combinatorial manifold M such that for each 
iUi,hi)€n, hiOt\t~^{Ui) is a PL homeomor- 
phism from an open subset t“^(L,) of M onto 
li,(t/,). Thus the notions of combinatorial mani- 
fold and PL manifold are essentially the same. 
For a tsmooth manifold (X, o), there is a 
unique PL homeomorphism class of a com- 
binatorial manifold by means of a tsmooth 
triangulation. In this sense, smooth manifolds 
are combinatorial manifolds; but the converse 
of this is not true (^114 Differential Topol- 
ogy C). A triangulation of a topological mani- 
fold which is a combinatorial manifold is 
called a combinatorial triangulation. The trian- 
gulation problem for n-dimensional topolog- 
ical manifolds, especially the existence and 
uniqueness problem of their combinatorial 
triangulations, a long-standing problem, is 
stated as follows. CT„ (Combinatorial Triangu- 
lation Problem): Is every n-dimensional topo- 
logical manifold homeomorphic with a com- 
binatorial manifold? CH„ (Hauptvermutung for 
combinatorial manifolds): Homeomorphic n- 
dimensional combinatorial manifolds should 
be PL homeomorphic. When n ^ 3, CT„ and 
CH„ hold; this was shown for n = 2 by T. Rado 
[15] and for « = 3 by E. E. Moise [16], More- 
over, in this case, topological manifolds admit 
unique smooth structures, and homology 
manifolds are combinatorial manifolds. For 
M ^ 5, ^surgery theory was used and developed 
as an obstruction theory for “homotoping” 
a homeomorphism h:M^M' between n- 
dimensional combinatorial manifolds to a PL 
homeomorphism by D. Sullivan (1967). By the 
discovery of the torus-unfurling method for 
“isotoping” h:M-*M' to a PL homeomor- 
phism, Kirby reduced CT„ and CH„ for « ^ 5 
to CH„ for some specific manifolds, such as 
thickened tori modulo the boundary, for 
which surgery methods had been sufficiently 
developed (for isotopy ^ Section D). The 
solutions (l)-(9) can be stated as follows. 

(1) Classification of combinatorial trian- 
gulations. Assume «>5. Existence: An n- 
dimensional closed topological manifold X 
admits a combinatorial triangulation if and 
only if an obstruction A(X)g//'^(X;Z 2 ) van- 
ishes. Uniqueness: A homeomorphism h:M^ 
M' between n-dimensional closed combina- 
torial manifolds is isotopic (^ Section D) to a 
PL homeomorphism if and only if an obstruc- 
tion A(/i)e//^(X;Z 2 ) vanishes (Kirby and 
Siebenmann [5] ). 

In fact, for each « > 5, there is an n- 
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dimensional combinatorial manifold which is 
homeomorphic to a torus x ... x S* but is 
not PL homeomorphic, and there is an n- 
dimensional closed manifold which is not 
homeomorphic to a combinatorial manifold 
(Siebenmann [18]). 

As a special case, the following conjecture is 
proved for n^5. 

(2) The R-dimensional annulus conjecture. 
The closed region bounded by two mutually 
disjoint tlocally flat (n — l)-dimensionaI topo- 
logical spheres S, and Sj in an n-dimensional 
sphere S" is homeomorphic with an n- 
dimensional annulus x [0, 1] (Kirby [19]). 

Regarding the uniqueness problem for (not 
necessarily combinatorial) triangulations of 
topological manifolds, we mention the follow- 
ing deep result. 

(3) Double suspension theorem for homology 
spheres. The double suspensions of homology 
n-spheres are homeomorphic with where 
a homology n-sphere is a closed topological 
manifold with the same homology groups as 
those of S" (Cannon [7]). 

For each n^3, there is a homology n-sphere 
P” which is a combinatorial manifold with 
the nontrivial fundamental group (for n = 3, 
see Poincare [1, 1904] and for n>4, see M. 
Newman, 1948). The suspension of P” is an 
example of an (n+ l)-dimensional homology 
manifold which is not a topological manifold. 
The double suspension of P" is not a com- 
binatorial manifold but is homeomorphic 
with The problem of whether every n- 

dimensional topological manifold is homeo- 
morphic to a polyhedron is still open for 
4. This problem is reduced to a problem in 
homology 3-spheres (T. Matumoto; D. E. 
Galewski and R. J. Stern). 

In the following we list some results on 
combinatorial manifolds focusing on White- 
head’s theory of regular neighborhoods [2]. 
Let M = \K \ be an n-dimensional combina- 
torial manifold without boundary. The dual 
complex K* consists of PL balls. Thus the 
division K of M gives rise to a decomposition 
into PL thandles H{K) of M in such a way 
that for each /c-dimensional simplex aeK, 
the star StK..((j) of its barycenter a in the sec- 
ond barycentric subdivision K" of K is a PL 
handle of index k. If L is a subcomplex of K or 
K*, then the star StK"(|L|) of |L| in K" is a 
subhandlebody of H{K) or of its dual han- 
dlebody H*(K) = H{K*), denoted by 
or respectively. In particular, if M 

is closed, orientable, and of dimension 3, 
then U = Hj,(X'’>) and F=//j,*(K*,) are 
PL homeomorphic with a PL boundary- 
connected sum of some copies of a solid torus 
and M=C/UF, UHV^dU^dV, 
where X'*'* and stand for the /c-'^skeletons 



of K and K*, respectively. This decomposition 
{U , V; dil = dV) of M is called a Heegaard 
decomposition (or splitting) of M [12], We say 
that a simplicial complex L collapses elemen- 
tarily to a subcomplex Lj of L, referred to as 
an elementary collapse L\Li,ifL — Lj = 

{(7, t}, where cr is a /c-dimensional simplex of L 
and T is a (/c — l)-dimensional face of a which is 
not a face of any simplex in L — {a, t}. We say 
that a polyhedron P collapses to a subpoly- 
hedron 2 of P or g expands to P, referred to 
as a collapse P\Q or expansion Q/ P, if there 
is a division L of P containing a subcomplex 

L, dividing Q so that there is a sequence of 
elementary collapsings L \ Lj \ . . . \ L,. For a 
subpolyhedron P of a combinatorial manifold 

M, a subpolyhedron U of Af is called a regular 
neighborhood of P in M if (1) U is a combina- 
torial manifold which is a closed neighbor- 
hood of P in M and (2) U \P.ln general, for a 
subcomplex L of a simplicial complex K, the 
^star Stjf..(|L|) of |L| in the second barycentric 
subdivision K" of K is called a second barycen- 
tric derived neighborhood or simply a derived 
neighborhood of |L| in |X| (E. C. Zeeman [4]). 

(4) Regular neighborhood theorem. Let P be 

a compact subpolyhedron of a combinatorial 
manifold M. Existence; Every derived neigh- 
borhood of P in M is a regular neighborhood. 
Uniqueness: Any two regular neighborhoods 
of P in M are PL homeomorphic by a map- 
ping keeping P pointwise fixed [2, 1939]. On 
one hand, this theorem is regarded as a com- 
binatorial counterpart to the Tubular neigh- 
borhood theorem of differential topology. For 
a topological manifold X with boundary dX, 
there are a neighborhood U of dX in X and a 
homeomorphism /i : 5A x [0, 1] (7, called a 

collar of dX in X, such that h(x,0) = x (xedA"). 
In particular, for a combinatorial manifold 

M = A we can take (7 as a derived neighbor- 
hood of oM in M and /i as a PL homeomor- 
phism, called a PL collar of dM in M. (For 
more about normal bundle theory 147 
Fiber Bundles Q). On the other hand, it is 
also regarded as a combinatorial deforma- 
tion theorem of a handlebody decomposition 
//^■'(L) of a derived neighborhood StK..(|L|). 

Two polyhedra P and Q are said to be 
simple homotopy equivalent if Q is obtained 
from P by a finite sequence of collapsings and 
expansions. A composite mapping /: P-^Q of 
the inclusion mappings and deformation re- 
tractions associated naturally to the expan- 
sions and the collapsings from P to Q is called 
a simple homotopy equivalence. Moreover, if P 
and Q are subpolyhedra of a combinatorial 
manifold M and those collapsings and expan- 
sions take place in M, then their regular neigh- 
borhoods are PL homeomorphic [2, 1940]. 

(5) A simply homotopy equivalence is clear- 
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ly a homotopy equivalence. Conversely, we 
have the simple homotopy theorem; For the 
homotopy class of a homotopy equivalence 
f .P-^Q between compact connected poly- 
hedra P and Q, there is a well-defined element 
W(f\ called the Whitehead torsion of / in the 
^Whitehead group Wh{G] of the fundamental 
group G = Ui{P) such that /is homotopic to a 
simple homotopy equivalence if and only if 
W{ f) = 0, ([2, 1950]; - also J. W. Milnor 
[ 20 ]). 

Let VF be a compact combinatorial mani- 
fold of dimension n+\.\{ dW consists of two 
connected components M and N such that 
the inclusion mappings M o W and N are 
simple homotopy equivalences, then (W;M,N) 
is called an s-cobordism of dimension n + \. 

The following theorem of D. Barden, B. Mazur, 
and J. Stallings is the nonsimply connected 
PL version of Smale’s t/i-cobordism theorem. 

(6) s-Cobordism theorem. For n^5, an s- 
cobordism (W;M,N) of dimension n-l- 1 is PL 
homeomorphic with (M x [0, 1]; M x 0, M x 1) 
(B. Mazur [21]; M. Kervaire [22]; C. Weber 
[23]; J. F. P. Hudson [8]). 

(7) Topological invariance of simple homo- 
topy types. Two homeomorphic compact 
polyhcdra arc simple homotopy equivalent 
(T. A. Chapman [6]). 

The famous Poincare conjecture that a sim- 
ply connected closed manifold of dimension 3 is 
homeomorphic to a 3-sphere is still unsolved 
despite much effort by many mathematicians. 
This conjecture is generalized for dimension n 
as follows. 

(8) Generalized Poincar6 conjecture in dimen- 
sion n. Any homotopy n-sphere 27" is homeo- 
morphic to an n-sphere, where a homotopy 
«-sphere is defined to be an n-dimensional 
topological manifold homotopy equivalent 

to an n-sphere. 

This has been affirmatively answered for 
n^5. The first proof was given by S. Smale 
1 14 Differential Topology F, K). From the 
viewpoints of collapsing and general position 
Section D), J. Stallings [3, 1960] and E. C. 
Zeeman [11, 198 204] proved the following. 

(9) Engulfing lemma. For a compact poly- 
hedron P of dimension m in the interior M = 

M — dM of an n-dimensional combinatorial 
manifold M, there is a PL n-ball in M contain- 
ing P in its interior (i.e., engulfing P), provided 
that M is ni-connected (i.e., t!:j(M) = 0 

and n — m^3. (For the topological version ^ 
M. H. A. Newman [24].) If a combinatorial 
manifold M = |X| is a homotopy n-sphere 
(n>5), then by the engulfing lemma, there are 
PL n-balls Bj and 82 engulfing 
and Hf^:{Kf 2 )) so that M. By the 

generalized Schoenflies theorem (^ Section 
G), B 2 — Bi = M — Bi is homeomorphic to 
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an n-ball. It follows that by the Alexander 
trick (-- Section D) M is homeomorphic to an 
n-sphere. Since CH„ holds for S" (n^5), the 
homotopy PL n-sphere M is actually PL 
homeomorphic with S'". 

D. Embeddings of Combinatorial Manifolds 

An injective PL mapping f:P^Q between 
polyhedra is called a PL embedding if /(P) is a 
subpolyhedron of Q (-*• Section A). Namely, 
a PL embedding / : P-^Q is a PL mapping 
which is induced from a simplicial injection 
f -.K^L, called a division of / for some divi- 
sions K and L of P and Q, respectively. In the 
following we explain PL embeddings from an 
m-dimensional combinatorial manifold M into 
an n-dimensional combinatorial manifold N. 
For a PL embedding / : M -^N, c — n — m is 
called the codimension of / If f^^{dN) — dM, 
f is said to be proper. Since an injective PL 
mapping /: M^iV is a PL embedding if and 
only if it is proper as a mapping between topo- 
logical spaces M and N (i.e., the preimages of 
compact sets are compact), this term “proper” 
for PL embeddings should not be confused 
with the term “proper” for mappings. Two PL 
embeddings /: M -> AT and /' : M' ->7V' are said 
to be (PL) equivalent if there is a PL homeo- 
morphism h:N^N', called an equivalence 
from / to /', such that ho f{M )= When 
M — M\ f and /' are said to be (PL) isomor- 
phic if there is an equivalence h:N-*N' from 
/ to /' such that hof=f'.A submanifold of 
N is defined as a subpolyhedron of N which 
is a combinatorial manifold. Thus the equiv- 
alence class of a PL embedding f:M^N is 
nothing but the PL homeomorphism class of a 
pair (A/,/(M)) of N and its submanifold /(M). 
For a submanifold of N, the same terms as 
those for the inclusion mapping are used. For 
every division /:X— >L of a (proper) PL em- 
bedding / :M^N and for xe M we have a 
(proper) PL embedding /j St^lx) from a PL m- 
ball Stjf(x) into a PL n-ball St^(/(x)) which 
represents the +germ of / at xeM. A PL em- 
bedding of a PL m-sphere into a PL n-sphere 
is called a PL (n, m)-knot or (n, m)-sphere pair 
and a proper PL embedding of a PL m-ball 
into a PL n-ball is called a PL (n, ni)-ball knot 
or (n, m)-ball pair. A PL (n, ni)-knot (a PL 
(n, m)-ball knot) is said to be trivial or un- 
knotted if it is equivalent to the inclusion 
mapping x0c.d/"'*^^(/'"c./"), where I = 
[—1, 1]. Then we have the following state- 
ments (l)-(5). 

(1) Zeeman’s unknotting theorem. Every PL 
(n, m)-knot and every PL (n, m)-bal! knot are 
unknotted, provided that n — m > 3. 

In general, an injection g .X-^Y between 
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topological manifolds is said to be locally flat 
if for each point xeX, there is a neighborhood 
(7 of X in so that the local germ of / at x is 

equivalent to the inclusion mapping (7 x Oc.f/ 
X r. Then / is necessarily proper, i.e., f~^{dY) 
= dX. A PL embedding f:M^N is said to be 
locally flat if it is locally flat in the PL cate- 
gory. Namely, f\ St^lx) is equivalent to x 
Oo/". Conversely, Zeeman’s unknotting theo- 
rem implies the following. 

(2) Every proper PL embedding of codimen- 
sion ^ 3 is locally flat. 

Since every locally flat PL embedding / : M 
admits a normal tblock bundle, the classi- 
fication of f :M->t7 up to isomorphism is 
reduced to the classification of block bundles 
(-^147 Fiber Bundles Q), where 17 is a derived 
neighborhood of /(M) in N. (For the general 
classification of locally flat PL embeddings by 
surgical methods C. T. C. Wall [25, Corol- 
lary 1 1.3.1]). 

In the codimension 2 case, there exist non- 
trivial knots (the classical knots) (^ 235 Knot 
Theory) and there is a proper PL submanifold 
which is not locally flat. For example, a com- 
plex projective curve V defined by x^ + y" 

= 0 can be regarded as a PL 2-sphere in the 
complex projective plane CP^, and for the 
inclusion mapping i: Vc^CP^, at 0^(0:0: 1), 
i| St^(O) is PL equivalent to a cone of a torus 
knot k:S^ of type {n,n— 1). For further 
results in higher-dimensional knot theory (-- 
235 Knot Theory) we mention the works of M. 
Kato and Y. Matsumoto [26] and S. Capell 
and J. Shaneson [27]. 

Here we consider only proper PL embed- 
dings of a fixed m-dimensional combinato- 
rial manifold M into a fixed connected n- 
dimensional combinatorial manifold N, so 
that a codimension 0 PL embedding is a PL 
homeomorphism. Two PL embeddings /q, 

are said to be pseudoisotopic if there 
is a proper PL embedding F: M x [0, 1]^N 
X [0, 1], called a pseudoisotopy from /q to 
/i, such that F(x, 0) = (/o(x), 0) and F(x, 1) = 
(/i(x), 1) (xgM). Moreover, if F satisfies the 
level-preserving condition F{M x {t})cN x {r} 
(0< 1), then /o and /j are said to be isotopic, 

and F or a homotopy {/} defined by F{x,t) — 
(/(x), t) ((x, t)G M X [0, 1]) is called an isotopy 
from /o to /j. When M — N,aPL homeomor- 
phism which is isotopic to the identity, or the 
isotopy of the identity itself, is called an am- 
bient isotopy of N. Two PL embeddings from 
M to N are said to be ambient isotopic if they 
are isomorphic by an ambient isotopy of N. 

Since /'‘ (/ = [ — 1 , 1 ] ) is a /c-dimensional 
^convex cell and is a cone 0 * dl'‘ of from 
the center 0, each point x of — {0} can be 
written uniquely as x = f • u for 0 < t < 1 and 
uedl^. Thus a PL embedding / ■.dl"'-^dr can 



be extended to a PL embedding F:I'”-^1’', 
called a cone extension or simply a cone of / 
by setting F(0) = 0 and F(t-u)~t-f(u) for 
t-uel'” — {0}. This method of cone extension is 
often referred to as Alexander's trick. By using 
Alexander’s trick and the uniqueness of regu- 
lar neighborhoods, one can show without 
much difficulty that (i) the isotopy classes of 
PL homeomorphisms of PL spheres or PL 
balls onto themselves are classified by their 
tdegrees (= +1) (V. K. A. M. Gugenheim, 

1953), and (ii) every PL (n, w)-knot or PL 
(n, m)-ball knot is equivalent to the standard 
one 5/'"'''^ X Oc.5/"^' or /"■ x Oc./" up to iso- 
topy, provided that n — m^i. This is why we 
define the notion of ambient isotopy in a dif- 
ferent way from the differentiable category, 
where the smooth isotopy is always covered by 
the ambient isotopy. However, for codimen- 
sion ^ 3, because of Zeeman’s unknotting 
theorem, the following holds. 

(3) The pseudoisotopy class, the isotopy 
class, and the ambient isotopy class of a PL 
embedding of codimension ^ 3 are the same 
(J. F. P. Hudson and E. C. Zeeman, 1964). 

In order to construct a PL embedding from 
a continuous mapping within its homotopy 
class, the notion of general position is useful. 

A PL mapping <p:M^N is said to be in gen- 
eral position if dim<p“/y)<0 (yeN) and 
dimS((p)^2m — n, where S(<p) is the closure of 
{xeM\(p~^{(p{x))^{x}} in M (J. F. P. Hudson 
[ 8 ]). 

(4) General position theorem. Every con- 
tinuous mapping if/:M-^N can be approxi- 
mated by a PL mapping cp:M^N which is in 
general position. 

The following theorem can be proved by 
sharpening the general position theorem and 
the engulfing lemma. 

(5) Irwin’s embedding theorem. Let 

be a continuous mapping from a compact 
combinatorial manifold of dimension m into a 
combinatorial manifold of dimension n such 
that ij/ 1 dM is a PL embedding from dM into 
dN. Assume that (i) M is d-connected, (ii) N is 
(d + 1 )-connected, and (iii) n — m ^ 3, where d — 
2m — n. Then is homotopic to a 

proper PL embedding f:M-^N relative to 
dM. Moreover, if M and N are closed and if 
(i*) M is {d+ l)-connected and (ii*) N is (d + 2)- 
connected, then two homotopic PL embed- 
dings from M into N are ambient isotopic (M. 
C. Irwin, 1965). 

E. 3-ManifoIds 

There has been a great deal of research done 
on 3-manifolds, and we have, among others, 
the following results (l)-(4). 
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(1) Sphere theorem. Let M be an orientable 
3-manifold with 7T2(Af)^0. Then there exists a 
PL embedding / of a 2-sphere into M such 
that / is not homotopic to 0 in M (C. Papa- 
kyriakopoulos [28]). 

(2) Dehn’s lemma. Let D be a 2-cell with 
self-intersections in a 3-manifold M, with its 
boundary a simple polygon C. If some neigh- 
borhood 1/ of C in D has no singularities, then 
there exists a 2-cell in M without singularities 
whose boundary is C. In 1910, M. Dehn as- 
serted this lemma, but his proof was incom- 
plete. In 1957, Papakyriakopoulos and T. 
Homma proved the lemma independently 
[28,29]. 

(3) Loop theorem. Let N be the boundary 
of a 3-manifold M and U be an open set 

in a component of N. If there is a closed 
curve in U which is homotopic to 0 in M 
but not in N, then there exists a simple closed 
curve in U homotopic to 0 in M but not 
in N (Papakyriakopoulos [30], J. Stallings 
[31]). 

(4) Unique decomposition theorem for a 3- 
manifold (J. W. Milnor [32], H. Kneser). We 
assume that all manifolds are connected, 
oriented, and triangulated 3-manifolds without 
boundaries and that all homeomorphisms are 
piecewise linear. Two manifolds M, M' are 
said to be isomorphic (MthM’) if there exists 
an orientation-preserving homeomorphism of 
M onto M'. Removing an open 3-cell from 
each of two 3-manifolds M, M' and identifying 
the boundaries of these removed cells, we 
obtain a 3-manifold M # M', called the con- 
nected sum of M and M'. A manifold that is 
not isomorphic to the 3-sphere is called 
nontrivial. A nontrivial manifold P is called 
prime if P cannot be decomposed as P — 

Ml # Mj with Ml and M2 both nontrivial. 

A manifold M is called irreducible if every 
2-sphere in M bounds a 3-cell. Then from 
the sphere theorem the following results can 
be deduced: If a compact 3-manifold M is 
nontrivial, then M is isomorphic to a con- 
nected sum Pi # P2 # • • • # A of prime mani- 
folds P; (i = 1, 2, . . . , k), where Pj , P2 , . . . , P^ 
are determined uniquely up to order. Every 
irreducible 3-manifold M has n 2 {M) — 0. Con- 
versely, if a 3-manifold M has 712 (M) = 0, then 
M is irreducible, provided that the Poincare 
conjecture is correct. (For further results 
on 3-manifolds ^ F. Waldhausen [10] and 
J. Hem pel [12].) 

Irreducible sufficiently large 3-manifolds 
[10] can further be decomposed into Seifert 
fibered spaces and “simple manifolds” which 
contain no essential tori (W. Jaco and P. B. 
Shalen, K. Johannson). Existence of hyperbolic 
structures on “simple manifolds” has been 
extensively studied by W. Thurston. 
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F. Wild Spaces 

Nonclosed 3-manifolds behave very differently 
from closed 3-manifolds. For example, there 
exists an open 3-manifold U that has the same 
thomotopy type as an open 3-cell but is not 
homeomorphic to it (M. H. A. Newman and J. 
H. C. Whitehead, 1937). The construction of 
this and similar examples generally involves 
infinite processes. Such examples, which do 
not exhibit the properties we expect, are com- 
monly termed pathological (or wild) spaces. 

For the manifold U above, the product 1/ x 
is found to be homeomorphic to the Euclidean 
4-space R"^. R. Moore proved that the quotient 
space of the Euclidean 2-space R^ by any 
upper semicontinuous decomposition consist- 
ing of continua not separating is homeo- 
morphic to R^. On the other hand, R. H. Bing 
(1959) gave an example of a similar upper semi- 
continuous decomposition of R^ for which the 
quotient space B is not homeomorphic to 
R^ and is not even a manifold. This quotient 
space has the property B x R^ ssR"^. Bing 
(1957) also proved that at most countably 
many copies of a wild closed surface can 
simultaneously be embedded in R^. On the 
other hand, Stallings (1960) proved that there 
exists a wild disk of which a set of copies with 
the power of the continuum can simulta- 
neously be embedded in R^ [1 1], 

The study of the quotient space of n- 
dimensional manifolds by cell-like decompo- 
sitions has been fully generalized by Edwards 
and Cannon [7] for n ^ 5 to prove the double 
suspension theorem of homology spheres (--^ 
Section C (3)). 

G. The Schoenflies Theorem 

In 1906, A. M. Schoenflies proved that every 
tsimplc closed curve in the plane is the bound- 
ary of a 2-cell. This result is sharper than 
the tJordan curve theorem and is called the 
Schoenflies theorem. However there was a gap 
in the proof, and the complete proof was given 
by L. E. J. Brouwer in 1910. 

The following higher-dimensional analog of 
the above theorem is called the Schoenflies 
problem: Is every (n — l)-sphere embedded in 
the n-dimensional Euclidean space R" (n^3) 
the boundary of an embedded n-disk? 

For the case where the embedding is topo- 
logical, L. Antoine constructed a counter- 
example, called Antoine’s necklace, for « = 3 in 
1921 and solved this problem negatively. Also, 
J. W. Alexander constructed another counter- 
example, called Alexander’s horned sphere, in 
1924 (Fig. 1). A similar example was given by 
R. H. Fox and E. Artin in 1948 by making use 
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of knot theory (Fig. 2). The study of wild em- 
beddings began with these examples, and 
many interesting results were obtained (Bing). 

For the case where the embedding is PL or 
smooth, the Schoenflies problem has been 
solved affirmatively for m = 3 (Alexander). 

M. Brown (1960) and B. Mazur (1959) 
proved the following: Let h:S"~^ x [~1, 
be a topological embedding; then — x 
{0}) consists of connected open sets D+ and 
D of S", and the closures of D+ and Z)_ are 
both homeomorphic to the n-disk. This result 
solves the Schoenflies problem affirmatively 
for n ^ 5 when the embedding is PL and lo- 
cally flat or smooth. 
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Combinatorics 

A. General Review of Combinatorics 

Combinatorics is concerned mainly with prob- 
lems of discrete sets, such as the enumeration 
of subsets satisfying certain conditions, or their 
actual construction and the selection of an 
optimal subset with respect to a suitable crite- 
rion. Generally speaking, both the theoretical 
analysis and the construction of discrete sets 
are much more difficult than those problems in 
analysis concerning infinite sets with suitable 
topologies. 

The main emphasis and the name of this 
field have changed from time to time and from 
person to person. Other names such as com- 
binatorial analysis, combinatorial theory, or 
combinatorial mathematics are commonly used 
and mean almost the same thing. Recently, 
the expression “discrete mathematics” has 
also been used to describe the same field. In 
many cases a problem originally conceived as 
number-theoretic, algebraic, analytic, or geo- 
metric has eventually turned out to be com- 
binatorial (-^ 102 Design of Experiments, 241 
Latin Squares). 
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tGraph theory, originally a combinatorial 
study of 1 -dimensional topological tcomplexes, 
has developed into an important branch of 
mathematics. 

In this article we discuss various topics not 
yet considered to be established branches or 
organized fields of mathematics. The topics 
treated under the title “combinatorics” are 
at present not unified, and consist of a mix- 
ture of isolated techniques. However, recent 
progress with electronic computers is play- 
ing an important role in the solution of 
various combinatorial problems arising from 
large systems, and the study of combinatorics 
is now quite active both in theoretical develop- 
ment and application. We now have special 
journals in this field, such as the Journal of 
Combinatorial Theory (Academic Press), Com- 
binatorica (Elsevier), the European Journal 
of Combinatorics (Academic Press), Discrete 
Mathematics (North-Holland), and Discrete 
Applied Mathematics (North-Holland). 



B. Three Fundamental Principles 

The most fundamental among the various 
principles in combinatorics are the following 
three principles for enumeration problems. Let 
Q be a finite set and let |A| denote the number 
of elements in a subset A of Q. 

(1) Rule of sums; If AC\B = 0, then \ A + B\ — 
\A\ + \B\. 

(2) Rule of products; |/lxB| = |A|x|B|, where 
Ax B is the tdirect product of the two sets A 
and B. 

(3) Principle of inclusion and exclusion; 

\A,CA^C...CA„\ 

i i<j i<j<k 

-...+(-i)"-MA,nA2n...nA„|. 

These principles can be generalized as state- 
ments concerning various measures. 

C. Mobius Inversion Formula 

Let P be an ordered set whose arbitrary inter- 
val is a finite set. Let [P x P] = {(x,y)eP x 
P I X be the set of intervals of P. A func- 
tion satisfying the following two conditions is 
determined uniquely and is called the Mobius 
function over P; 

(i) p{x, x) = 1 for all x e P; 

(ii) £ 2 .jt^ 2 ^y/z(x, z) = 0 for all x, yeP with x<y 
and x^y. 

When K is a field, for a mapping /: [P x P] 
->X, we can define another mapping g:lPx 
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by 

9ix.y)=^ E /(^>4 

ze(x,y] 

Then we have the Mdbius inversion formula 

f(x,y)^ E 9 ix,z)fi{z,y). 

2e[x.y] 

This formula implies, as its special cases, many 
important formulas, such as the tMbbius in- 
version formula in number theory, the prin- 
ciple of inclusion and exclusion, and the in- 
verse relation of difference and summation. 

D. Some Special Sequences 

In the enumeration problem, many important 
sequences {a„} are given by the ^generating 
function f{x) = 'E.a„x". Typical examples are as 
follows: (1) tBinomial coefficients: (1 -t-x)'" = 
2„C)x". (2) Multinomial coefficients: (xj -1- ... 

+ ^rr = 2(n,!”.n,)^i‘ (3) Multiple combi- 
nation u„ = >„ = r^"-i):(l = 

(4) Bell numbers: Qxpie'^ — \)^T,B„x''/n[ (5) 

The Stirling number of the second kind is 
the total number of bijections from N„ to N„, 
divided by m!, where N„ = { 1 , 2, . . . , n}. This is 
equivalent to the total number of partitions of 
N„ into m nonempty blocks: (e^— = 

S*„,S:x7n!. 

In general, when the generating function 
/(x) is an analytic function of one or several 
variables as above, we can obtain recurrence 
formulas for the sequence from the functional 
equations or the differential equations which 
f(x) satisfies; or we can get asymptotic esti- 
mates by using the tsaddle point method. 

E. P61ya’s Enumeration Theorem 

Let 1/ be a finite set, F be a (finite or infinite) 
set, G be a transformation group operating on 
U, and w be a function assigning to each ele- 
ment of K a “weight.” A weight is an element 
of a ring and is usually defined by a suitable 
generating function. The weight of a function 
f:U^V is defined by w(f) = Oust; w(/(u)). 
Two functions /i and /2 are considered iden- 
tical (/, =/ 2 ) if there exists a tieG such that 
/i = /2 o n. Put F = V^/(E), the set of all equiv- 
alence classes of the relation H. w(/) is well- 
defined for / eF, because fi ^/2 implies w(/i) 
= w(/ 2 ). Then we have the equality S/6FW(/) 

Here, f f 2 » • ■ • > ^m) is the cycle index of G, 
which is given by n ^ •• • 

where n — \G\, m = \U\, and ji^{n) is the number 
of cyclic permutations of length k contained in 
71. This is called Pdlya’s enumeration theorem. 
When G = S„ (the tsymmetric group), Ps^(yi, 

• • • . y«,) = 



. . . y^"'. When G — A^ (the talternating group), 

PA„{y\-: • • • > >’m) — 2^Afc>0.kAi=m.Z/l*=even2(2l I 

• ■ • . . . y^-. When G = Q (the tcy- 

clic group). Pc Jy 1 ,...,yj = m~^ Iklm 
where (p{k) is the tEuler function, i.e., the num- 
ber of integers less than k and coprime to k. 
This theorem is widely applicable to various 
problems with suitable choices of weights. 
Several generalizations have also been studied 
(see, e.g., [3]). 

F. The Notion of Polymatroids 

The theory of matroids was originally intro- 
duced as an abstraction of the notion of ffinear 
dependence in a vector space or of some prop- 
erties of tgraphs or of the notion of talge- 
braic dependence in the theory of fields (see, 
e.g., [4, 5]). But recently, as many important 
applications were discovered, it has become 
an important branch of combinatorics (see 
[6-8]). 

Let /i be a function from a tlattice L to a 
totally ordered module R. jx is called sub- 
modular if /i(x) -I- /r(y) ^ /r(x v y) -I- /x(x a y) for 
every pair of elements x, ye L. The set of ueL 
such that /x(a) = min^gcP(^) constitutes a sub- 
lattice in L. Let L be the family of all subsets 
of a finite set E. The sublattices Kin L = 2^ 
and the pseudo-orders over E correspond in a 
one-to-one manner as follows (here pseudo- 
order means a binary relation taken as an 
tordering without assuming the tantisym- 
metric law): (i) The class of a partition P of £ 
corresponds to the difference of two subsets 
which are the immediate predecessor and 
immediate successor in K, or the complement 
with respect to £ of the maximum element in 
K, or the complement of the minimum ele- 
ment. (ii) The complement of the maximum 
element in K is maximum in the order of P, 
and the minimum element in K is minimum in 
the order of P. (iii) Two classes ^2 ^^ B-: 
given as the difference of two elements in K, 
satisfy the relation (^1 ^(^2 with respect to the 
order of P if and only if all elements of K such 
that x3M^i in £ satisfy x^<^ 2 - 

Now let £ be a finite set, L — 2^, and let R be 
the real number field with the usual addition 
and order relations. If a submodular function 
satisfies the following conditions, 
then P = (£, fi) is called a polymatroid over £, 
and is called the rank function over P. (i) 
m( 0)==O, where 0 is the empty set. (ii) xcy 
(c£) implies n{x)^ix{y). A function u:£->R is 
automatically extended to u:2®-»R by u(x) = 
'LeexM^) {x<^E), which is also denoted by 
the same symbol u. If a function u satisfies 
0^u(x)^/r(x) for every xeE, then u is called 
an independent vector over P. An independent 
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vector u with u(E) — ^(E) is called a base of P. 

A function is identified with an ele- 

ment in R", where n — \E\. Then a set of inde- 
pendent vectors in P forms a ^convex poly- 
hedron in R". If we denote the polyhedron by 
P, then such a P is characterized by the follow- 
ing two conditions: (PI) If ueP then each v 
satisfying 0 ^i;(c)^m(c) for every eeE also 
belongs to P. (P2) If two u, ueP satisfy u{E)< 
v(E), then there exists weP, w^u, satisfying 
u(cX w(c):^max[w(c), t;(c)] for every ce£. If 
there are two polymatroids Pj = {E,Hi) and 
P 2 =(£,/i 2 ) over the same finite set £, then we 
have max{u(£)|uePi nP 2 } ==min{/ij(x) + 

H 2 {E — x)\xciE]. According to information 
theory, an example of a polymatroid (£, fi) is 
given by the following: n(x), xe£, is the simul- 
taneous tentropy of a subset x of the infor- 
mation source £ with correlation. 



G. Matroids 

A polymatroid P = (£,//) is called a matroid if 
the values of fj. are always integers and /r(x) ^ 

I X I = the cardinality of the set x. A set x c £ is 
called an independent set if //{x) = |x[, and 
a dependent set otherwise. An independent 
set with maximal cardinality is called a base. 

A minimal dependent set is called a circuit. 

The set y with max{ |y| |yi3x,^{x) = ^(>’)} is 
uniquely determined by x, which is called 
the closure of x and denoted by cl x. Several 
axioms stated in terms of these notions exist, 
and can be used to characterize matroids. 
Some of them are: 

(1) The axiom for the family of independent 
sets (i) 0e./; (ii) xe./ and ycx implies 
ye I; (iii) Ifx,y and jx] <|y|, then there 
exists an eeE in y — x such that xU{e}ey. 

(2) The axiom for the family of bases (i) ^ 

is not empty; (ii) If b,b'e^ and h/h', then for 
every eeb — b', there exists an e' such that e'e 
b' — b and (b~{e})U{e'}e^. 

(3) The axiom for the family of circuits ‘if: 

(1) 0 6 (ii) For every pair x, y e y is not a 
proper subset of x; (iii) If x, ye^ and 

then for every eex fly there exists such 

that zcxUy — {c}. 

(4) The axiom for the closure function cl ; 
2®->2^: (i) xcclx = cl(clx) for every xe£; (ii) 

X c y implies cl x <= cl y; (iii) If e 6 cl(x U {£} ) — 
clx, then c'Gcl(xU{e}) — clx. 

The following are typical examples of ma- 
troids: (1) Let £ be the family of finite set of 
vectors in a vector space V and ,/ be the 
family of sets of linearly independent vectors. 

(2) Let £ be the set of edges of a tgraph and 
be the set of edges constituting a fundamen- 
tal circuit. (3) Let £ be a finite set and S be a 
family of subsets of £. x = {cj , . . . , cj <= £ is 
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defined to be independent if there exist differ- 
ent elements Sj , . . . , s,. in S' such that C; e s,- 
for every 

Let a weight function w:£->R be defined 
for every element ee£ in a matroid (£, fi). To 
obtain a base b of the matroid with minimal 
weight w{b) — Seeft w{e), we can apply the fol- 
lowing procedure, called the greedy algorithm: 

( 1 ) Arrange the elements c j , . . . , of £ in such 
a manner that w(e 1 )< . . . ^ w(e„). (2) Put h*®' : = 
0, and /:=0. (3) If x = h*'^U{c,+i}G./, then 
h<‘+*b=x; otherwise, b^‘~^^'^:=b^‘\ (4) Repeat 
process (3) for i = 0, 1 1 . Then the final 
is the base of minimal weight. 

H. Operations on Matroids 

For a matroid M = (£, /i) with the family ^ of 
its base, ^* = {£ — x|xe£^} also satisfies the 
axioms of the base. The matroid M* =(£, n*) 
with .^* as its base is called the dual of M. 

Here we define (x) = | x | -f- /i(£ — x) ~ /i(£). 
When £ <= £, M I £ = (£, jj.p) is called the reduc- 
tion of M = (£,|i) to £, where jXp is the restric- 
tion of n on 2^. A matroid M x F — {M* \ £)* is 
called the contraction of M to £. A matroid 
given by the form (M | £) x £' is called a minor 
of M. From two matroids M, =(£,/x,) and M 2 
= (£,^ 2 ) over the same set £, we can construct 
a matroid M = v M 2 = (£, fi) whose rank 
function fi{x) is defined by the minimum of 
(£i(>') + £ 2 (y) + l^“>'l This is called 

the union of M^ and M 2 . 

Let M = (£, ^) be a matroid and a be a posi- 
tive constant. The function /i(x) = /i(x) — a|x| is 
submodular. The set L of x which minimizes 
il(x) is a sublattice of 2^. The partition of £ 
and the order relation of its classes defined by 
L are called the principal partition of M with 
respect to the parameter a. When the value of 
the parameter a is not specified, a is taken to 
be 2. 

As was seen in example (1) at the end of the 
preceding section, the family of vectors in 
a vector space V over a field X is a matroid 
M. A matroid isomorphic to such an M is 
called linearly representable over X. If X = 
GF{p), the finite field with p elements, then it 
is called a p-ary matroid. A matroid linearly 
representable over any field is called regular. If 
[G] is a matroid isomorphic to the matroid 
defined by the family of edges of a graph as 
shown in example (2), it is called graphic. If the 
dual M* is graphic, M is called cographic. 
Graphic and cographic matroids are always 
regular. There exist matroids not linearly 
representable over a field X, or linearly repre- 
sentable over a particular field X but not 
regular, or regular but neither graphic nor 
cographic. If [G] is a matroid isomorphic to 
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the matroid of the type in example (3), it is 
called transversal. Such a matroid is linearly 
representable when the cardinality of the field 
K is sufficiently large. 
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A. General Remarks 

A ring R (^ 368 Rings) whose multiplication 
is commutative is called a commutative ring. 
Throughout this article, we mean by a ring a 
commutative ring with unity element. 

B. Ideals 

Since our rings are commutative, we need not 
distinguish right or left ideals from hdeals. A 
subset a of a ring R is an ideal of R if and only 
if a is an R-submodule of R (^ 277 Modules), 
if and only if a is the tkernel of a ring homo- 
morphism from R into some ring (except for 
the case a = R). Given an ideal a of a ring 
R, the set of elements which are tnilpotent 
modulo a, i.e., {x e R | x " g a (3n) }, is called the 
radical of a and is often denoted by ^/a- The 
radical of the zero ideal is called the radical 



of R, or more precisely, the nilradical of R 
(- 368 Rings H). 

For a subset S of a ring R, the smallest ideal 
a containing S is called the ideal generated by 
S, and S is called a basis for a; if S is a finite 
set, then S is called a finite basis. When 
(agA) are ideals of R, the sum of these ideals, 
denoted by 2 is defined to be the ideal 
generated by the union of the ideals (/IgA). 

If A is a finite set, say { 1, 2, . . . , n}, then the sum 
is denoted also by -I- ... -t-a„, and in this case, 
the sum is also called a finite sum. Note that 
Oi -h ... -l-a„ = (aj -f ... -t-a„|a,GaJ. The product 
Oj . . . a„ of a finite number of ideals Oj , . . . , o„ is 
defined to be the ideal generated by the set 
...fl„|aiGa,}. The intersection of an arbi- 
trary number of ideals is an ideal. When a 
is an ideal of a ring R and S is a subset of R, 
the quotient a:S is defined to be the ideal 
{xgRIxSco}. If a, b, c, b;i (/IgA) are ideals, we 
have (a ; b) : c — a : be, a : 2 (a : b;^). 

C. Prime Ideals 

An ideal p of a ring R is called a prime ideal if 
R/p is an tintegral domain; p is a prime ideal if 
and only if p R and also abep (a, he R) im- 
plies aep or hep (some literature includes the 
ring R itself in the set of prime ideals). Let S be 
a multiplicatively closed subset of a ring R, i.e., 
a nonempty tsubsemigroup of R with respect 
to multiplication. A maximal member among 
the set of ideals which do not meet S is called a 
maximal ideal with respect to S. Such a mem- 
ber is necessarily a prime ideal; when S = (1 }, it 
is called a maximal ideal of R. An ideal m is a 
maximal ideal of R if and only if R/m is a field. 

D. Jacobson Radical 

The intersection J of all maximal ideals of a 
ring R is called the Jacobson radical of R; in 
some cases, this intersection J is called the 
radical of R (-- 368 Rings H). Let N be an R- 
submodule of a finite R-module M. If MJ -h 
N — M, then M = A (Krull-Azumaya lemma 
or Nakayama lemma). 

E. Krull Dimension 

For a prime ideal p of a ring R, the maximum 
of the lengths n of descending chains of prime 
ideals p == Po ^ p i ^ ... ^ p„ which begin with p 
(or oo if the maximum does not exist) is called 
the height or rank of the prime ideal p. For an 
ideal a, the minimum of the heights of prime 
ideals containing a is called the height of the 
ideal a. The maximum of heights of prime 
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ideals of R is called the Krull dimension (or 
altitude) of the ring R. For an ideal a of R, the 
Krull dimension of R/a is called the Krull 
dimension (or depth) of the ideal a. (The mean- 
ings of the terms rank, dimension, and depth 
now depend on the writings in which they 
are found; standardization of definition of 
these terms is beeoming more and more of a 
necessity.) 

F. Primary Ideals 

For an ideal q of R,if q^R and every zero divi- 
sor of R/q is nilpotent, then q is called a pri- 
mary ideal. (If R is included in the set of prime 
ideals, then R is regarded as a primary ideal.) 

In this case, p = is a prime ideal, and q is 
then said to belong to p or to be a p-primary 
ideal. The intersection of a finite number of pri- 
mary ideals belonging to the same prime ideal p 
is p-primary. Assume that an ideal o is ex- 
pressed as the intersection a = q ^ fl . . . fl q„ of a 
finite number of primary ideals q j , . . . , q„ . If 
this intersection is irredundant, that is, if none 
of the q, is superfluous in the expression of a, 
then the set of .^q; (i = 1 ,...,«) is uniquely 
determined by a. The ^/q, are called prime 
divisors (or associated prime ideals) of the ideal 
a; a minimal one among these is called a mini- 
mal (or isolated) prime divisor of the ideal o, 
and those which are not minimal are called 
embedded prime divisors of the ideal a. A max- 
imal one among the prime divisors of a is 
called a maximal prime divisor of the ideal a. 
(For definitions of these concepts in the case 
where a is an arbitrary ideal, see [4].) If, in the 
expression of a as above, n is the smallest 
occurring in similar expressions, then the 
expression is called the shortest representation 
of the ideal a by primary ideals. In this case, 
each q; is called a primary component of the 
ideal a; if is isolated, then qj is called an 
isolated primary component of the ideal a, and 
otherwise q, is called an embedded primary 
component of the ideal a. Isolated primary 
components are uniquely determined by a, but 
embedded primary components are not. 

G. Rings of Quotients 

Let R be a ring. Then the set U of elements of 
R which are not zero divisors is multiplica- 
tively closed. In the set RxU = {{r,u)\reR, 
ueU], we define a relation = by (r, u) = 
(r',u’)oru' = r'u. Then = is an equivalence 
relation, and the equivalence class of (r, u) is 
denoted by r/u. In the set Q of these r/u, addi- 
tion and multiplication are defined by r/u + 
r' ju' = {ru' + r'u)/uu' , {r/u)(r' /u') — rr' /uu' . Then 
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Q becomes a ring, and r/1 can be identified 
with r. Thus 0 is a ring containing R, gen- 
erated by R and inverses of elements of U. This 
property characterizes Q, which is called the 
ring of total quotients of the ring R. If R is an 
integral domain, then Q is a field, called the 
field of quotients of the integral domain R. Let 
S be a multiplicatively closed subset of R such 
that O^S, and let n be {xeR|xs = 0 (3s6S)} 
and (p be the natural homomorphism R-^R/n. 
Then none of the elements of <p(S) is a zero 
divisor. The subring of the ring of total quo- 
tients of R/n, generated by R/n and inverses of 
elements of (p{S), is called the ring of quotients 
of the ring R with respect to S, and is denoted 
by Rs (R[S~^] or RS"^). When M is an R- 
module, M Rg is called the module of quo- 
tients of the R-module M with respect to S. It 
is significant that Rg is tR-flat. There is a one- 
to-one correspondence between the set of pri- 
mary ideals q of R whieh do not meet S and 
the set of primary ideals Q of Rg such that q 
corresponds to Q if and only if Q = qRg (q = 

Q n R). When p is a prime ideal of R, the com- 
plement R — p is multiplicatively closed, and 
R^-p is called the local ring of p or the ring of 
quotients of the ring R with respect to the prime 
ideal p, and is denoted by Rp (-^ 284 Noe- 
therian Rings C, D). A ring of quotients is also 
called a ring of fractions. 



H. Divisibility 

In a ring R, if a = be {a, b,ce R), then we say 
that h is a divisor (or factor) of a, and that a is 
a multiple of b, or a is divisible by b. We denote 
this by h I a. This relation between a,bsRis 
called divisibility relation in R. If, in this situ- 
ation, c has its inverse in R, we say that a is an 
associate of b. A factor h of a is called a proper 
factor if b is neither an associate of a nor tin- 
vertible. An element which has no proper fac- 
tor is called an irreducible element. A nonzero 
element which generates a prime ideal is called 
a prime element. 

If in an integral domain R every nonzero 
element is a product of prime elements (up to 
invertible factors), then we say that the unique 
factorization theorem holds in R, and that R is 
a unique factorization domain (or simply u.f.d.). 

Let A = {a^, be a set of nonzero ele- 

ments of a ring R. A common divisor of A is an 
element which is a factor of U;. A common 
multiple of A is defined similarly. The greatest 
common divisor (G.C.D.) of A is a common 
divisor which is a multiple of any common 
divisor; the least common multiple (L.C.M.) of 
A is a common multiple m which is a factor of 
any common multiple. Thus, the G.C.D. and 
L.C.M. exist if R is a u.f.d. 
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L Integral Dependence 

Let i? be a subring of a ring R" sharing a unity 
element with R". An element aei?" is said to 
be integral (or integrally dependent) over R if 
there are a natural number n and elements Cj 
of R such that a^ + Cia" + ... +c„ = 0. If every 
element of a subset S of R" is integral over R, 
we say that S is integral over R. (When R has 
no unity element, a similar definition is given 
under an additional condition that c,-eR‘. An 
important special case is where R is an ideal. 
See D. G. Northcott and D. Rees, Proc. Cam- 
bridge Philos. Soc., 50 (1954); M. Nagat'a, Mem. 
Coll. Sci. Univ. Kyoto, 30 (1956).) The set R of 
elements of R" which are integral over R is a 
ring and is called the integral closure of R in 
R". If R — R, then R is said to be integrally 
closed in R". If R is integrally closed in its ring 
of total quotients, we say that R is integrally 
closed. An integrally closed integral domain is 
called a normal ring. (In some literature, an 
integrally closed ring is called a normal ring.) 
An element aeR" is called almost integral over 
R if there is an element 6 of R such that b is 
not a zero divisor and a^beR for every natural 
number n. If an element a of the ring of total 
quotients of R is integral over R, then a is 
almost integral over R. R is said to be com- 
pletely integrally closed if its ring of total quo- 
tients contains no elements which are almost 
integral over R except the elements of R itself. 

J. Group Theorem 

Let Q be the ring of total quotients of a ring R. 
An R-submodule a of Q is called a fractional 
ideal of R if there is a non-zero-divisor c of R 
such that caci R. The product of fractional 
ideals is defined similarly as in the case of 
products of ideals. The inverse a“^ of the 
fractional ideal a is defined to be 
czR}. If a contains an element which is not a 
zero divisor, then a Ms also a fractional ideal. 
When R is completely integrally closed, we 
define fractional ideals a and b to be equiva- 
lent if This gives rise to an equiva- 

lence relation between fractional ideals. The 
set of equivalence classes of fractional ideals 
which contain non-zero-divisors forms a 
group. This result is called the group theorem. 

An integral domain R is called a Krull ring if 
(i) for every prime ideal p of height 1, the ring 
Rp is a ^discrete valuation ring; (ii) R is the 
intersection of all the valuation rings R^ and 
(iii) every nonzero element a of R is contained 
in only a finite number of prime ideals of 
height 1. In a Krull ring R, for an arbitrary 
nonzero fractional ideal a, there is a uniquely 
determined product of powers of prime ideals 



of height 1 which is equivalent to a in the sense 
stated above (^ 439 Valuations). 



K. Dedekind Domains and Principal Ideal 
Domains 

A ring R is called a Dedekind domain if (i) R 
is a tNoetherian integral domain, (ii) R is a 
normal ring, and (iii) the Krull dimension of R 
is 1. For an integral domain R which is not a 
field, R is a Dedekind domain if and only if the 
set of all nonzero fractional ideals is a group, 
if and only if every nonzero ideal of R is ex- 
pressed as the product of a finite number of 
prime ideals and such expression is unique up 
to the order of prime factors. An important 
example of a Dedekind domain is the ring of 
all talgebraic integers, i.e., the tprincipal order 
of an talgebraic number field of finite degree. 

In general, if R is a Dedekind domain with 
field of quotients K and L is a finite algebraic 
extension of K, then the integral closure of R 
in L and any ring R' such that R c R' ^ K are 
Dedekind domains. 

An ideal generated by an element is called 
a principal ideal; a fractional ideal generated 
by an element is called a principal fractional 
ideal (or simply principal ideal). In a Dedekind 
domain, the set P of nonzero principal frac- 
tional ideals is a subgroup of the group I of 
nonzero fractional ideals; I/P is called the ideal 
class group of R, and a member of it is called 
an ideal class. The border of I/P is called the 
class number of R (-^ 14 Algebraic Number 
Fields). There are many Dedekind domains 
whose class numbers are infinite. 

A ring R is called a principal ideal ring if 
every ideal is principal; furthermore, if R is an 
integral domain, then R is called a principal 
ideal domain. A principal ideal ring is the 
direct sum of a finite number of rings of which 
each direct summand is either a principal ideal 
domain or a fiocal ring whose maximal ideal is 
a principal nilpotent ideal. A principal ideal 
domain which is not a field is a Dedekind 
domain and a u.f.d. We consider, for an arbi- 
trary natural number n and a principal ideal 
ring R, the set M{n, R) of all « x n matrices 
over R. Given an element A of M(n, R), there 
exist elements X, Y in M(n, R) such that (i) 

X~^, are in M(n, R), and (ii) denoting by 
bij the (j,j)-entry of XA Y, we have i R 622 ^ 
=)...=> b„„R and = 0 if i 7^ j. The nonzero 
members of the set {bji,b22, •••,b„„} are called 
the elementary divisors of the matrix A. Apply- 
ing this to a finite module M over the principal 
ideal ring R, we see that M is the direct sum of 
myR, ...,m^R{m^eM) such that, with 0 , = 

(xgR I /n,oc = 0}, we have Qj c 02 «= . .. ca, (^ 2 
Abelian Groups B; 269 Matrices E). 
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L. Euclid Rings 

A ring R is called a Euclid ring if there is a 
map of R — {0} into a twell-ordered set W 
(the set of natural numbers is a special case) 
satisfying the condition that if a, he/?, a ^0, 
then there are r,qeR such that h = aq + r and 
either r = 0 or (f>{r) < (p{a). 

Every Euclid ring is a principal ideal ring. 
Besides the ring Z of rational integers, there 
are several familiar examples of Euclid rings, 
such as Z[,y~ 1], Z[,y — 2], Z[o;] (oj^ = 

1 , 0 ) 9 ^ 1), and the polynomial ring of one 
variable over a field. 
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68 (XII.10) 

Compact and Nuclear 
Operators 

A. General Remarks 

Let A be a finite-dimensional linear space. 
Then a linear operator in X is surjective 
if and only if it is injective. If X is infinite- 
dimensional, this is no longer the case in 
general. For an operator of the form I + K 
with an integral operator K of continuous 
kernel, I. Fredholm [I] developed the the- 
ory of determinants and retrieved the above 
equivalence (^ 217 Integral Equations). Later, 
F. Riesz [3] simplified the proof and showed 
that the equivalence holds if /C is a compact 
operator in a Banach space. I. Ts. Gokhberg 
and M. G. Krein [4] reformulated the result as 
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the stability of indices of operators. The con- 
cept of compact operators itself was introduced 
by D. Hilbert (who used the terminology, “com- 
pletely continuous operators”). The Hilbert- 
Schmidt class is also due to him. These classes 
of operators are employed in the spectral 
theory of integral operators [2]. The trace 
class of operators in Hilbert spaces is defined 
as the class of operators for which the trace 
has meaning. A. Grothendieck [5] introduced 
nuclear operators, extending the definition of 
trace class operators to general Banach spaces. 
Nuclear operators and related classes of 
operators, such as integral operators and 
absolutely summing operators, play important 
roles in the theory of topological tensor prod- 
ucts [5], determinants [6], vector measures, 
and measures in linear spaces [7], 

B. Compact Operators 

A tlinear operator T from a tBanach space X 
to a Banach space Y is said to be compact (or 
completely continuous) if T maps any ^bounded 
set of X to a trelatively compact set of T. In 
other words, T is compact if, for any bounded 
sequence {x„} in X, the sequence {Tx„} in Y 
contains a tstrongly convergent subsequence. 

A compact operator is necessarily ^bounded 
and hence continuous. A compact operator 
from X to y maps any tweakly convergent 
sequence in A to a strongly convergent se- 
quence in Y. If X is ^reflexive, the converse is 
also true. 

In this article the set of all bounded (resp. 
compact) linear operators from X to 7 are 
denoted by B(X, 7) (resp. B<‘>(X, 7)). 

C. Examples of Compact Operators 

(1) Degenerate operators. An operator 
T 6 B(X, 7) is said to be degenerate or of 
finite rank if the trange /?(T) of T is finite- 
dimensional. A degenerate operator is neces- 
sarily compact. The identity operator in X is 
compact (7=X) if and only if X is finite- 
dimensional. (2) tlntegral operators with con- 
tinuous kernel. Let E and F be bounded closed 
regions in the Euclidean spaces R™ and R", 
respectively, and let k = k{t,s), leF, se£, be a 
continuous function defined on F x E. Then 
the integral operator T with ^kernel k, 

(Tx)(t) = |* k{t,s)x{s)ds, teF, (1) 

is a compact operator from the Banach space 
C(£) to C(F). (For the notation for various 
function spaces ^ 168 Function Spaces.) (3) 
Integral operators of Hilbert-Schmidt type. 

In example (2) let £ and F be '''Lebesgue mea- 
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surable sets, and let keL 2 (F x E). Then the 
integral operator T determines a compact 
operator from the tHilbert space L 2 {E) to the 
Hilbert space L 2 (F). (4) Let O be a bounded 
open set in R", s<t two real numbers, and 
the tSobolev space of order s (— ^ 168 
Function Spaces). Then the natural injection 
from H‘{Q.) into H%Q) is compact (F. Rellich’s 
lemma). 

D. Properties of Compact Operators 

Any linear combination of compact opera- 
tors is again compact. If a sequence { T„] of 
compact operators converges in the norm 
(i.e., in the tuniform operator topology), then 
the limit T is compact. Thus 7) is a 

closed subspace of the Banach space B(2(r, 7) 
with the ^operator norm. Any product of 
a compact operator and a bounded opera- 
tor is compact. Namely, AeB'‘'’(A', 7), Be 
B(7,Z), and CeB(Z,X) imply BAeB<">(Z,Z) 
and /4CeB<"»(Z, 7). In particular, B<^>(X) = 
B*‘^’(A', X) forms a closed ttwo-sided ideal 
of B(AT) = B(X,X). An operator TeB(A, 7) 
is compact if and only if its tdual operator 
T' eB{Y',X') is compact. The range of a com- 
pact operator is always ^separable. Let X and 
7 be Hilbert spaces. Then for any T e , 7) 
there exist torthonormal sets {(p„] in X and 
in 7 and a sequence {c„} of nonnegative 
numbers with limc„ = 0 such that 

Tu^Y.^niu,<Pn)^n, WGX. 

n 

Consequently, any compact operator between 
Hilbert spaces X, Y can be approximated by 
a sequence of degenerate operators in the 
operator norm. However, there are Banach 
spaces X and 7 for which the statement is no 
longer true. In fact, a Banach space X (resp. 
the dual 7' of a Banach space 7) has the tap- 
pro ximation property (-^ 37 Banach Spaces L) 
if and only if every reB*‘'*(7, X) is the limit in 
norm of a sequence of degenerate operators 
for any Banach space 7 (resp. X). 

E, The Riesz-Schauder Theorem 

Let T’gB*‘^^(X) and consider a pair of linear 
equations 

u-Tu=f, feX, (2) 

(P-T'(p = g, geX', (3) 

where T e B(2f ') is the dual operator of T in 
the tdual space X' of X. Put ^ — {ueX\u — 
Tu} and = {(pEX'\(p = T'(p]. Then one 
and only one of the following two cases (i) and 
(ii) occurs, (i) ^ = {0}, — {0}; for any f eX 



equation (2) has a unique solution; and for any 
geX' equation (3) has a unique solution, (ii) 
dim = dim ^' = w, 1 ^m< oo; (2) has a 
solution if and only if / is orthogonal to 
(i.e., (p{f ) = 0 for any (peJi’)\ and (3) has a 
solution if and only if g is orthogonal to Ji 
(i.e., ,^(m) = 0 for any ueJi), This is called the 
Riesz-Schauder theorem. In particular, when T 
is an integral operator in a suitable function 
space, this theorem is also called Fredholm’s 
alternative theorem for integral equation (2). 

F. Fredholm Operators and Their Indices 

Let T be a tclosed linear operator from a Ba- 
nach space to a Banach space 7. T is called 
a Fredholm operator or an operator with index 
if both Ker T ( = null space N{T)) and Coker T 
(= Y/R{T}) are finite-dimensional. Then the 
integer ind T = dim Ker T — dim Coker T is 
called the index of T. If a closed linear opera- 
tor T has Coker T of finite dimension, then 
the range R{T) is closed. Moreover, if R{T) is 
closed and the domain D{T) is dense, then 
dim Coker T = dim Ker T', where T' is the dual 
of T, and hence ind T = dim Ker T — dim Ker T’. 
A linear operator K from to 7 is said to be 
T-compact if the domain D{K) contains D{T) 
and if K from D(T) with the tgraph norm into 
7 is compact, i.e., for any bounded sequence 
{x„} in D{T) with sup || Tx„\\ < oo, the se- 
quence {Kx„} contains a strongly convergent 
subsequence. 

The following are basic properties of Fred- 
holm operators [4, 11]; (1) If T is Fredholm 
and K is T-compact, then T+K is Fredholm 
and ind(T-|-K) = ind T. (2) If T is Fredholm 
and another linear operator S is sufficiently 
close to T in (graph) norm, then S is also Fred- 
holm and ind S = ind T. (3) If T is a Fredholm 
operator from X into 7 and S is a Fredholm 
operator with dense domain in 7 into Z, then 
ST is Fredholm and ind ST= ind 5 -I- ind T. If 
D{T) is dense, then D{ST) is also dense. (4) 

Let r be a closed linear operator with dense 
domain; then T is Fredholm if and only if the 
dual T' is Fredholm, and then ind T'= —ind T. 

Fredholm operators and their indices were 
first studied by Russian mathematicians in 
connection with boundary value problems of 
differential equations and singular integral 
equations. M. F. Atiyah and I. M. Singer have 
proved that an telliptic linear differential 
operator P of order m on a compact differenti- 
able manifold M is a Fredholm operator from 
the Sobolev space H^M) into for 

any s and that its index is computed from the 
^symbol of P and the tcharacteristic classes of 
M (^ 237 K-Theory). Similarly, the indices 
of linear ordinary differential operators (and 
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more generally of maximally overdetermined 
systems of linear partial differential operators) 
are computed in various spaces of functions 
and generalized functions (H. Komatsu, B. 
Malgrange, J.-P. Ramis, M. Kashiwara). 



G. Spectra of Compact Operators 

The following structure of the ^spectrum of 
compact operators is derived from 

the Riesz-Schauder theorem. The spectrum 
a(T) of T consists of at most countably many 
points and has no accumulation points except 
possibly for 0. Any nonzero point of a{T) 
is an teigenvalue of T. When X is infinite- 
dimensional, 0 always belongs to a{T) but is 
not necessarily an eigenvalue of T. Each non- 
zero eigenvalue of T has finite (algebraic) 
tmultiplicity, and hence the teigenspace Jf^{T) 
= {u| Tu = ky], is finite-dimensional. If /I 
is an eigenvalue of T, then A is an eigenvalue 
of the tadjoint operator T* of T with the same 
(either algebraic or geometric) multiplicity as 
that of X. 



H. Spectral Representations of Compact 
Normal Operators 

Let T be a compact tnormal operator in a 
Hilbert space H. Then we can find a tcomplete 
orthonormal set consisting solely of eigen- 
vectors of T. Namely, for each nonzero eigen- 
value Aj of T, take an orthonormal basis 
of the eigenspace associated with Aj. Rearrange 
all the (p^'’ into a sequence {q>„} and add to it, 
if 0 is an eigenvalue, a complete orthonormal 
set of the eigenspace associated with 0. Then 
we obtain a desired complete orthonormal 
set of H. Let fi„ be the eigenvalue associated 
with (p„. Then the sequence {/i„} is precisely 
an enumeration of nonzero eigenvalues of T 
with repetitions according to multiplicity. 

In terms of {cp„} and a tspectral repre- 
sentation of T is given as 
00 
rt=l 

The eigenvalue problem of a compact non- 
negative tself-adjoint operator T can be solved 
by means of the following Rayleigh prin- 
ciple. Consider Rayleigh’s quotient R(x) = 

(Tx, x)/||x||^, x^O. The largest eigenvalue 
is obtained as and any vector 

Xi which attains this maximum is an eigen- 
vector. When the largest n—l eigenvalues 
/Xj, and eigenvectors x,, ...,x„_i are 

determined, the nth eigenvalue is the maxi- 
mum of R(x) on the subspace orthogonal to 
Xj, ...,x„_i and any vector which attains the 



maximum is an eigenvector. A more direct 
characterization of involving no previous 
eigenvectors, is given by 

u„= min / max R(x) 

/, /„-ieX U,/,)=0 

/i /„-i^o Yj = i,...,«-i 

This formula is referred to as the minimax 
principle. 



I. Classification of Compact Operators in 
Hilbert Spaces 

Let H and K be Hilbert spaces. Then a Te 
B{H, K) is compact if and only if (Te„,/„)->0 
for any orthonormal sequences e„ in H and 
/„ in K. J. von Neumann and R. Schatten 
{Ann. Math., 49 (1948)) classified operators 
TgB''*(//, K) in the following way. The opera- 
tor A = {T*Ty^^ is a compact nonnegative 
self-adjoint operator. Let aj ^ a 2 $= . . . be the 
enumeration in decreasing order of the posi- 
tive eigenvalues of A, with each repeated ac- 
cording to its multiplicity. The a„ = a„(T) are 
sometimes called the characteristic numbers of 
T. For any p>0 the set of all such 

that 

/ 00 \1/P 

is denoted by Bp{H) (or simply B^). Among 
these classes, Bj and B 2 are most important. 

Bj is called the trace class and B 2 the Hilbert- 
Schmidt class. Correspondingly, ||T||j and 
||Tj |2 are called the trace norm and Hilbert- 
Schmidt norm of T, respectively. B^ is also 
called the nuclear class and any operator 
TeBj a ^nuclear operator. The norm ||T||p can 
also be defined more directly as follows, (i) If 1 
<p<oo, ||T||p = sup||(Te„,/„)||(^; (ii) If0<p^2, 

II ^llp = inf 11(11 Tevll)lli,;(iii) If2^p<oo, ||T||p = 
sup 11(11 TeJ)||,^, where e„ and/„ range over 
the orthonormal sequences in H and K, and e„ 
over the orthonormal bases in H. The class B^ 
is a two-sided * ideal in the Banach algebra B. 
Moreover precisely, TgB^ and ReB imply 
llRrilp^llRII linip, ||TR||p<||R|| IITIIpand 
II T*\\p— IITlIp. When 1 ^p< 00 , the class Bp 
becomes a Banach space with the norm || r||p. 
It is always a tquasi-Banach space, in which 
the set of all degenerate operators is dense. 

The norm || T||p (T#0) is a decreasing func- 
tion of p. Hence BpcB^ if p^q. Also, TeBp 
and SeBq imply TSeB,., where \/r=l/p+l/q. 
Let TeBp, and let {pj} be an enumeration of 
eigenvalues of T with repetitions according to 
(either geometric or algebraic) ■''multiplicity. 
Then 

J 
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Hilbert-Schmidt class. For an arbitrary 
complete orthonormal set {11^}, we have 

Thus B2 can be defined as the set of all TeB 
such that the sum on the right-hand side is 
finite for a certain complete orthonormal set 
A linear operator T\L 2 {E)~*L 2 {F) is of 
Hilbert-Schmidt class if and only if it is an 
integral operator of Hilbert-Schmidt type 
(example (3) in Section C). The space 83 with 
the norm || T||2 becomes a Hilbert space with 
the inner product defined by 

(T,S)2 = i;(Tu„Sn,), 

where {u^} is as above. 

Trace class. For an operator TeB,, the 
trace tr(T) of T is defined as 

tr(T) = j;(ru„u,). 

Here the right-hand side converges absolutely 
and does not depend on the complete ortho- 
normal set {w^}. The trace is a bounded linear 
functional on the Banach space B, . The prod- 
uct of two operators of Hilbert-Schmidt type 
belongs to the trace class, and the converse is 
also true. If 1 < p, g < 00 satisfy \!p+\jq=\, 
then the inner product < T, S)> of TeBp(f/, K) 
and SeBq{K,H) is defined by <T,S> = tr(ST). 
Under this inner product the dual of the 
Banach space Bp{H, K) is identified with 
B,(X, H). Similarly, the dual of K) 
is isomorphic to B,(K, //), and the dual of 
B, (X, H) to B(H, K) (J. Dixmier, Schatten). 



J. Volterra Operators 



A compact operator T in a Hilbert space is 
said to be a Volterra operator if it is tquasi- 
nilpotent, i.e., its tspectral radius is 0. The 
integral operator 



(Tx)(t) = 



k{t, s)x(s) ds, 



xeL2{a, b), 



appearing in the integral equation of Volterra 
type is a Volterra operator. Conversely, a 
Volterra operator satisfying a suitable con- 
dition is unitarily equivalent to such an inte- 
gral operator (M. S. Livshits, Gokhberg and 
Krem [13]). A Volterra operator admits an 
abstract triangular representation in a manner 
similar to Jordan’s canonical form 390 
Spectral Analysis of Operators H). 



K. Nuclear Operators 

Extending the definition of trace class to 
operators in Banach spaces, A. Grothendieck 
[5] defined a nuclear operator (or Fredholm 



operator according to [6]) from a Banach 
space X into a Banach space F to be a linear 
operator T:X-yY that is represented as Tx = 
X^~j(x,aj}bj with a sequence Aj-^0 in a 
bounded sequence aj in X', and a bounded 
sequence bj in Y. In other words, a linear 
operator T: A ^ F is nuclear if and only if it is 
decomposed as the product 

a4/^4/,Af, (4) 

where A and B are bounded linear operators 
and A is multiplication by Aj in /, . The in- 
fimum of II A II lUjU;! ||B|| is called the nuclear 
norm of T and is denoted by T , . When 
X and F are Hilbert spaces, this coincides 
with the trace norm. The integral opera- 
tor T'defined by (1) is nuclear with ||T||, = 
jsup,|/c(t,s)|ds. The totality B,(A, F) of nuclear 
operators T:X~*Y forms a Banach space 
under the nuclear norm. If T is nuclear and 
A is bounded, then ||T/1|| , ^ I1T||, ||/1|| and 
II AT II 1 ^ jj A II II T|| , . If T is nuclear, then the 
dual T' is nuclear, and IIT'lli^llTII,. Suppose 
that X and F are Banach spaces satisfying one 
of the following conditions: (i) F is the dual of 
a Banach space; (ii) X' has the approximation 
property; (iii) F" has the approximation prop- 
erty. Then conversely a TgB{X, F) is nuclear 
if the dual T' is nuclear, and || T|| , = || T'|| , . 
However, this is not necessarily the case in 
general (T. Figiel and W. B. Johnson). 

Replacing (, by Ip, we obtain the definition 
of operators T:X^Y of summable pth power. 
Grothendieck [5] considered the case 0<p^ 1 
and showed that if T,, ..., 7^ are of summable 
p,,...,p„th power, then the product T, ... T„ is 
of summable rth power, where r is given by 
l/r = 0:i/p,)-(n+l)/2. 



L. Traces and Determinants of Operators 

Let T be a linear operator of tfinite rank in a 
linear space X. There are a finite number of 
elements a,- 6 A' and h,G A such that 

Tx=t<x,a,}b,. (5) 

i = l 

Then tr(T) = S(^,-, fli> and det(l — T) — 
det{5i j — {bi,aj}) are independent of the repre- 
sentation (5). Let A be a Banach space. If 
TeBj(A) is represented as Tx — 'Z,^j(,x,aj}bj, 
then it seems reasonable that the trace tr(T) 
be defined by Z Xj^bj, Qj'), but the sum may 
depend on the representation. It is known that 
a Banach space A has the approximation 
property if and only if the sum does not de- 
pend on the representation for any nuclear 
operator T. However, if T is of summable 
(2/3)rd power, the trace tr(T) is always de- 
fined uniquely. If A or T satisfies the afore- 
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mentioned condition, then the determinant 
det(l — T) is also defined uniquely as the limit 
of det(l — T„), where T„ is the nth partial sum. 
det(l ~zT) is an entire function of z and its 
zeros are exactly the reciprocals of the nonzero 
eigenvalues of T. If det(l —zT)^0, then the 
resolvent (\—zT)~^ is given by Fredholm’s 
formula extending +Cramer’s rule. Eigen- 
vectors are also computed explicitly [6]. 

Let // be a Hilbert space, and let TeBj(H). 
Then det(l —zT) = Tl{l—zXj), where Xj are the 
nonzero eigenvalues of T, and is an entire 
function of genus 0. Moreover, let p ^ 2 be an 
integer. Then the modified determinant 



deyi-zT)^n 



{l-zXj)exp(^^ k 



is defined for TeBp(H) and is an entire func- 
tion of genus p — 1. This type of determinant 
was introduced by T. Carleman {Math. Z., 9 
(1921)) for the Hilbert-Schmidt class of oper- 
ators also Hilbert [2]). It is utilized to 
prove the completeness of the troot vectors of 
T and many other facts [9, 12, 13]. 



M. Weakly Compact Operators 



A linear operator from a Banach space X to 
a Banach space Y is said to be weakly com- 
pact if T maps any bounded set of A to a rela- 
tively weakly compact set. Weakly compact 
operators are bounded. They have proper- 
ties similar to those of compact operators. 

The following are equivalent conditions for a 
bounded linear operator T:A-^y. (1) T is 
weakly compact; (2) T': Y'^X' is weakly 
compact; (3) T":X’’^Y" maps X" into Y; 

(4) There is a reflexive Banach space Z and 
bounded linear operators S.X^Z and R. 
Z^Y such that T = RS. The last characteri- 
zation is due to W. J. Davis, Figiel, Johnson, 
and A. Pelczyhski. Any linear combination of 
weakly compact operators is weakly compact. 
The uniform limit of a sequence of weakly 
compact operators is weakly compact. The 
product of a weakly compact operator and a 
bounded operator is weakly compact. 

Dunford-Pettis theorem ([9]; R. S. Phillips, 
Trans. Amer. Math. Soc., 48 (1940)). Let (Q,fi) 
be a c7-finite measure space and X a Banach 
space. If T;Li(Q)-»A is a weakly compact 
linear operator, then there exists a bounded 
strongly measurable function g on Q with 
values in X such that 



77 = 



f{t)g{t}dti(t). 



Then T maps weakly convergent sequences in 
L,(^^) into strongly convergent sequences in X 
In particular, the product of two weakly com- 



pact linear operators in Li(O) is compact. 
Similarly, Grothendieck and R. G. Bartle, N. 
Dunford, and J. Schwartz (^ [9]) have proved 
the following; Let D be a compact Hausdorff 
space and X a Banach space. If T : C(Q)-» A is 
a weakly compact linear mapping, then there 
is a tvector measure fi defined on the +Borel 
sets in Q with values in A such that is a 

t Radon measure for any x'e A' and 



77 - 



f(t)dg(t). 



T maps weakly convergent sequences in C(Q) 
into strongly convergent sequences in A. The 
product of two weakly compact linear opera- 
tors in C(D) is also compact. 



N. Absolutely Summing Operators 



A linear operator T from a Banach space A to 
a Banach space Y is called an integral operator 
if there is a positive tRadon measure g on the 
product Ax B of the unit ball A in A' and 
the unit ball B in Y" both equipped with the 
weak* topology such that 



<Tx,/> 



<x, x'X/,y">d;u(x', y"). 



or equivalently if T: A ^ 7" is decomposed 
as (4) with and /j replaced by L^(M) and 
Li(M) for a suitable compact space M with a 
Radon measure. Nuclear operators are clearly 
integral. Integral operators are weakly com- 
pact, but they are not necessarily compact or 
nuclear. However, the product of an integral 
operator and a weakly compact operator is 
nuclear, and hence every integral operator in 
a reflexive Banach space is nuclear. 

If a compact (resp. weakly compact) opera- 
tor T from a Banach space A to a Banach 
space Y maps a closed linear subspace A, of 
A into a closed linear subspace Y^ of Y, then 
the restriction : Aj -v Y^ and the induced 
operator T^'.XjX^ ->■ Y/Y^ are also compact 
(resp. weakly compact). The corresponding 
result does not hold for nuclear operators and 
integral operators. The following conditions 
are equivalent for a bounded linear operator 
T:X^Y: (i) There is a Banach space 7 => 7 
such that T:X~*Y is integral; (ii) there is a 
positive Radon measure g on the unit ball 
A of A' such that 



Tx\\^ 



|<x,x'>|d/i(x'); 



J 



(iii) there is a constant C such that 

^ ||Tx.||^Csup{Xl<Xi,x'>|| llx'Kl} 

holds for any finite set {xi , . . . , x„} in A; (iv) if 
Hx, is an ^unconditionally convergent series 
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in X, then X || Txj\\ converges. Grothendieck 
called such an operator a right semi-integral in 
view of property (i), and A. Pietsch called it 
absolutely summing in view of (iv). Similarly, 
a bounded linear operator T:X^Y is called a 
left semi-integral if there is a Banach space X 
such that X = X/X^ for a closed subspace X ^ , 
and the product T:X^Y is integral. The 
product of two right (resp. left) semi-integral 
operators is nuclear. A bounded linear opera- 
tor in a Hilbert space is right (or left) semi- 
integral if and only if it is of Hilbert-Schmidt 
class. A bounded linear operator T from a 
Banach space X into a Banach space Y is 
integral if and only if the dual T' is integral. 

It is a right (resp. left) semi-integral if and only 
if T' is a left (resp. right) semi-integral. 

For other related classes of operators ^ 
Grothendieck’s paper in Boletin Soc. Mat. Sdo 
Paw/o, 8 (1953). 
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Compact Groups 

A. Compact Groups 

A ^topological group G is called a compact 
group if the underlying topological space of G 
is a ■’'compact Hausdorff space. The ’"torus 
group T" = R”/Z” (n = 1 , 2, . . . ) (commutative 
group), the torthogonal group 0{n), the uni- 
tary group U{n\ the Nymplectic group Sp{n), 
and the additive group Zp of ^p-adic integers 
are compact groups (^ 60 Classical Groups; 
for other compact Lie groups 249 Lie 
Groups, 248 Lie Algebras). Let C(G) be the 
’'linear space formed by all the complex-valued 
continuous functions fg,h,... defined on a 
compact group G; C(G) is a ’Banach space 
with the norm 

Il/li-sup|/(x)|. 

jceG 

Since a compact group G is ’locally com- 
pact, there exists a right-invariant ’Haar mea- 
sure on G. Because of the compactness of G, 
the total measure of G is finite, and the mea- 
sure is also left-invariant. By the condition 
that the total measure is 1, such a measure is 
uniquely determined. The integral of / in C(G) 
relative to this measure is called the mean 
value of / Since for / g'eC(G), f{xy~^)g{y) is 
continuous in two variables x, y, the tconvo- 
lution / X g{x) = ^f{xy~^)g{y)dy also belongs 
to C(G). C(G) constitutes a ring under the 
multiplication defined by the convolution. 

This ring can be considered as an extension of 
the notion of a ’group ring for finite groups; it 
is called the group ring of the compact group 
G. The function x->/(x“’) will be denoted by 
/*, and the inner product in the ’function 
space L 2 (G) will be written as (f,g). 

B. Representations of Compact Groups 

Let G(E) be the group of units of tbounded 
linear operators on a Banach space E, and 
suppose that we have a homomorphism G of a 
topological group G into G(£). The homomor- 
phism U is called a strongly (weakly) contin- 
uous representation on £ of G if, for any aeE, 
the map x->’U{x)a of G into £ is continuous 
with respect to the tstrong (’weak) topology on 
£. When £ is a ’Hilbert space, a strongly con- 
tinuous representation U such that every 
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U {a) is a unitary operator is called a +uni- 
tary representation. Let U be any strongly 
continuous representation of a compact 
group G on a Hilbert space E\ for a, b in E, let 
<a, h} be the mean value of the '’'inner product 
((7(x)a, U(x)b) on E. Then G is a unitary repre- 
sentation on the Hilbert space E with the new 
inner product ^a,b}. 

A representation G of G on a Banach space 
E is said to be irreducible if E contains no 
closed subspace other than {0} and E, which is 
invariant under every G(x)(xeG). If a weakly 
continuous representation of a compact group 
on a Banach space E is irreducible, then E is 
finite-dimensional. Moreover, any unitary 
representation G of a compact group on a 
Hilbert space E can be decomposed into a 
discrete Mirect sum of irreducible representa- 
tions. Namely, there exists a family of 

irreducible (hence finite-dimensional) invariant 
subspaces E„ of E which are orthogonal to 
each other such that E = J^^^^E^. In particular, 
any continuous representation of a compact 
group on a finite-dimensional space is '•'com- 
pletely reducible. In L 2 (G) with respect to the 
tHaar measure on a ^locally compact group G, 
the representation G defined by (G(x)/)(y) = 
fiyx) {x,yeG,f eL 2 (G)) is a unitary repre- 
sentation of G. This representation G is called 
the (right) regular representation of G. Decom- 
position of the regular representation of a 
compact group G into irreducible represen- 
tations is given by the Peter-Weyl theory, 
described later. 

In the rest of this article, the representations 
under consideration will be (continuous) repre- 
sentations by matrices of finite degree. Let 
Di{x) = {d\j\x)) and D 2 {x) = {d\j\x)) be irredu- 
cible unitary representations which are not 
mutually ^equivalent. Then from tSchur’s 
lemma follow the orthogonality relations 

{du\ 4i’) = 0 and (^n^ 4‘n) = 

(where is the degree of the representation 
Gj). From each tclass of irreducible repre- 
sentations of G, choose a unitary representa- 
tive D^(x) = (d“j(x)), and let n„ denote its de- 
gree. Then from the orthogonality relations it 
follows that the ^/n^d*^(x) form an tortho- 
normal system of G 2 (^)- 

Let heC{G) and consider the map H:f-* 
hxfoi C(G) to C(G). Then // is a tcompact 
operator in C(G). Since C(G) is contained in 
L 2 (G), we can define the inner product in C(G). 
By (h X /, g) — ( f, h* xg),h — h* implies {Hf, g) 

= (/, Hg); that is, H is a ’^Hermitian operator. 
For a given / in C(G), there exists h{ = h*) in 
C(G) such that hxfis uniformly arbitrarily 
near f. From the theory of compact Hermitian 
operators, Hf can be uniformly approximated 
by linear combinations of the '•'eigenfunctions 
of H. Since the teigenspace of H is finite- 
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dimensional and invariant under U(a), the 
eigenfunctions of H are linear combinations of 
a finite number of y/^dfjix). Hence any func- 
tion / in C(G) can be uniformly approximated 
by linear combinations of a finite number of 
dfj(x). This fact, like the similar result in 
tFourier series, is called the approximation 
theorem. From this, it follows that the or- 
thogonal system {y^d*j(x)} is '•complete; i.e., 
if an element of C(G) is orthogonal to each 
element in this system, then it is 0. 

Since C(G) is dense in L 2 (G), {y/^dfjix}} is 
a •complete orthonormal system of the Hil- 
bert space L 2 (G). Hence for any /gL 2 (G), its 
“Fourier series” ZaZ;/C-,^/^d*y(x) (where c,j- 

— (/, y4i, d“)) converges to / in the mean of 
order 2 (i.e., with respect to the tmetric of 
L 2 (G)). In particular, if G is a compact ^Lie 
group and / is sufficiently many times dif- 
ferentiable, then this series converges uni- 
formly to f 

The space of dimension spanned by 
the elements d,“(x) (1 in the ith row of 

the matrix G„(x) is invariant under the right 
regular representation G. The representation 
on given by G can only be D^. Then the 
fact that is an orthonormal system 

of E 2 {G) means that the regular representation 
G of a compact group G is decomposable into 
a discrete direct sum of finite-dimensional 
irreducible representations. Each irreducible 
representation is contained in G with multi- 
plicity equal to its degree n^. 

If a function (p(x) in C(G) satisfies (p(y^^xy) 

— (p(x) for any x, y, then it is called a class 
function. The set K(G) of all the continuous 
class functions coincides with the '•'center of the 
group ring C(G). The tcharacter of an irredu- 
cible representation of G is a class function, 
and the set {/:«(>•)} of all characters plays 

the same role as the orthogonal system 
Namely, {x«(x)} is a 

complete orthonormal system in (the •com- 
pletion of) K(G), and any class function can be 
uniformly approximated by linear combi- 
nations of a finite number of these characters. 

The preceding paragraphs give a brief de- 
scription of the Peter-Weyl theory. If G is the 
1 -dimensional ttorus group T* =R/Z, namely, 
the compact group of real numbers mod 1, 
then this is actually the theory of Fourier 
series concerning periodic functions on the 
line. (For concrete irreducible representations 
of 0(n), U(n), Sp(rj), and formulas for charac- 
ters 60 Classical Groups. For represen- 
tations of compact Lie groups 249 Lie 
Groups, 248 Lie Algebras.) The theory of 
compact groups was completed by F. Peter 
and H. Weyl (Math. Ann., 97 (1927)), and J. 
von Neumann’s theory concerning almost 
periodic functions in a group (1934) united the 
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theory of compact groups with H. Bohr’s 
theory of almost periodic functions {^18 
Almost Periodic Functions). 



C. Structure of Compact Groups 

Let a be an element of a compact group G 
different from e. Since the underlying space of 
a topological group is a tcompletely regular 
space, there exists a function in C(G) such that 
f{a)^f (e). Hence there exists a representation 
D(x) of G such that D{a) is not equal to the 
unit matrix. This means that any compact 
group G can be expressed as a tprojective limit 
group of compact Lie groups. Beginning with 
this fact, von Neumann (1933) showed that a 
tlocally Euclidean compact group is a Lie 
group (-^ 423 Topological Groups N). 



D. Set of Representations 



© D2 — 



The set G' — {D} of representations of G by 
matrices admits the following operations: (i) 

Dx 0 £>2 (ttensor product representation); (ii) 

^Di 0 \ 

^ (tdirect sum representa- 
tion); (iii) P~^DP (equivalent representation); 
and (iv) D (complex conjugate representation). 
Let M be a subset of G' such that DeM when- 
ever DgM and the irreducible components 
of Dx 0 D 2 are in M whenever Dx ,D 2 ^M. 
Then M is called a module of representations 
of G. There is a one-to-one correspondence 
between closed normal subgroups H of G and 
modules M formed by all the representations 
of G/H. 

A representation of G’ is a correspondence 
which assigns to each D a matrix A{D) of the 
same degree as that of D and preserves the 
operations of G': A{Dx 0 £>2) = ^(^i) 0 ^(£>2), 
A{D,@D2) = AiDx)®A^XAiP-^DP) = 
P-^A{D)P, and AiD) = A{D). Let G" be the 
set of all the representations of G'. Define 
the product of Aj, A2eG" by AxA 2 (D) = 
AxiD)A 2 {D) and a topology on G" by the 
tweak topology of the functions A{D) of D. 
Namely, a typical neighborhood of Aq is of the 
form U{Ao;Dx,...,D,;E) = {A\\\A{Di)-Ao{Dx)\\ 
<^,i^ 1, ...,s}. G" is a topological group 
under this multiplication and topology. Then 
the Tannaka duality theorem states that G" ^ 

G holds (T. Tannaka, Tohoku Math. J., 45 
(1939)). Let R{G) be the talgebra over the 
complex number field C formed by the set of 
all the linear combinations of a finite number 
of d“j{x), and let AutR(G) be the automor- 
phism group of this algebra R(G). Let G* 
be the set of all the elements a in AutR(G) 
which commute with every left translation 
L(x) ((L(x)/)(y)= /(xy)) and which satisfy 



Then G* is a topological group 
with respect to the weak topology, and the 
Tannaka duality theorem implies that the 
correspondence which assigns to each xeG 
the right translation R(x) (the restriction of 
U (x) to R(G)) is an isomorphism of G onto 
G* as topological groups. For the case where 
G is a compact Lie group, C. Chevalley re- 
stated the Tannaka duality theorem as one 
giving a relation between compact Lie groups 
and complex algebraic groups (-^ 249 Lie 
Groups U). 
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Complexes 

A. General Remarks 

The notion of complexes was introduced by H, 
Poincare to study the topology of tmanifolds 
by combinatorial methods (^ 65 Combina- 
torial Manifolds). Various kinds of complexes 
have been introduced in the course of the 
development of topology. These will be dealt 
with individually in the sections that follow. 



B. Euclidean Simplicial Complexes 



Let R^'' be the A-dimensional tEuclidean 
space, and let Uq , a j , . . . , be points of R^ 

with coordinates a, = (a|'\aF’, ...,uJA>) for i = 0, 
1, For real numbers Xq, Xx, ■■■,X„, we 
denote by Xoa^ + XiOx + ...+X„a„ the point 

(Zr. 0 sr=0 , Sr-0 of R". 

A set of n-f 1 points Uq, Oj, ...,a„ of R^ is 
said to be independent or in general position if 
vectors ■■■, a^a^ are linearly inde- 

pendent. For given independent points Uq, 
ax, ... ,n„, let \aQUx ... n„| denote the subset of 
R'^ given by 

1 00^1 ...a„\ = {XQa(x + Xxax -I- ... 

+ X„a„ \ Xq + Xx + ... + A„ = 1,2.; ^0}. 



jao«i ...n„| is called an n-simplex with vertices 
Uq, ax, ...,a„. Simplexes are denoted by A, A', 
<7, T, etc. For example, a 0-simplex |ao| is a 
point Aq, a 1-simplex la^Ox \ is a segment d^d^, 
and a 2-simplex |ao^^i^J2l is a triangle Aaoaxa 2 . 
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We attach to X(,aQ + Xiai + ... + X„a„the co- 
ordinates (Ao, , • ■ ■ , 2„), called the barycentric 
coordinates. If lagfli ... a„| is an «-simplex, 
every subset J of {ao,cii, 

is independent. A (j-simplex •••a.J is 
called a ^-face of 

A set 51 of simplexes in a Euclidean space 
is called a Euclidean (simplicial) complex in R''^ 
if 5^ satisfies the following three conditions; (i) 
Every face of a simplex belonging to 51 is also 
an element of 51. (ii) The intersection of two 
simplexes belonging to 51 is either empty or a 
face of each of them, (iii) Each point of a sim- 
plex belonging to 51 has a neighborhood in R''' 
that intersects only a finite number of sim- 
plexes belonging to 51. A Euclidean complex is 
also called a geometric complex (or rectilinear 
complex). Each 0-simplex in 51 is called a ver- 
tex in 51. We define the dimension of 51 to be 
« if 51 contains an n-simplex but no (n-l- 1)- 
simplex, and oo if 51 contains n-simplexes for 
all « ^ 0. 

By a subcomplex of a Euclidean complex 51 
we mean a Euclidean complex that is a subset 
of 51. For a Euclidean complex 51, its r-section 
(or r-skeieton) is defined to be the subcomplex 
of R consisting of all n-simplexes {n ^ r] in 51. 

If 51 is a Euclidean complex in R'^', we de- 
note by 1 51 1 the set of points in R''' belonging to 
simplexes in 51. This set |5I| is called the Eucli- 
dean polyhedron of 51. 

By subdivision 51' of a Euclidean complex 51 
we mean a Euclidean complex such that |5I'| 

= |5I| and each simplex in 51' is contained in a 
simplex in 51. 

Specifically, we can construct a subdivision 
51' of 51 utilizing tbarycenters of simplexes in 51; 
namely, we let 51' be the set of all r-simplexes 
whose vertices consist of barycenters of the 
series Aq c A^ <=...<= of simplexes in 51. Then 
51' is a subdivision of 51, called the barycentric 
subdivision of 51 and denoted by Sd 51. 

Given a Euclidean complex 5^ and a subset 
A of 1 51 1, we define the star of A in 51 to be the 
subcomplex of 51 that consists of simplexes {A} 
and their faces such that A (1 A ^^0. Further- 
more, we define the open star of A in 51 as 
the union of topen simplexes (the interiors of 
simplexes) of 51 whose closures intersect A. We 
denote by Stj^(A) the star of A in 51 and by 
0^{A) the open star of A in 51; then 0^,(A) is an 
open set whose closure is |Srjj(A)|. 

The notion of Euclidean simplicial com- 
plexes can be generalized to that of Euclidean 
cell complexes; this is done by replacing the 
term simplex by tconvex cell in the definition of 
Euclidean simplicial complex. For a Euclidean 
cell complex R, the notions of vertex, dimen- 
sion, subcomplex, r-section, and subdivision 
are defined similarly as in the case of Eucli- 
dean simplicial complexes. 
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C. Simplicial Complexes 

Given a Euclidean simplicial complex 51, let K 
denote the set of all the vertices in 51, and let 
Z denote the set consisting of those subsets 
(uq, . • . Tr} of ^ for which there exist simplexes 
A in 51 such that {dq, . . . , coincide with the 
set of vertices of A. Then we have (1) if seT 
and s ^ s', s' ^ 0, then s'eZ; (2) every set con- 
sisting of a single element in K is in Z, and the 
empty set is not in Z. 

In general, if a pair (K, Z) of a set K and a 
set Z consisting of finite subsets of K satisfy (1) 
and (2), then the pair (or the set K) is called an 
abstract simplicial complex (or simply sim- 
plicial complex). If X is a simplicial complex, 
each element of the set K is called a vertex in 
K, and each set of Z is called a simplex in K. A 
simplex consisting of n + 1 vertices is called an 
n-simplex. We say that a simplicial complex K 
is finite if it consists of a finite number of ver- 
tices; it is locally finite if every vertex of K 
belongs to only finitely many simplexes in K. 
We define similarly countable simplicial com- 
plexes and locally countable simplicial com- 
plexes. The dimension and r-section of a sim- 
plicial complex are defined as in the case of 
Euclidean complexes. By a subcomplex of a 
simplicial complex K we mean a simplicial 
complex Xq such that each simplex of Xq is a 
simplex in X. 

Let X and L be simplicial complexes. A 
mapping ^;X->L is called a simplicial map- 
ping (simplicial map) of X to L if the following 
condition is satisfied: Ifi»o, are ver- 

tices of a simplex of X, then (p{vq), (pivj), . . . , 
(p{v„) are vertices of some simplex of L. Two 
simplicial complexes X and L are said to be 
isomorphic if there exist simplicial mappings 
qy. K^L, ij/:L->^K such that tj/ocp and qyoij/ 
are the identity mappings. 

Given a simplicial complex X, let |X| denote 
the set of all functions x from the set of ver- 
tices of X to the closed interval / = [0, 1] satis- 
fying the following conditions: (i) The set 
{ueX|x(!;)/0} is a simplex of X. (ii) Zt,6ic^(i') 
— 1. The value x(r) is called the barycentric 
coordinate of the point xe|X| with respect to 
the vertex v. Each vertex of X is identified 
with the point of 1X| whose barycentric co- 
ordinate with respect to the vertex i? is 1 and is 
called a vertex in |X|. For a simplex s = {i;o,Di, 
in X, we define |s| = {xg|X] |x(i;) = 0 
(p^s)}, which is called a simplex in |X|. We 
call {xe|s| |x(p,)>0 (i==0, !,...,«)} an open 
simplex of |X| or the interior of |s|. We remark 
here that for an arbitrary simplex s in X a 
point xe|s| can be written in the form x = 
Si,es^(i^) ■ V. We can define a metric on |X| 
by ^/(x,y) = (i:„^K(x(i;)-y(t;))2)‘/2. However, |X| 
is usually supplied with a tstronger topology 
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defined as follows: (1) Each simplex js| in |K| 
has the topology given by the metric d. (2) A 
subset U of 1K| is open if and only if L/fl js| 
is an open subset of |s| for each simplex s of 
K. Henceforward, by the topology of |K | we 
mean the topology just defined unless other- 
wise stated. The set |K| with such a topology 
is called the polyhedron of K. The topology 
of |K| coincides with the above metric topol- 
ogy if and only if K is locally finite. If sim- 
plicial complexes K and L are isomorphic, 
then |K| and |L| are homeomorphic. If K is 
the simplicial complex defined by a Euclidean 
simplicial complex R, then |/C| and |5I| are 
homeomorphic. When K is finite, there exists 
an Euclidean simplicial complex R whose sim- 
plicial complex is isomorphic to K. Accord- 
ingly |K| is homeomorphic to the Euclidean 
polyhedron |R|. 

If K and L are simplicial complexes, a map- 
ping /:|K|->|L| satisfying the following con- 
dition is said to be linear; If s — {vq,Vi, 
is a simplex in K and x = /.oVq+ + (2q + 

. . . -I- = 1 , 2; ^ 0), then /{vq), belong 

to a simplex in L and f(x) — ^ofi^o) + • • • + 

The linear mapping determined by 
a simplicial mapping (p : K ->L is denoted by 
1 (p I : I X I ^ I L I and is also called a simplicial 
mapping and denoted by the same letter (p. 

Let K and K' be simplicial complexes. If 
there exists a linear mapping l:\K'\^\K\, 
which is a homeomorphism, then we identify 
|K| and |iC'| by I and call K' a subdivision of 
K. The barycentric subdivision SdX of a sim- 
plicial complex K is defined as in the case of 
Euclidean complexes. We also have notions of 
star StK(A) and open star Of^{A) for a simpli- 
cial complex K and a subset A of |K|. If K 
and L are simplicial complexes, a mapping 
/:|K|^|Lj is called a piecewise linear map- 
ping if there exist subdivisions K' and L' of 
K and L, respectively, such that / :|X'|^|L'| is 
linear. 

Given an open tcovering = of a 

topological space X, the index set K becomes 
a simplicial complex if we consider each finite 
nonempty subset s of X such that 
0 (empty) to be a simplex. The resulting sim- 
plicial complex K is called the nerve of the 
open covering 95i. Furthermore, if = 
and 91 = {Wu,}weL are open coverings of a set X, 
91 is a trefinement of the covering and L is 
the nerve of 91, then a simplicial mapping (p : 
L-*K is defined by sending each vertex w in 
L to a vertex i; in K such that N^czM^. 

Given two disjoint simplicial complexes K 
and L, a simplicial complex K * L, called the 
join of K and L, is defined by the following: (1) 
the vertices of K * L are the vertices of K and 
the vertices of L. (2) A nonempty subset of 
vertices is a simplex of X * L if and only if its 



subsets in X and L are empty or simplexes 
there. In particular, the join of a simplicial 
complex X and a single point is called the cone 
of X. 

A simplicial complex X is said to be ordered 
if a tpartial ordering is given in the set of ver- 
tices in X such that the set of vertiees of each 
simplex is ttotally ordered. Given ordered 
simplicial complexes X and L, an ordered 
simplicial complex X x L, called the Cartesian 
product of X and L, is defined by the following: 
(1) The vertices in X x L are pairs {v, w), where 
V and w are any vertices in X and L, respec- 
tively. (2) A set of vertices (^o, Wq), . . . , (i;„, w„) 
such that and Wq ^ w„ is a 

simplex in X x L if (dq, and (wq, 
are simplexes in X and L, respectively; all 
simplexes in X x L are obtained in this man- 
ner. (3) (f^i, w,)^(r 2 ,W 2 ) if and only if v^^V 2 
and Wj. 

Assume that either X or L is locally finite or 
that both X and L are locally countable. Let 
X and Y be topological spaces and / be the 
closed interval [0, 1], We define an equivalence 
realtion ~ in the topological space X Li (X x Y 
X /)U y by x~(x,y,0) and y~(x,>’, 1), where 
xeX, ye y. The quotient space of XU(X x y 
X /) U y by this relation is called the join of X 
and y and is denoted by X * y. Then the poly- 
hedron |X X L| is homeomorphic to the prod- 
uct space |X| X |L1 of the topological spaces 
I X I and I L |. And the polyhedron | X * L | is 
homeomorphic to |X| * |L|. 

By a triangulation T of a topological space 
X we mean a pair (X, t) consisting of a sim- 
plicial complex X and a homeomorphism 
t:|X|^X. A triangulation is also called a 
simplicial decomposition. If T =(X, t) is a trian- 
gulation of X, the various concepts defined for 
X can be transferred to X by means of the 
mapping t. For example, by a simplex of the 
triangulation T we mean the image of a sim- 
plex of |X| under t. We say that a triangula- 
tion T =(X, t) is finite if X is a finite simplicial 
complex. If T = (X, r) is a triangulation and 
(X', /) is a subdivision ofX, then T' = (K',tol) 
is called a subdivision of T. If / =(X,, t,), T 2 
= 1 X 2 , 12 ) are triangulations of topological 
spaces X, , X 2 , respectively, a mapping / :Xi 
-♦X 2 is called a simplicial mapping relative to 
Ti and T 2 if o f o tj : IXj |->|X 2 | is a sim- 
plicial mapping. The following two problems 
on triangulations are famous: (1) Under what 
topological conditions is it possible for a given 
topological space to be supplied with a trian- 
gulation? (2) Given two triangulations T^, Tj 
of a space X, are there subdivisions T( — 

(K\ , t\), T 2 = (X 2 , ^ 2 ) of Ti and T 2 , respec- 
tively, such that K[ and K '2 are isomorphic? 
Concerning the second problem, the conjec- 
ture asserting the existence of subdivisions T[ 
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and T 2 as above is known as the fundamental 
conjecture (Hauptvermutung) in topology. 
Every 3-dimensional manifold is triangulable, 
and any two of its triangulations admit sub- 
divisions satisfying the condition in (2) (E. E. 
Moise [7]). 

In 1961, J. Milnor showed that the funda- 
mental conjecture is not true for polyhedra 
[8]. The triangulation problem and the funda- 
mental conjecture for topological manifolds 
were negatively solved by R. Kirby and L. 
Siebenmann [9] (-^ 65 Combinatorial Mani- 
folds). Any tdifferentiable manifold is triangu- 
lable, and the fundamental conjecture holds 
for its ^C-triangulations (r ^ 1) [6] (—"114 
Differential Topology). 

Let T] , T 2 be triangulations of topological 
spaces X 2 , respectively, and let /: 

be a continuous mapping. Then a simplicial 
mapping cp:Xi^X 2 relative to and T 2 is 
called a simplicial approximation to / if, for 
each xeX, the image (p(x) lies on the simplex 
of T 2 whose interior contains f{x). The follow- 
ing existence theorem is called the simplicial 
approximation theorem; For every continuous 
mapping f:X^^X 2 , there exist a subdivision 
T{ of Ti and a simplicial mapping (p:X^^X 2 
relative to T( and T 2 that is a simplicial ap- 
proximation of f If the triangulation is 
finite, then for a sufficiently large n we can 
choose Sd" T, as the T( above (where Sd° T — 
Tand Sd”T-Sd(Sd" ' T) (n^l)). Ifcp-.X^-^ 
X 2 is a simplicial approximation to a continu- 
ous mapping /, then / and <p are +homotopic. 



D. Cell Complexes 

Let V" be the +unit n-disk, be the tunit 
(n — 1 (-sphere, and X be a Hausdorff space. For 
a subset e of X, let e be the closure of e in X, 
and let e = c — e. A subset e of the space X is 
called an n-cell, or open n-cell in X if there is a 
relative homeomorphism S"“')-^(e,e), 

i.e., a continuous mapping cp \ F"-ve such that 
<p(S" ')cc and (p: F" — -^e — e is a homeo- 
morphism. For example, S" — {p} (peS") is an 
fi-cell. A set {c;i|agA} of cells in the Haus- 
dorff space X is called a cellular decomposition 
of X if the following three conditions are satis- 
fied: (i) is empty if (ii) X = 

(iii) If the dimension ofc^ is n+ 1, then e 2 <^ 

X", where X” is the union of all the cells 
(peA) whose dimensions are not greater than 
n. For example, the n-sphere S" has a cellular 
decomposition consisting of a single 0-cell 
and a single n-cell. 

A Hausdorff space X together with its cellu- 
lar decomposition {e^} is called a cell com- 
plex, and each is called a cell in the cell 
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complex AT. For a cell complex, the notions of 
vertex, n-section, and dimension are defined in 
the same way as the corresponding notions in 
Euclidean complexes. Let be a cell complex 
and A a topological subspace of X such that 
the closure of each cell of X intersecting A is 
contained in A. Then the set of cells e such 
that cfl/l 9-^0 forms a cellular decomposition 
of A. The set A together with this cellular 
decomposition is called a subcomplex of the 
cell complex X. A cell complex X with its cells 
|c;} is said to be finite if the number of is 
finite. If each point in a cell complex X is an 
interior point of some finite subcomplex of X, 
then X is said to be locally finite. We define 
similarly a countable cell complex and a locally 
countable cell complex. If the closure of each n- 
cel! of a cell complex X is homeomorphic to 
F" (« = 0, 1 , . . . ), is said to be regular. If X 
and Y are cell complexes, a continuous map- 
ping f-.X^Y such that /(2f ") c T" (n = 0, 1, . . . ) 
is called a cellular mapping. If X and F are cell 
complexes, the set of ^topological products 
Cl X ^2, where Cj , 62 run over all cells of X, F, 
respectively, is a cellular decomposition of the 
product space X xY. The resulting cell com- 
plex X F is called the product complex of the 
cell complexes X and F. 

A cell complex X is said to be closure finite 
if each cell in X is contained in a finite sub- 
complex of X-, and X is said to have the weak 
topology if a subset U<=X is open if and only if 
C/n? is relatively open in e for each cell e of 
X. We call a cell complex a CW complex if it is 
closure finite and has the weak topology. The 
cellular decomposition of a CW complex is 
called a CW decomposition. A locally finite cell 
complex is a CW complex. 

Fundamental properties of CW complexes 
are as follows: (i) A CW complex is a tpara- 
compact (hence '•'normal) space and is tlocally 
contractible, (ii) A subcomplex v4 of a CW 
complex is a closed subspace of X, and A 
itself is a CW complex, (iii) A mapping f:X^ 

F of a CW complex X to a topological space 
F is continuous if and only if the restriction 
f\e is continuous for each cell e of X. (iv) If 
X and F are CW complexes and f :X->Y is 
any continuous mapping then there exists a 
cellular mapping of A to F that is homotopic 
to / (cellular approximation theorem), (v) A 
pair {X, A) consisting of a CW complex X and 
its subcomplex A has the '•'homotopy extension 
property for any topological space, (vi) A CW 
complex has the tcovering homotopy property 
for any tfiber space, (vii) The product complex 
X X F of two CW complexes X and F is not 
necessarily a CW complex, but it is thomo- 
topy equivalent to a CW complex, (viii) If 
either or F is locally finite, or if both X and 
F are locally countable, then the product 
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complex X X y is a CW complex, (ix) For CW 
complexes X and Y, the tmapping space 
is homotopy equivalent to a CW complex. 

(x) The tcovering space of a CW complex has a 
CW decomposition. 

If K is a simplicial complex, the polyhedron 
|K| is a regular CW complex whose cells are 
all open simplexes in |X|. A simplicial complex 
K is (locally) finite if and only if the CW com- 
plex |K| is (locally) finite. In particular, the 
Euclidean polyhedron of a Euclidean sim- 
plicial (or cell) complex is a locally finite CW 
complex. A polyhedron |X| generally admits a 
CW decomposition whose cells are far smaller 
in number than the simplexes constituting a 
simplicial decomposition of K. For any CW 
complex X, there exists a polyhedron |X| 
that is homotopy equivalent to X. In partic- 
ular, if X is an n-dimensional finite (coun- 
table) CW complex, we can choose as K an 
n-dimensional finite (locally finite and coun- 
table) simplicial complex. 

E. Semisimplicial Complexes 

By an ordered simplex in a simplicial complex 
K we mean a finite sequence (uq, , . . . , d„) 
(n^O) of vertices in K, contained in the set of 
vertices of a simplex in K. Let 0{K)„ be the 
set of all ordered simplexes of K of length n + 

1, and define mappings di:0(K)„->0{K)„_i 
and s,:0(X)„-^0(K)„+i for i = 0, 1, ...,« by 



S;(Po, . . . , r„) = (Po, . . . , , t;,-, Uf, tJ,+i 

Then the following relations hold: 


diOdj = dj_iodi 


{i<A 


SiOS~Sj+iOSi 


(KA), 


o 

1 

W 

o 


ii<jl 


o 

ii 

6 

T 


0>A+1), 


diOs~di+iOSi = 


identity. 



Let A" be the n-dimensional simplex in R" 
with vertices eo = (0,0, ...,0), ei =(1,0, ... ,0), 

. . . , = (0, . . . , 1). By a singular n-simplex in a 

topological space X we mean a continuous 
mapping T:A"^X. Let S{X)„ be the set of all 
singular n-simplexes in X, and define map- 
pings a..:S(X)„^S(X)„^i and s,:S(A)„^S(A)„,i 
for i = 0, l,...,nby a,T(Ao, ..., A„) = T(Ao, ... , 

, 0, A,., . . . , AJ and s,. T(Ao, . . . , A„+i) = T(Ao, 
...,/l„+i), where (Aq, 

...,A„)is the point D"=oAie,-, S"=o^i= 1- 
Then relation (1) holds between 5; and s^. 

Because of the importance of relation (1), 
which is basic in defining thomology of sim- 
plicial complexes and topological spaces (-^ 
201 Homology Theory), S. Eilenberg and J. A. 
Zilber gave the following definition: A semi- 



simplicial complex K consists of a sequence of 
sets (n = 0, 1, • ■ • ) together with mappings 
a,. : , S; : (i = 0, 1 , . . . , n) satis- 

fying relations (1). An element of K„ is called 
an n-simplex in K, and a,-, S; are called the ith 
face operator and the ith degeneracy operator, 
respectively. A simplex is said to be degenerate 
if it is the image of a simplex under some s,-. A 
semisimplicial complex is abbreviated as s.s. 
complex. The s.s. complexes 0{K)= {0{K)„, 
a;,s,} and S(A) = {S(X)„, a;,Sj} are called the 
ordered complex of K and the singular complex 
of X, respectively. 

Let K be an s.s. complex, and let L„ be 

a subset of K„ for n = 0, 1, If 

and S;(L„)czL„+i for each i, then L—{L„, 
a,- 1 L„, Sj t L„} is an s.s. complex, and L is called 
a subcomplex of the s.s. complex K. If A is a 
subspace of a topological space X, S(A) is 
a subcomplex of 5(A). If K is an ordered sim- 
plicial complex, a subcomplex 0'{K) of 0(K) is 
obtained by considering the set of all ordered 
simplexes (uq, ijj , . . . , uJ such that Uq ^ 

. . . ^ i;„ (n = 0, 1, . . . ). If X and L are s.s. com- 
plexes, a sequence /={/„} of mappings /„ : 
K„^L„ defined for each n is called an s.s. map- 
ping if a,o/„=/„_i oa,- and SiO/„=/„+i os; 

(0 ^ j ^ n). If /: A ^ y is a continuous mapping 
of topological spaces, then / determines an s.s. 
mapping S(/):5(A)^5(y) by 5(/)(T)=/o T. 
Two s.s. complexes X and L are said to be 
isomorphic if there is a bijective s.s. mapping of 
X to L. For two s.s. complexes X and L, we 
define the Cartesian product X x L to be the 
s.s. complex given by (X x L)„ = X„ x L„, di(a, t) 
= (a,(7,aiT), Si((T, t) = (s,<T,S;T) ((TGX„,TeL„). If X 
and L are ordered simplicial complexes, the s.s. 
complexes 0'(X) x 0'(L) and 0'(K x L) are 
isomorphic. If A and Y are topological spaces, 
the s.s. complexes 5(A) x S(T) and 5(A x y) 
are isomorphic. 

Given an s.s. complex X, we construct a 
topological space |X| as follows: First, we 
provide X„ with the tdiscrete topology and 
consider the topological space X = (J„^oX„ x 
A". Next we consider simplicial mappings 
8':A"“^->^A” and A"'^^->A" defined by s'(Pj) 
^Pj iJ<il e%)^Pj+i (;>0 and n\Pj) = Pj 
U ^ 0 , = Pj I U > where Po, ■■■,?„ are 

the vertices of A". The topological space | X | is 
defined to be the tquotient space of X with 
respect to an equivalence relation ~ that is 
defined by the following: {8ia,y)~{a,s'(y}) 
(ffeX„,yeA"“i), {Sia,y)~{o,t]^{y)) {ceK„, 
yeA”'^^), where i = 0, The space |X| is 

called the (geometric) realization of the s.s. 
complex X. Given an s.s. mapping / : X->L, 
we obtain a continuous mapping |/l:|X|-^jL| 
defined by |/|(lo',yt) = |/(o'),y|, where lcr,y| is 
the point in |X| represented by (c7,y)eX. We 
call I/I the realization of the s.s. mapping / 
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The realization |K| of an s.s. complex K is a 
CW complex whose cells are in one-to-one 
correspondence with the nondegenerate sim- 
plexes in K. For a topological space X, a 
tweak homotopy equivalence p:|5(A')t^2L is 
defined by p{\T,y\)^T{y){TeS{X)„,yeA"). 
This mapping gives rise to a homotopy equiv- 
alence when A" is a CW complex. 

The singular complex S(2f) of a topological 
space X has the following property: Given 
simplexes with 

diOj = dj^ 1 O'; {/ < j, i,j k), there exists a simplex 
(reK„+i with diO = ai {i¥=k). An s.s. complex K 
with this property is called a Kan complex. If, 
in addition, 5 ,(t = d^a' (cr, c'eK„,i^ k) imply 
— we call K a minimal complex. For 
every Kan complex K, there are minimal sub- 
complexes M of X that are isomorphic to each 
other. Moreover, |M| is a Reformation retract 
of |X|. For a Kan complex K, the thomotopy 
group can be defined combinatorially. 

Two CW complexes X and Y are homotopy 
equivalent if and only if the minimal sub- 
complexes of S(AT) and S(T) are isomorphic. 



F. Eilenberg-MacLane Complex 

Given an integer n ^ 1 and a group n (Abelian 
if n'^2), there exists an tarcwise connected 
topological space X for which the thomotopy 
groups 71, (A") are trivial for and 7c„(A) = 7t. 
Such a space is called an Eilenberg-MacLane 
space of type (tt, n). Let fi( A ; A, *) be the tpath 
space over A, and let Po:Q(A; A, *)^A be 
the natural projection. If A is an Eilenberg- 
MacLane space of type {n, n), then (Q(A ; A, *), 
pQ, X) gives rise to a standard tcontractible 
tfiber space whose fiber is an Eilenberg- 
MacLane space of type {n, n — 1). Assume 
that A is an Eilenberg MacLane space of type 
(tt, n) with Abelian group n. Then A is (« — 1)- 
connected; hence the tHurewicz theorem can 
be utilized to show the existence of an isomor- 
phism h : 7T„( A) = H„{X), while the universal 
coefficient theorem can be utilized to show 
that //"(A;7i) = Hom(//„(A),7r). Since in this 
case we have 7c„(A) = 7i, the element h~^ e 
Hom(//„(A), n„{X)) can be regarded as an 
element of Hom(//„(A),7t). Now the funda- 
mental class of A is defined to be the coho- 
mology class ueH"{X;n) corresponding to 
Let y be a +CW complex and n{Y\X) be 
the set of ^homotopy classes of continuous 
mappings from Y to A. Then there exists a 
one-to-one correspondence n{Y;X)^H"{Y; n) 
given by the assignment [/]-»/*« 305 

Obstructions). Let S(A) be the singular com- 
plex of A, and let M(A) be a minimal complex 
of S'(A). If A is an Eilenberg-MacLane space 
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of type (tc, n), M(A) is isomorphic to a certain 
complex determined uniquely by n and n. This 
complex is called the Eilenberg-MacLane 
complex of type (ti, n) and is denoted by 
K{n, n). The notation K{n, n) is also used to 
mean the space A itself. 

Let A{q) be a simplicial complex whose sim- 
plexes are all non vacuous subsets of {0, 1, 

Let ep-Alq— l)->A(g) be the simplicial 
mapping defined by £;(;') =j (0 ^ i - 1), e,(;) 

^j+l{i<j<q-i), and let t/,: A( g-h l)^A(g) 
be the mapping defined by — j (0 i), 

1). Now K{n,n) is a 

Kan complex defined by K (n, n), = Z"{A(q); n), 
dia — oo^i, Sia — aorji, where Z°(A(g); n) = n, 
Z^{A(q);n) is the set of 7t-valued functions 
defined on the set of pairs (i,j) such that 
i<j^q and satisfying the equality (j{j, k) ■ 

(j{i, /c)“' • <r(i, j) =1 for 0^i<j <k^q, and 
Z"(A(q); 7 t) (n^2) is the group of toriented 
cocycles of the simplicial complex A(g). If n is 
Abelian, the structure of the Abelian group 
Z"(A(^); n) gives K{n, n) the structure of an 
Abelian group in the s.s. category. This struc- 
ture yields a one-to-one correspondence 
K{n,n)q^K{n,n— l)q- i x ... x K{n,n— l)o for 
n^\ and leads to the expression of zeK(n,n)g 
in the form ...,cto!) with CT;eX(7r,M — l)j. 

The kF-construction of K{n,n— 1) for 1 
is a Kan complex W(n,n — l) defined by W{k, 
n-\\ = K{n,n-\\xK{^l,n\2^nd^o{(rqXz^)^ 
(doCg) • a,_i X doZg, 5,(o-„ x zj = ^,<7, x for 
1 ^i^q, S;(o-q X Zg) = Sia^x SjZ^, where cr, eK{n, 
n-l\ and x, = <a,_i, ...,CTo>eK(7r,n),. Let p: 
W{n, n—\)^K{n, n) be a natural projection. 
Then {W{n,n—\), p,K{n, n)) plays the role of 
the tuniversal bundle for K(7r, n — 1) in the s.s. 
category in the following sense: Let L be an 
s.s. complex, and let f:L->-K{n, n) be an s.s. 
mapping. We define W{n, n — 1) to be the 
subcomplex of W{Ti,n—\) x L generated by 
simplexes (cr, x t,) x p, such that x, = /(p,), 
where ct, x x,g W{n,n—\)^ and PgeL^. Let 
p\ f*W{Ti,n—\)^L be the natural projection. 
Then (f* W(n, n — 1), p, L) is called the prin- 
cipal fiber bundle induced from W{n,n— 1) by 
/ Any principal bundle over L with group 
K{n,n— 1) can be expressed as an induced 
bundle. This property means that (W{n,n — \), 
p, K{n, n)) is universal. On the other hand, 
we have an algebraic analog of the univer- 
sal bundle for the chain group of K{n,n— 1), 
called the bar construction [12]. Both these 
concepts were defined by Eilenberg and Mac- 
Lane in order to determine the structure of the 
(co)homology of K{n, n), which is denoted by 
This object was later 

achieved by H. Cartan, who introduced an 
improved notion called Cartan construction 
(-^ Appendix A, Table 6.III). Let 7z(L,K(n,n)) 
be the set of s.s. homotopy classes of s.s. map- 
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pings from L to K{n,n). If n is Abelian, there 
exists a one-to-one correspondence n{L,K{n, 
given by the assignment [/]-»^ 
f*u, where ueH"{n,n-,’n.) is the fundamental 
class of K{7i,n). By virtue of this correspon- 
dence, f*u£H"{L\ n) determines an in- 
duced bundle f* W{n, « — 1) uniquely up to 
equivalence, which is denoted by K{n, 

« — 1) x^L. 

Let X be an arcwise connected topological 
space. Let X" be the Cartesian product of n 
copies of X. Clearly, the symmetric group S„ 
of degree n operates on X". The n-fold sym- 
metric product SP"X of X is defined to be the 
quotient space of X" under the action of S„. If 
we specify a reference point of X, we have a 
natural inclusion SP"~^ X czSP"X and can 
consider the inductive limit space (J, 
denoted by SP"^X. Then the Dold-Thom 
theorem [15] shows that M{SP'^X)~ 
n,“ 1 K{Hi{X), i). In particular, we have 
M{SP'^S") = K{Z,n) for 1. This result can 
be applied to obtain a direct relationship be- 
tween the axiomatic definition of tcohomology 
operations using K{n,n) due to Eilenberg and 
Serre and the constructive definition using the 
symmetric groups due to Steenrod (A. Dold 

[16], T. Nakamura [17]). For a detailed study 
of the (co)homology of SP"X [18]. 



G. Postnikov Complex 

Let X be an arcwise connected topological 
space. For the sake of simplicity, throughout 
this section we assume that X is tsimple (-^ 

202 Homotopy Theory). Then the tPostnikov 
system of X can be defined as an inverse sys- 
tem {X„,p„) (n = 0, 1, 2, . . . ) consisting of topo- 
logical spaces X„, continuous mappings p„: 
X„-yX„^i, and a system (X,q„} (n = 0, 1,2, ...) 
consisting of continuous mappings q„\X^X^ 
such that p„oq„ = q„-i and satisfying the fol- 
lowing three properties: (1) Xq is one point. (2) 
{X„,p„,X„^i) is a tfiber space induced from a 
standard contractible fiber space over an 
Eilenberg-MacLane space of type {n„{X),n+ 1) 
by a mapping corresponding to a cohomology 
class k”^^eW^HX„-,;n„(X)). (3) q„*:n,{X)^ 
7ti(A'„) gives an isomorphism for O^i^n. 

These cohomology classes are called 
Eilenberg-Postnikov invariants (or simply k- 
invariants). Corresponding to the above facts, 
the minimal complex M{X) can be obtained 
as the inverse limit of a certain inverse sys- 
tem (K{n%p{n)} consisting of Kan com- 
plexes K{n) and s.s. mappings p(n):K(n)-» 

K{n — l) defined by K{0) = K (0, 0) and K (n) — 

K (n„, n) X ^„ + 1 K (n — 1 ) for n ^ 1 with natural 



projections p{n), where n„ = n„(X) and k"'^^ e 
This system is determined 
uniquely up to s.s. homotopy equivalence by 
its limit, called the Postnikov complex and 
denoted by K{n^,k^,n 2 , ...,k"'^^,Ti„, ...). As 
yet we are ignorant of an effective method of 
computing the cohomology of a Postnikov 
complex from n„ and k"^^. 
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Complexity of Computations 

A. Measures for Complexity of Computation 

Intuitively, complexity of computation means 
the amount of computing efforts measured on 
some suitable scale. When a problem can be 
solved by means of any one of several algo- 
rithms, it is highly desirable to compare the 
complexities of those algorithms. For example, 
the power can be evaluated with a pocket 
calculator by using either of the algorithms 

X^® = x X X X X X X X X X X X X X X X X X X, (A) 
u = xxx, v — uxu, w = vxv. 



The complexities of these algorithms can be 
measured by various quantities: (Tl) time in 
seconds, (T2) the number of times keys and 
buttons are touched, (T3) the number of basic 
operations (here, multiplications), and (S) the 
number of values to be stored in the calcu- 
lator. The quantities (Tl), (T2), and (T3) are 
called time complexities, and (S) is called the 
space complexity. Obviously, algorithm (B) is 
preferable with respect to time complexity, 
although (A) is better with respect to space 
complexity. (If the calculator has only one 
memory register, then only (A) can be executed 
without recording numbers by hand.) For 
many calculators, the following algorithm 
is best with respect to both time and space 
complexities: 

xiO-((x")^xx)^ (C) 

In the general theory of complexity, the num- 
ber of basic operations (T3) is often taken as 
the basis of time complexity, since it represents 
the intrinsic complexity of the algorithm, 
rather than of extrinsic factors such as human 
skill or mechanical performance. 

For the evaluation of a power x" in general, 
the complexity depends on the value of n: such 
a parameter dominating the complexity is 
called the size of the problem. Let Txin) be 
the number of multiplications required for 
evaluating x" by an algorithm X. Let ^ be the 
set of all algorithms for evaluating x". The 
time complexity T{n) for evaluation of the 
power is then defined by 



«-cclog2n 

In a problem concerning a tgraph, the num- 
ber m of nodes in the graph, the number n of 
its edges, or their sum m + n are chosen as 
the size of the graph. However, the complex- 
ity is not determined by such a size, because 
there are many different instances (particular 
graphs) of the same size. So the complexity of 
this problem is defined in the following man- 
ner. Let T{I) be the complexity of solving an 
instance / of the problem and L(n) be the set 
of all possible instances of size n. Then 

IF(n)=Max,,^,„)T(;) 



A(«)=xp(^)m 

where p{I) denotes the relative frequency of 
the instance I. The value W{n) is called the 
worst-case complexity, and A{n) is called the 
average complexity of the problem. 

In some cases the complexity of a circuit is 
measured by the number of its building blocks 
[1], and the complexity of a program is mea- 
sured by its length [2, 3]. 



B. Complexity of a Decision Problem 

A decision problem and its complexity are 
rigorously formulated in terms of tTuring 
machines. Let Z" be a finite set of symbols and 
be the whole set of nonempty strings of 
symbols in Z. Every instance of the problem is 
assumed to be represented by a string in Z"^. A 
decision problem is a triple (Z, L, P) of the set Z 
of symbols, a subset L of Z^, and a mapping P 
from L to {0 (false), 1 (true)}. The set L repre- 
sents the set of all possible instances of the 
problem. The problem is said to be solvable if 
and only if there exists a Turing machine M 
satisfying the following condition for every 
string a in the set L: When a tape containing 
the string a is given to the machine M whose 
head is initially put on the leftmost symbol of 
a, the machine writes the value of T(a) on the 
tape and stops after a finite number of steps. 
Such a T uring machine is said to compute the 
function P. The number T^v/(a) of steps is the 
time complexity of solving the instance a by 
the machine M, and the length Sxf(a) of the 
used area of the tape is the space complexity. 
The worst-case complexity Tj^f(n) is defined by 



Ti^(u) = Max„^2.>|=„ Tm(oc). 
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The space complexity Syy(n) for the size n is 
defined in a similar way. The complexities 
T{n) and 5(n) of the problem are not defined, 
because improvement by a constant factor is 
always possible for any Turing machine [5]. 

A problem is said to be solved in linear or 
polynomial time if its time complexity Tj^in) is 
bounded by a linear or polynomial function of 
the size n. 



C. Objectives of Complexity Theory 

The objectives of complexity theory are as 
follows: 

(i) Analyze an algorithm X for a problem P 
and evaluate its complexity. When exact evalu- 
ation is hard, the order of magnitude 0(T^(n)) 
is investigated. 

(ii) Construct better algorithms for the prob- 
lem P. This gives a better upper bound to the 
complexity T(n) of the problem. 

(iii) Make clear the limitation of improving 
algorithms for the problem P. This is done by 
establishing a lower bound to the complexity 
T(n) of the problem. 

A common approach to constructing better 
algorithms is to partition the problem into 
smaller parts, apply algorithms to the parts, 
and then combine the solutions for the parts 
into a solution for the whole (divide and con- 
quer, [5]). This approach often yields an effi- 
cient algorithm for the problem, especially 
when the partitioning can be repeated recur- 
sively. For demonstrating the limitations of 
algorithms of a certain class, diagonalization 
[5], determination of information-theoretic 
lower bounds [6], or the oracle method [6] 
are often utilized. 



D. Elementary Results on Time Complexity 

(1) Number of arithmetic operations. A poly- 
nomial of degree n with one variable can be 
evaluated in about 3n/2 arithmetic operations 
if preconditioning on coefficients is allowed. 
Otherwise, 2n—l operations are necessary and 
sufficient for evaluating a polynomial of degree 
n. For computing the product of two square 
matrices of degree n, 0{n^) operations are 
necessary and operations are sufficient. 

The inverse matrix and the value of the deter- 
minant of a square matrix of degree n are 
computed with the same order of operations 
as the product. Discrete Fourier transforma- 
tion of n points is executed in O(nlogn) arith- 
metic operations (fast Fourier transformation). 

(2) Number of comparisons and data trans- 
fers. Rearrangement of n items in increasing 
order is realized in O(nlogn) comparisons and 



data transfers. Selection of the kth largest item 
among n items is done in 0{n) operations. The 
position of an item in a list containing n items 
is found in 0{n) comparisons by linear search, 
in O(logn) comparisons by binary chopping, 
and in 0(1) comparisons on average by the 
thashing method. 

Many other results can be found in [5]-[7]. 

E. NP-CompIeteness 

A problem is easily solved if its complexity is 
0{n) or less. The problem is very hard if its 
complexity is 0(2"), unless the size n is small. 
Between these types of problems there is a 
class of problems, each of which can be solved 
in polynomial time by a Turing machine. 

A tnondeterministic Turing machine is said 
to solve a decision problem (27, L, P) if it can 
detect the case when P(a) = 1 by exercising 
good choices; more precisely, if it can stop 
after a finite number of steps and write the 
value 1 of P{a) on the tape, starting from an 
initial state with the head on the leftmost sym- 
bol of a string a on the tape, provided that 
P{ot)— 1. If P(a) = 0, then the machine may 
never stop. For a string a such that P(a)= 1, 
'NT,^f(cc) and N,S;^^(a) represent the minimum 
number of steps and the minimum length of 
tape used in computing P(a). Nondeterminis- 
tic complexities NT^{n) and NS^(n) for the 
size n are defined in a similar way as before. A 
problem is said to be solvable in NP-time or 
NP-space if it can be solved by a nondeter- 
ministic Turing machine M whose time com- 
plexity NT^(n) or space complexity NSj^f(n) is 
bounded by a polynomial function of size 
n. The class of all problems solvable in NP- 
time is denoted by NP, while that of the prob- 
lems solvable in polynomial time by ordinary 
(deterministic) Turing machines is denoted by 
P. Obviously, the class NP contains the class 
P. Nevertheless, whether NP = P or not re- 
mains one of the biggest unsolved problems. 

A decision problem (27, L, P) is polynomially 
transformable to a problem (27', L', P') if there 
exists a mapping h from 27"^ to 27'"^ satis- 
fying the following conditions: (1) h{L)^L'; (2) 
for every string a in L, P(a) — P'(fe(a)); (3) the 
mapping h is computed by a Turing machine 
in polynomial time. If a problem C is poly- 
nomially transformable to another problem C 
in P or NP, then the problem C is also in P 
or NP. A problem C is said to be NP-hard if 
every problem in NP is polynomially transfor- 
mable to C. It is called NP-complete if it is in 
NP and NP-hard. Many problems that have 
been known empirically to be very hard have 
recently been shown to be NP-complete [8]. 
For instance, integer linear programming is 
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NP-complete. Linear programming has re- 
cently been shown to be in P. The satisfiability 
problem of a conjunctive normal form was 
proven to be NP-complete in 1971 [9]. 

The complement of a decision problem 
{L, L, P) is the problem (iT, L, P'), where P'{a) = 

1 — P(a). If a problem C is in P, then its com- 
plement C is also in P. When C is in NP, it is 
not guaranteed that C is in NP. The class 
of all problems whose complements are in NP 
is denoted by co-NP. The intersection of NP 
and co-NP contains the class P, but it is an 
open problem whether this containment is 
proper or not. The problem PRIME, which 
asks whether a number N is prime or not, the 
size n being the number of digits for represent- 
ing the number N, belongs to both NP and co- 
NP [10]. However, it is not known whether it 
is in P. 

F. Other Topics 

The notion of completeness is introduced in 
many other classes of problems solvable in 
linear space, polynomial space, exponential 
time, etc. Comparisons among these classes 
have been done, although many problems still 
remain open. The fundamental conjecture in 
this field is the nonequality NP#P, the prov- 
ing of which seems to be extremely difficult. 
This conjecture can be relativized in ways such 
that it can be either true or false, so that it is 
not provable in some formal system. 

Some tricks may be used for hard problems. 
For decision problems, some algorithms have 
been proposed for guessing the correct answer 
with high probability [11]. Some algorithms 
give nearly optimal solutions efficiently, in- 
stead of constructing optimal ones over too 
long a time [7]. 

A general theory of computational com- 
plexity would include investigations on rela- 
tions among various complexity measures, 
the complexity hierarchy, and an axiomatic 
approach independent of any machine models 
[ 12 ]. 
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A. Definitions 

A Hausdorff topological space X is called a 
complex manifold (or complex analytic mani- 
fold) of complex dimension n if there are given 
an open covering and a family {</>;}, e/ of 

homeomorphisms of onto open sets in the 
^-dimensional complex affine space C” such 
that in case t/jf! Uj^0, the mapping q)iO(pf^: 
q>j{Ui n Cj)-+(Pi(t/,n [/,) is biholomorphic (i.e., 

(Pi o and its inverse are both tholomorphic 
functions when expressed in terms of coordi- 
nate functions in C"). We call X the underlying 
topological space of this complex manifold, 
and we say that an open covering and 

a family {(Pi}i^i define a complex analytic 
structure (or simply complex structure) on X. 

A complex-valued function / defined on an 
open set 1/ in is called a holomorphic func- 
tion on U if for any i the function f O(p0 on 
(pi(U n t/j) is holomorphic. When we express 
the mapping cp^ as (piip) = {z^{p), ...,z"(p)) on 
Ui in terms of the coordinates in C", each z“ 
is a holomorphic function on t/j. We call 
(z\ . . . , z") a holomorphic local coordinate sys- 
tem on Ui- Given two complex manifolds 
y, X, a mapping y-+X is said to be holo- 
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morphic if for any open set 17 in A" and any 
holomorphic function f on U, focp is holo- 
morphic on ^(t/)cz Y. When a mapping (p: 
is bijective and both (p and (p~'‘ are 
holomorphic, we say Y and X are isomorphic 
by q> as complex manifolds. 

As in the case of tdifferentiable manifolds of 
class C*, we can define concepts such as com- 
plex analytic submanifolds, holomorphic tan- 
gent vectors, holomorphic vector Helds, and 
holomorphic differential forms of degree k (or 
simply holomorphic fe-forms). Meromorphic 
functions on complex manifolds can also be 
defined as in the theory of analytic functions of 
several complex variables (-^ 23 Analytic 
Spaces D). 

Let A be a complex manifold and p a point 
of X. Take a holomorphic local coordinate 
system (z\ with center p (i.e., z*(p) = 0 

for all a). A holomorphic function defined on 
a neighborhood of p can be expressed as a 
holomorphic function in (z\ ... , z"), hence as a 
power series in (z\ ... ,z") absolutely conver- 
gent in a neighborhood of p. If we denote by 
0 — &X the tsheaf of germs of holomorphic 
functions on X, the tstalk Gp of G a.t p is iso- 
morphic to the tlocal ring of convergent power 
series in n variables z', ... ,z". At a point p, 
{djdz^)p, ...,{dldz'')p form a basis of the holo- 
morphic tangent vector space at p. A holomor- 
phic /c-form 0 ) defined on a neighborhood of 
p can be expressed as — a 

... A dz‘\ where is a holomorphic func- 

tion for each (j'l , . . . , 1 ^). 

B. Almost Complex Structures 

Let A be a complex manifold, and let 
{Ui,<Pi}iei be its complex analytic structure, i.e., 
a covering of A by holomorphic local coordi- 
nate systems with (p^ -{z-,..., z”). Express z“ in 
the form z" — x® -I- yf , where xf and y“ 
are the real and imaginary parts of z®, respec- 
tively. Then xf and yf real-valued functions 
on the open set L; of A, and the mapping 
ij/i ; defined by i>;(p) = (x/ (p), y? (p), . . . , 

x"(p),y"(p)) is a homeomorphism of 1/; onto 
an open set of R^". This { t/,-, defines on 
A a tdifferentiable structure of class C“ (in 
fact, a treal analytic structure). Thus a complex 
manifold of complex dimension n admits 
canonically a C'®-structure of real dimension 
2n. For every point p of A there is a real co- 
ordinate system on a neighborhood of p, such 
as (x\y\ ...,x",y”), where (z', ...,z") (z® = x“ 

+ y®, a = 1, • • • , «) forms a holomorphic 

coordinate system in A. The real tangent 
vector space at a point p of A has {(5/5x')p, 
(d/dy^)p, ...,{dldx\, {dldy\} as its basis. 
Define a linear endomorphism Jp by (5/5x®)p 



-^{d/dyX, {d/8y\^ -{d/dxX (a= 1, ... , n); 
then Jp — —I, and Jp does not depend on the 
choice of holomorphic coordinate system at p. 
Considering Jp as a tensor of type (1,1), we 
thus obtain a tensor field J of type (1, 1) of 
class C® on A, which is called the tensor field 
of almost complex structure induced by the 
complex structure of A. 

More generally, when a real differentiable 
manifold A is provided with a tensor field J of 
type (1, 1) of class C® such that = — 1 (con- 
sidering J as a linear transformation of vector 
fields), we say that A admits an almost com- 
plex structure or that A is an almost complex 
manifold. In this case, for contravariant vector 
fields X and y on A, we define a tensor field S 
of type (1, 2) by S(x, y) = - [x, y] -I- [J (x), J(y)] 

— J([J(x),y])— J([x, J(y)]). S is called a Nijen- 
huis tensor. An almost complex structure J 
is induced by a complex analytic structure if 
and only if its Nijenhuis tensor S vanishes 
identically [49]. A differentiable manifold A 
of dimension 2n admits an almost complex 
structure if and only if the structure group 
GL{2n, R) of the bundle of ttangent 2«-frames 
of A can be treduced to GL{n, C). Almost 
complex manifolds are torientable. 

C. Types of Differential Forms 

Let A be a complex manifold, and let (z\ 

. . . , z”) be a holomorphic local coordinate 
system in a neighborhood of a point p with z® 

= X® + y® (a = 1 , . . . , n). On the complexi- 

fied real tangent vector space Tp(A) ® C at p, 
we define djdz“, 8/81“ by 

(a/Sz“), = ( 1/2) { (a/ax“), - ^ {didy\), 

id/dl"), = ( 1/2) { id/dx\ + (d/dyX). 

It is easy to see that the operation 8/8z“ on 
holomorphic functions coincides with that of 
the holomorphic tangent vector 8j8z“ defined 
in Section A. A function / is holomorphic at 
p if and only if {8/8l“)f ==0 (a= 1,2, ...,n). 

Tp(A) ® C is the direct sum of the subspace 
spanned by {8j8z^, ...,8/8z’'} and the subspace 
spanned by (5/5zS ... , 8181"}. Moreover, this 
decomposition is independent of the choice of 
holomorphic coordinate system. Elements of 
the two subspaces are respectively called tan- 
gent vectors of type (1,0) and tangent vectors of 
type (0, 1). Similarly, the complexified space of 
real differentiable 1 -forms can be decomposed 
into the direct sum of two subspaces spanned 
by {dz\ ...,dz”) and {d1^, ...,dz"}, where dz“ 
^dx“ + yj^ dy“ and dl“ = dx“ — y/^ dy“. 
We say that the elements of the former sub- 
space are of type (1,0) and those of the latter 
are of type (0, 1). Thus the space of differential 
forms of arbitrary degree can be written as 
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the direct sum of subspaces of type (r, 5 ). Here 
the subspace of differential forms of type (r,s) 
has as a basis {dz“’- a ... a dz”-' a a ... a 
(1 ^ «!<... <a^^n, This 

decomposition is independent of the choice of 
a local coordinate system, hence the concept of 
type can be defined globally on X. 



D. 5-CohomoIogy 

In the rest of this article we consider only 
complex differential forms. For every differen- 
tial form 0 ) of type (r, s) on X, its texterior 
derivative da> decomposes into a sum of dif- 
ferential forms of types (r + 1, s) and (r, s -t- 1), 
which we denote by dco (or d'cn) and do (or 
d"o), respectively. We have d = d-{-d, idf = 0, 
(5)^ =0, and dd + dd = 0. In terms of a local 
coordinate system, we write 

do—y~^dz'^ A dz®' A ... 

ydz^ 

A dz“' A dz^i A ... A dz^\ 
5cu = ( — lyy -F^dz*' A ... 

^ ^ ydr 

A dz“' A dz^ A dz^> A ... A dz^\ 

for co= /dz“‘ A ... A dz*’- A dz^i a ... /\dz^\ A 
differential k-form o is holomorphic if and 
only if o is of type (k, 0) and do = 0. 

For the operator 8 Dolbeault’s lemma holds: 
Let CO be a differential form on a neighborhood 
L/ of a point p. If do — Q, there is a neighbor- 
hood F of p contained in U and a differen- 
tial form 6 on F such that o>=50 on F. 

Let and be the tsheaf of germs of 
differential forms of type (r, s) and the sheaf of 
germs of holomorphic p-forms on X, respec- 
tively, and let F(X, /I*’’*®') be the set of tsections 
of on X. F(X, is the set of differen- 
tial forms of type (r,s) on X, and S, F(X, 
forms a tcochain complex with respect to d. 
This complex is called the 5-complex or the 
Doibeault complex, and its ^cohomology 
groups are called the 5^-cohomology groups or 
the Doibeault cohomology groups. The qith 
cohomology group is denoted by H^’^{A,d). It 
follows easily from Dolbeault’s lemma that 0-^ 
!)_»__ jg texact sequence of 
sheaves. From this we get Dolbeaulfs theorem: 

H‘‘{X,QP)^HP'‘^iA,8), 

where the left-hand side is the cohomology 
group with coefficient sheaf Q^. 

More generally, for any tcomplex analytic 
(holomorphic) vector bundle E on X, we can 
define the d cohomology groups of the dif- 
ferential forms on X with values in E, and they 
can be shown to be isomorphic to the coho- 
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mology groups with coefficient sheaves of 
germs of holomorphic forms with values in E 
(-" 194 Harmonic Integrals E). 



£. Analytic Coherent Sheaves 

The structure of a complex manifold X is 
determined by the tsheaf & of germs of holo- 
morphic functions on X, and tP is a tcoherent 
sheaf (of rings) (Oka’s theorem). Sheaves of 0- 
modules are called analytic sheaves, and coher- 
ent sheaves of (P-modules are called coherent 
analytic sheaves. Many properties of X can be 
expressed in terms of coherent analytic sheaves 
and their cohomology groups; some exam- 
ples appear later in this article (also ^ 366 
Riemann-Roch Theorems B). 

It is important to know whether an analytic 
sheaf on a complex manifold is coherent. To 
this question, not only does Oka’s theorem 
apply but so does Cartan’s theorem: The tsheaf 
of ideals defined by an analytic subset of a 
complex manifold is coherent. (We say that a 
subset y of X is an analytic subset if it is a 
closed subset and each point of Y has a neigh- 
borhood U such that C7 D 7 is the set of com- 
mon zeros of a finite number of holomorphic 
functions on U.) Also relevant is Grauert’s 
theorem: If 7r:X-» 7 is a ^proper holomorphic 
mapping of complex manifolds (i.e., the inverse 
image of any compact subset of 7 for a holo- 
morphic mapping n is also compact), then for 
any coherent analytic sheaf F on X its tdirect 
images R^n^(F) — are also coher- 

ent [16]. (In fact, this theorem holds for ana- 
lytic spaces; ^ 23 Analytic Spaces.) 

For an analytic coherent sheaf F on a tStein 
manifold X, we have the following fundamen- 
tal theorems of the Stein manifold. Theorem 
A: H^{X, F) generates the stalk F^ (as an (P^- 
module) at every point x of X. Theorem B: 
H^(X, F) = 0 for all q>Q. Conversely, a Stein 
manifold X is characterized by the following 
property: For any coherent analytic sheaf F of 
ideals of 0, H^{X, F) — 0. If a complex manifold 
X is compact and F is a coherent analytic 
sheaf on X, then H‘^(X,F) is a complex vector 
space of finite dimension. If X is an open sub- 
manifold of another complex manifold and its 
closure is compact, then H^{X, F) is finite- 
dimensional for some q that depends on vari- 
ous properties (convexity or concavity) of 
the boundary of X [3]. 

Let £ be a tcomplex analytic (holomor- 
phic) vector bundle on a complex manifold 
X of dimension n, and let E* be the tdual 
vector bundle of E. Then H^(X, C'’(£)) and 
H”-‘i{X,(9’'~P{E*)) (where denotes the 
cohomology group with compact support) are 
dual as topological vector spaces under suit- 
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able conditions. The duality is given by the 
integration on X of the exterior products of 
the differential forms representing the re- 
spective elements of the cohomology groups. 
The duality holds, for example, when dim 
W{X, (9‘’(E))<cc. If X is compact, we need 
not distinguish from H (Serre’s duality 
theorem) [50]. 

F. Compact Complex Manifolds 

On a compact and connected complex mani- 
fold X, there are no holomorphic functions 
except constants (by the ^maximum principle 
of holomorphic functions). The field ^^(2f) of 
meromorphic functions on X is finitely gen- 
erated over the complex number field, and its 
ttranscendence degree d does not exceed the 
complex dimension n o( X. d is said to be the 
algebraic dimension of X and is denoted by 
a{X). For elements of K{X), functional inde- 
pendence and algebraic independence are 
equivalent [52]. When 1, X is a compact 
tRiemann surface and the classical theory of 
algebraic functions shows that K(X) is an 
talgebraic function field of one variable and X 
is a tprojective algebraic variety. When n = 2, 
a{X) — 2, 1, and 0 can all occur. If a(X) = 2, 
is a projective ^algebraic surface (Chow- 
Kodaira theorem). If a{X)=\, there exist an 
talgebraic curve A and a surjective holo- 
morphic mapping (p:X-*A such that /C(A) is 
isomorphic to K(2f) under (p*, and is 

an telliptic curve for all but a finite number of 
xeA. K. Kodaira investigated the structure of 
compact complex surfaces in detail [34,111]. 

On a compact complex manifold X, the free 
Abelian group generated by the set of irre- 
ducible analytic subsets of codimension 1 is 
called the divisor group of X, and an element of 
it is called a divisor of X. For an analytic sub- 
set Y of codimension 1, the sheaf of ideals 3(^) 
defined by T is a sheaf of loeally principal 
ideals of (9. For a divisor D — '^a^Y^, the sheaf 
of locally principal fractional ideals ^{D) = 
n*3(Fa)““ is called the sheaf of ideals of D. 

The set of nonzero coherent sheaves of locally 
principal fractional ideals corresponds bijec- 
tively to the set of divisors. An element / #0 of 
K{X) generates a sheaf of principal fractional 
ideals and therefore defines a divisor, which 
is denoted by {/). The divisor group has 
an ordering defined by D = Z T„>-0 if and 
only if all a^^O, under which it becomes an 
tordered group. For a divisor D, let 

L(D)-(/eX(X)|/^0, (/) + D>0}U{0}. 

Then L{D) is a C-module of finite dimension. 
This submodule of K{X) is easy to handle and 
exhibits various analytic properties of D. The 



+Riemann-Roch theorem is used to calculate 
the dimension of L{D) in terms of other factors 
366 Riemann-Roch Theorems C). Here is 
an example of how L{D) exhibits a property of 
D: We call two divisors D and D' linearly 
equivalent if there is a 0^FeK{X) such that 
(F) — D — D'. This is an equivalence relation 
finer than thomological equivalence. If D and 
D' are linearly equivalent, then L{D) and L{D') 
are isomorphic by the mapping L{D)3f^ 
fFeL(D'); therefore dim L{D) = dim L(D'). 

(The latter equation does not follow from the 
homological equivalence.) A holomorphic 
vector bundle with fiber C and structure group 
C* is called a complex line bundle. For the 
sheaf of ideals 3(15) of a divisor D, we can take 
a suitable open covering {Uj} of X such that 
for each Uj there is an Rj-er((7j,3(I5)) which 
generates 3(^)x for l^^so gjk = RjlRk 

is a holomorphic function nowhere vanishing 
on ly^n Uk- With {gjk} as the system of tcoordi- 
nate transformations, we define the complex 
line bundle determined by D and denote it by 
[D]. It is easy to see that [D] is independent 
of the choice of {Uj} or {Rj}. Moreover, [D] 
is determined only by the linear equivalence 
class of D. If we denote by tP([D]) the sheaf of 
germs of holomorphic sections of [D], the 
mapping H^{X, &{lD']))3(p^{(pj}^f^ 

(p.jRj = (PklRk^L{D) is an isomorphism of these 
modules. On an algebraic variety in a projec- 
tive space, any eomplex line bundle comes 
from a divisor (i.e., it can be expressed in the 
form [D] for some divisor D) [35, 51], but this 
is not necessarily true on general compact 
complex manifolds. However, the importance 
of complex line bundles in the theory of eom- 
plex manifolds lies in the relation L(D)^ 
H^{X,(9{ID'])), which replaces “things with 
poles” with “things holomorphic.” 

Any analytic subvariety X of the projec- 
tive space P" is an algebraic variety (Chow’s 
theorem) [51]. Suppose that X is an analytic 
submanifold of and let 7 be a general 
■•■hyperplane section of X. Then 7 is a divisor 
on A, and the tChern class of the complex line 
bundle [7] corresponds to the canonical 
Hodge metric on X (— ^ 232 Kahler Manifolds 
D). When [7] is represented by the system of 
coordinate transformations {gjk} with respect 
to an open covering {C/;}, we can associate 
with a coherent analytic sheaf F sheaves F{n) 

(« = 0, ± 1, ± 2, . . . ) as follows. Denoting by Fj 
the restriction of F to Uj, we glue Fj and Fk 
together on UjD Uk with the relation fj~fk^fj 
^ Okjfk (where fj e Fj c F, e c F) and obtain 
a sheaf (denoted by F{n)} that is locally iso- 
morphic to F. The following theorems for F{n) 
hold. For each coherent analytic sheaf F there 
exists an integer Hq such that for any « ^ Oq the 
following fundamental theorems A, B of projec- 
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live algebraic varieties hold [17]. Theorem A: 

r(2f, F{n)) generates (as an (55^-module) for 
every xeX. Theorem B: F(n)) = 0 for all 

q>0. This means that if we permit “poles” on 
Y of sufficiently high order, then F has suffi- 
ciently many sections and the higher coho- 
mology groups vanish. 

On a (nonsingular) algebraic variety X in 
P'^ we have the sheaf 0 of germs of holomor- 
phic functions (the tstructure sheaf as a com- 
plex manifold) and the sheaf of germs of 
holomorphic rational functions (the structure 
sheaf as an algebraic variety). Therefore we 
have two kinds of coherent sheaves, coher- 
ent analytic sheaves and coherent algebraic 
sheaves. In fact, the cohomology theories de- 
rived from them are isomorphic. More pre- 
cisely, for any coherent algebraic sheaf f , F 
= F (9 a. coherent analytic sheaf. The 
correspondence F^F gives an equivalence 
between the tcategory of coherent algebraic 
sheaves and that of coherent analytic sheaves, 
thus giving an isomorphism of their cohomol- 
ogy groups. In other words, as far as the prop- 
erties that can be expressed by cohomology- 
theoretic terms of coherent sheaves are con- 
cerned, there is no difference between the 
analytic and algebraic theories of projective 
algebraic varieties [51]. 



G. Deformations of Complex Structures 

The deformation theory of complex structures 
was initiated by Kodaira and Spencer [36] in 
order to explain various phenomena in the 
theory of (compact) complex manifolds. A 
triple {X, n, S) is called a family of compact 
complex manifolds if X and S are connected 
^analytic spaces and ti is a tproper holomor- 
phic mapping of X onto S such that (i) it is 
smooth, i.e., is locally identified with the pro- 
jection S’ X U-*S', where S' and U are open in 
S and C", respectively, and (ii) every fiber = 
7!:''‘(s) of TT is connected. is then a compact 
complex manifold. We sometimes write {f^jses 
instead of {X, n, S). S is called the parameter 
space of the family. Take a point oeS and fix 
it. We say that is a deformation of for any 
seS. There are a neighborhood S' of o in S and 
a diffeomorphism k~^^(S')-*S' x Vq (Kuranishi 
[39]). Thus every V^, seS', determines a tcom- 
plex structure on a fixed differentiable mani- 
fold. We say that the complex structure of V^, 
seS', is a deformation of that of Vq. 

Now (X, :n:, S) = { Kjsgs is said to be complete 
at o if for any family (Y, /r, T) = { W,}tgr with a 
point o'eT and a holomorphic isomorphism 
I'.Wg'-^Vg, there is a neighborhood V of o' in 
T and holomorphic mappings / : T'^S and 
h:ii~^{T')-^X such that (i) fpL = nh, (ii)/(o') = o. 
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and (hi) h = ion Wg.. In this case, if T is suffi- 
ciently small, then h induces a holomorphic 
isomorphism for any teT’. Hence 

{ ^^}ses contains all small deformations of Vq. 
On the other hand, {I^}ses is said to be effec- 
tively parametrized at o if the Kodaira-Spencer 
mapping (Kodaira-Spencer map) pgiTgS-^ 
H^{Vg,&) is injective, where TgS is the Zar- 
iski tangent space to 5 at o and 0 is the sheaf 
of germs of holomorphic vector fields on Vq. 
Here, the linear mapping is defined by 
Po{olds)={{dgjJds\^^^g^}, where dIdseTgS and 
Pjk are the coordinate transformations (zj, s) = 
idjki^k^ s), s), using the smoothness of n. pQ{d/ds) 
is called the infinitesimal deformation to the 
direction d/ds. 

Kuranishi’s fundamental theorem [39, 40] 
states: For any compact complex manifold V, 
there exists a family with a point oeS 

such that (i) it is complete at every point of S, 
(ii) is effectively parametrized at o, and (iii) 

Vg = V. 

The parameter space S in the theorem is 
called the Kuranishi space or the local moduli 
space of V. It is given as the zeros of a holo- 
morphic mapping / : U -^H^{V, 0) with /(O) = 0, 
where H is a neighborhood ofO in H^(F, 0). 
Hence (1) if H^{V,&) — 0, then S=U is nonsin- 
gular (Kodaira, Nirenberg, and Spencer [37]; 

(2) if ff ^ (F, 0) = 0, then S = {0}, i.e., one point 
(e.g., F=P"(C), the complex projective space). 

Kuranishi’s theorem was generalized to a 
compact analytic space F by Grauert [18] and 
A. Douady [8]. 

There are Kuranishi-type theorems in the 
deformation theory of other objects: (1) com- 
pact analytic subvarieties of an analytic space 
(Douady [9]; ^ 23 Analytic Spaces G), (2) 
holomorphic mappings (Y. Miyajima [41]), 

(3) germs of analytic spaces with tisolated 
singularities [7], etc. 

So far, only the local theory of deformations 
has been developed. The global theory is not 
yet in a satisfactory state. Its final purpose is 
to construct moduli spaces and to understand 
them. For a compact differentiable manifold V, 
we denote by M(V) the set of all isomorphism 
classes of complex structures on V. It is diffi- 
cult in general to determine the set M(V). As 
yet unsolved problems are: (1) Is M(S^) non- 
empty? (2) Does M(P"(C)) (n^3) consist of one 
point? (M(P^(C)) is known to consist of one 
point (S. T. Yau; 232 Kahler Manifolds C). 

If M(V) has a reasonable structure (e.g., an 
analytic space structure) and a universal prop- 
erty, then we call it the moduli space (-^ 11 
Algebraic Functions F, 16 Algebraic Varieties 
W). Kodaira [34, III] constructed the moduli 
space of Hopf surfaces. Only a few examples of 
moduli spaces are known. It is to be noted 
that moduli spaces cannot in general exist. 
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One of the reasons for this is that there may 
exist jumpings of structures (Kodaira and 
Spencer [36]). 

As for talgebraic manifolds (varieties), there 
are two known methods for the moduli prob- 
lem: (1) Griffiths’ period mapping (-- 16 Alge- 
braic Varieties V) and (2) Mumford’s geomet- 
ric invariant theory (^16 Algebraic Varieties 
V, W). 

H. Monoidal Transformations 

Let Y be an analytic subspace of a complex 
manifold X, which is defined by a nonzero 
coherent sheaf of ideals 3. Then X has an 
open covering such that for each member U of 
the covering, there are elements (pi,...,cp„e 
r((7, 3) that generate the stalks 3,, at all xe L. 
Let W' be the graph of the holomorphic map- 
ping xi-^((pi(x):<p 2 (^): ...:(p„(x)) from U—Uf) 
Y to P'"^^ and denote by W the closure of W' 
in U X P'”~\ Then W is an analytic space, pos- 
sibly with singularities, which is independent of 
the choice of the generators {(pjj and is deter- 
mined uniquely by U and 3. Therefore all the 
it’s can be glued together to form an analytic 
space X and to determine a holomorphic map- 
ping p:X^X. We call p the monoidal transfor- 
mation (or blowing-up) of X with center 3 or 
with center Y. When 7 is a point, p is also 
called a locally quadratic transformation or a 
<T-process. When Y is an analytic submanifold, 
X also is a manifold, Y—p~^{Y) is a. nonsin- 
gular divisor on X, and p ^(y) = p* ^ for all 
ye y, where k is the codimension of 7. More- 
over, the line bundle [7] restricted to each 
fiber p"^(y) is isomorphic to the Hopf bundle 
L on i.e., the line bundle associated to 
the natural C* bundle Conversely, 

suppose that we are given YSX and a holo- 
morphic mapping P(,: 7-^ 7 with the property 
as above so that, in particular, pJ‘^(y)^P*~' 
and [7] ^ L on for aH y^Y. Then there 
exists a complex manifold containing 7 as a 
submanifold such that X is obtained by the 
monoidal transformation of X with center 7 
as above (S. Nakano [47]). Let /: 7 be a 

proper modification of complex manifolds (or 
analytic spaces). Then there exists a proper 
holomorphic mapping h:Y'-^Y such that 
f~^oh: Y’-^X is holomorphic, where over 
any relatively compact subdomain of 7, h is 
obtained by a finite succession of monoidal 
transformations with nonsingular centers 
(Hironaka). The result is called Chow’s lemma. 

I. Fiber Spaces 

A triple of compact complex manifolds V, W, 
and a surjective holomorphic mapping / : 



V-yW is called a fiber space if general fibers 
/~^(w) are irreducible. In addition, if both V 
and W are algebraic, it is called an algebraic 
fiber space. 

For a compact complex manifold V, letting 
Ky denote the canonical line bundle of V, 
define a subfield 51 of K(L) to be {cOi/o) 2 , 
where (Oi,( 02 eH^{V,Q.) for some m>0}. The 
transcendental degree of Si over C is denoted 
by k{V), which we call the Kodaira dimension 
of V. 

If 51 = 0, then define k{V)— —co. If k{V) = 
dim V, then V is said to be of general type, 
and there exist a projective manifold X and a 
bimeromorphic holomorphic mapping p:X->- 
V. If k(F) ^ 0, there exists a fiber space f\V*-* 
W such that (1) V* is bimeromorphically 
equivalent to V, (2) dim W = k{V), and (3) some 
general fibers /“'(w) satisfy k(/“'(w)) = 0. In 
general, for a fiber space f:V^W,we have 
k(L)^ K(/^^(w))-l-dim W, where /“^(w) is a 
general fiber. Moreover, if it is algebraic, an 
inequality of the form K{V)^K{f~^{w)) + K{W) 
is called the conjecture C„, n being dim V. 

C„ has been verified by K. Ueno, E. Viehweg, 

T. Fujita, and Y. Kawamata [13, 14,32,33,54- 
57] in the following cases: (1) n ^ 3, (2) when 
general fibers are curves, (3) dimw= 1, k{w)— 1, 
(4) k{V)^0, and W is of general types, etc. By 
using the results of case (4), Kawamata proves 
that an algebraic compact complex manifold V 
is birationally equivalent to an Abelian variety 
if and only if k(F) = 0 and the irregularity of V 
equals the dimension of V [32], 



J. Analytic Surfaces 

In what follows, an analytic surface means a 2- 
dimensional compact complex manifold. For 
an analytic surface S, an exceptional curve on 
S and a (relatively) minimal model, etc., are 
defined with respect to bimeromorphic map- 
pings in analogy with the corresponding con- 
cepts for an algebraie surface (^15 Algebraic 
Surfaces). Let C c S be an irreducible curve on 
S. Then there exists a holomorphic mapping (p 
from S onto another surface S' such that (p{C) 
is a point and such that (p induces the isomor- 
phism S — C^S' — (p{C) if and only if C^= —1 
and C is a nonsingular rational curve (Grauert 
[17]). S has a minimal model if and only if S is 
not a ruled surface (Kodaira [34, III]). The 
irregularity q — h^'\ the geometric genus Pg, i- 
genus Pi, etc., are also defined in the same way 
as in the case of algebraic surfaces. Note that, 
in general, The Riemann-Roch 

theorem and M. Noether’s formula are valid 
also for an analytic surface (Atiyah and Singer; 
— ^ 366 Riemann-Roch Theorems C). 
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K. Classification of Surfaces 

The classification of analytic surfaces by the 
aid of their numerical invariants was com- 
pleted by Kodaira and includes as a special 
case Enriques’s classification of algebraic 
surfaces [34,111], By an elliptic surface we 
mean a surface from which there exists a sur- 
jective holomorphic mapping (p onto an alge- 
braic curve A, such that for a, general point p 
on A, (p~^{p) is an irreducible nonsingular 
elliptic curve. If a{S) = 1 or k{S) = 1 , S has a 
unique structure as an elliptic surface. The 
image by (p of the points on S at which q> is 
not of maximal rank is a finite subset {(Jj, 

... ,a,.} of A. Let t, be a local coordinate on 
A around a,- with tj(ai) = 0. We call a singular 
fiber of (p the divisor on S defined by {t, o 
= 0}. The structure and the construction of 
singular fibers of elliptic surfaces have been 
completely determined by Kodaira. By an 
elliptic surface of general type we mean a 
surface with Kodaira dimension 1. If ?c(S') = 2, 

S is projective algebraic and is called a surface 
of general type. If u(S)==0, then there exists 
only a finite number of irreducible curves on S. 
By a Hopf surface we mean a surface whose 
universal covering is — (0,0). If an analytic 
surface S is homeomorphic to x 5^, S is a 
Hopf surface. Let b„(S) be the vth Betti num- 
ber of S. If a(S) = 0, b,{S)= 1, h2(S)-0, and S 
contains a curve, then S is a Hopf surface 
(Kodaira). By a surface of class VIIq we mean 
a minimal surface S with bi(S)= 1. M. Inoue 
[26] constructed three families of surfaces of 
class VIIq with &2 = 0 which contain no curves. 
These are (Me5L(3,Z)) and and 

‘^ivTp.Q.r {NeSL{2, Z), p, q,reZ,teC) which 
have f/ X C as their universal covering sur- 
faces, H being the complex upper half-plane. 
These have a line bundle L such that H®(Q^ ® 
(9{L)) = 0. This property characterizes these 
Inoue surfaces among VIIq surfaces with &2 = 

0 which contain no curves. A couple of new 
surfaces of class VIIq with Z ?2 > 0 were con- 
structed and studied by Inoue, Ma. Kato, 

T. Oda, I. Nakamura, and I. Enoki [10, 27-29, 
46]. Some of these have close connections 
with cusp singularities of Hilbert modular sur- 
faces, torus embeddings, and global spherical 
shells. Enoki’s surface is denoted by S„ ^ i 
(n > 0, 0 < I a I < 1, 1 6 C") and has the following 
properties: (I) class VIIq and ^2 — (2) there 

exists a connected curve D with — 0, (3) 
S„,a,t~D is an affine bundle over an elliptic 
curve. Properties (1) and (2) characterize S„ ^ , 
(Enoki). 

Let Kg denote the canonical line bundle of 
an analytic surface S. An analytic surface S is 
said to be a K3 surface, if Ks is trivial, i.e., if 
there exists a nonvanishing holomorphic 2- 
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form CO and if q — 0. K3 surfaces are simply 
connected and are deformations of a nonsin- 
gular quartic surface in (Kodaira). We shall 
define the period mapping of K3 surfaces. Let 
L be a free Abelian group of rank 22 with the 
pairing < > which is the direct sum of the two 
copies of — £g and three copies of (7 = Zcj -l- 
Ze 2 with <ei,Ci> = <e 2 ,e 2 >= 0 , <Ci,e 2 > = 

<(^ 2 , Cl ) = 1, where — £§ denotes the lattice with 
the pairing corresponding to the Dynkin 
diagram Eg with the opposite signs. H^{S,Z) 
with the intersection form is such a pair, con- 
sisting of S and < >. A marked K3 surface is 
defined to be a pair (S, ij/), where S is a K3 
surface and i}/:H^{S,Z)->L is an isomorphism 
preserving < >. The class [ca] of co is a base of 
and satisfies <[tu],[ta]) =0 and <[cal,[co]) 
>0. Let P(Lc) denote the 21 -dimensional 
projective space associated with Lq = L®z.C- 
& = {{a)£ P(Lc) I = 0, <fl, a> > 0} is an 
open set of the quadric ^^ = {(<a)eP(Lc)K£i,a> 
= 0}. is the set of all Hodge structures on 
Lc- The period of a marked K3 surface (S, t{/) is 
defined to be (i/'cLf^])- Every point of ® is the 
period of some marked Kahler K3 surface. If 
the periods of two marked Kahler K3 surfaees 
(S, \J/) and (S', ij/’), coincide then S is isomorphic 
to S'. 

An analytic surface with Pg — q — 0 and 
Kf^sO is called an Enriques surface, which 
has a K3 surface as its universal covering. An 
Enriques surface is an algebraic elliptic surface. 
An analytic surface with q = \ and Kf ^ ^ = 0 is 
said to be a hyperelliptic surface, which has an 
Abelian surface as an unramified covering. It is 
an algebraic surface and is an elliptic bundle 
over an elliptic curve. The classification of 
minimal surfaces is given in Table 1. The fol- 
lowing relations hold among these invariants: 
Let b^ {b~)he the number of positive (nega- 
tive) eigenvalues (counted with multiplicities) 
of the intersection matrix on H^(S, R) and C; be 
the ith Chern class of S. Then 

{l)b^ ~b~ =j(ci — 2 C 2 ) (Hirzebruch signa- 
ture theorem); 

(2) if bi is even, q = h'^’^ —jb^ and b'*' = 

2pg+ 1 ; 

(3) if hi is odd, ^ = -H 1 =i(hi + 1) and 

h+ =2pg. 

Let c j and C 2 denote the Chern numbers of 
an analytic surface S, i.e., cj = (K|) and C 2 is 
the Euler number of S. Then 3c2^c\ [43], and 
if equality holds, then is ample (Y. Miyaoka) 

(^ 232 Kahler Manifolds C). If S is a surface 
of general type which is minimal, then c 1 > 

0, the bigenus P 2 ^ 2 and the m-genus P„ is 
m{m~ l)c]/2-|- 1 —q + Pg for m^2. For each 
m ^ 5, the mth canonical mapping of S is a bi- 
rational holomorphic mapping onto its image 
(Kodaira, E. Bombieri [5]). In general, c\^2pg 
— 4, and if the canonical mapping, i.e., the 
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rational mapping associated with is bira- 
tional, then cf ^ 3pg — 1. For certain kinds of 
surfaces of general type, E. Horikawa suc- 
ceeded in determining completely the structure 
of surfaces obtained as deformations of a given 
surface. Among others, every minimal surface 
with Pg = 4, q = Q, and = 5 is a deformation of 
a nonsingular quintic surface [22, 23]. 



Table 1. Classification of Minimal Surfaces 



K 


Pg 


Pn 






Structure 


2 




>0 






algebraic surface 
of general type 


1 










elliptic surface 
of general type 




1 


1 


2 


4 


complex torus 




1 


1 


2 


3 


elliptic surface 
with a trivial 
canonical bundle 


0 


0 


1 


1 


2 


hyperelliptic surface 




0 


1 


1 


I 


elliptic surface 
belonging to 
class Vllg 




1 


1 


0 


0 


K3 surface 




0 


1 


0 


0 


Enriques surface 








0 


0 


rational surface 


— 00 


0 


0 


> 1 


2q 


ruled surface 
of genus q 








1 


1 


surface of class 

VIIo 



For a,b>0, let be the set of isomor- 
phism classes of all minimal surfaces with 
and \ —q + Pg = b.T). Gieseker [15] proved the 
existence of N and constructed the mapping 
so that the following hold; (1) /t is 
injective; (2) if f'.X—^Yissi smooth holomor- 
phic mapping such that all h~^{y) represent 
some classes then the set-theoretic map- 
ping induced from /i is a morphism 

of schemes; (3) h{Za^) is a locally closed sub- 
variety of Hence, is a moduli variety 
of surfaces of general type with c\ = a and 1 — 

q + Pg = b. 

For each m with there exists a 

minimal surface of general type with Pg = q = 0 
and Ci=m. 

If there exist a compact analytic surface S 
and a curve C on S such that S — C is biholo- 
morphic to a complex manifold M, then we 
say that S is a compactification of M with 
boundary C. Every compactification of is a 
rational surface (Kodaira, J. Morrow [45]). 
Every compactification of C x (C — {0}) is also 
rational (T. Ueda [53]). However, all compac- 



tifications of (C — {0})^ are rational surfaces, 
certain kinds of Hopf surfaces, or P*-bundles 
over elliptic curves as constructed by Serre 
(Ueda). 
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Complex Multiplication 

A. Classical Theory 

If the ratio coi/m 2 of two periods coj, coj of 
telliptic function / belongs to an imaginary 
tquadratic field K, then there exists an alge- 
braic relation between /(z) and f{h) for any 
A in K, and such an / is said to have complex 
multiplication. This phenomenon for the tsn 
function with modulus ^ — 1 was discovered 
by C. F. Gauss and was applied to the prob- 
lem of dividing a tlemniscate into five arcs of 
equal length. More generally, N. H. Abel 
showed that the special dividing equation of 
an sn function with complex multiplication 
is algebraically solvable. From a number- 
theoretic point of view, L. Kronecker conjec- 
tured that every ^^Abelian extension of an 
imaginary quadratic number field K is deter- 
mined by a transform equation of an elliptic 
function with complex multiplication by a 
number of K (1880). This is an analog of the 
fact, announced by Kronecker and proved by 
H. Weber, that every Abelian extension of the 
rational number field is a subfield of a tcy- 
clotomic field. Kronecker’s work was con- 
tinued by Weber [2], and his conjecture was 
proved by T. Takagi (1903) for K = Q(.y/ — 1), 
by T. Takenouchi (1916) for K = Q{e^’^‘^% and 
by Takagi (1920) for the general case using 
tclass field theory. H. Hasse [5] and M. Suga- 
wara simplified the theory of complex multi- 
plication, and Hasse noticed a relationship 
between complex multiplication and tcon- 
gruence zeta functions. Working from Hasse’s 
idea, M. Deuring constructed the theory of 
complex multiplication purely algebraically 



and determined tHasse’s zeta function of an 
elliptic curve with complex multiplication. 

For the rest of this article, K always stands 
for an imaginary quadratic field. Let L be a 
lattice group on the complex plane C gen- 
erated by CO, , a> 2 , and let z be a complex vari- 
able. Define functions p, Q 2 , as follows: 
p(z, L) = p(z;co,,co 2 ) = z"2-hZ((z-(u)“2- 

!72(^) = 32(<«l^<«2) = 60Za>"^ gi{L.) = 

33 (coi,co 2 )= 140 where the sum 2^ is 
over the elements co of L except 0. Let p' be 
the derivative of p; then z^(l, p(z), p'{z)) is a 
one-to-one correspondence between the points 
on the complex torus C/L and those on the 
telliptic curve E; X(yXl — 4Xl—g2XlX^ — 
g^^o the projective plane. If the quotient 
wJ(X )2 generates K, then the ring of analytic 
endomorphisms of C/L (i.e., the ring of endo- 
morphisms of E) is isomorphic to a subring of 
the ^principal order o of K. In particular, if the 
lattice group L is an tideal of K (for K<=C; 

347 Quadratic Fields), then the ring of endo- 
morphisms coincides with o. The function 
J(i) = J{E) = J(L) = 2" 33^2(^)VA(L) (A(L) = 
g 2 {Ef — llg^iLf) of t = coJco 2 , lmx>0, is 
a tmodular function of level 1, and J{E) is 
called the invariant of the elliptic curve E. 

If E has a complex multiplication, then J{E) 
is an algebraic integer. Then the three main 
theorems of the classical theory of complex 
multiplication can be stated. 

Theorem 1. Let h be the tclass number of K, 
and let a 1 , . . . , a;, be a set of representatives of 
ideal classes of K. Then J(a,), . . . , J(a,,) are 
exactly the conjugates of 7(a,) over K, and 
K(J(a,)) is the maximal unramified Abelian 
extension (the tabsolute class field) of K (^ 59 
Class Field Theory). 

Next we define the function / by 

^glA-^-p(z;L)f K = Q{/^), 

= p(z;L)3, K = Q(e^^‘/^). 

Theorem 2. Let o be the tprincipal order of K, 
m be an integral ideal of K, and a be an arbi- 
trary ideal of K. Choose a number of K such 
that m = {Aeo|A(^6a}. Then K{J{a), a)) is 

the tclass field for the tray modulo m. 

The number ^ in this theorem is obtained by 
a“’m((^) = b, where b is an integral ideal be- 
longing to the ideal class of a"^m. Thus f{^; a) 
is a “special value” of an elliptic function as 
well as of a modular function. 

The tgeneral law of reciprocity, the tprin- 
cipal ideal theorem, and the ramification in the 
class field in theorem 2 can also be described 
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in terms of elliptic functions or elliptic curves. 
In general, the ring of endomorphisms of an 
elliptic curve defined over a field k of charac- 
teristic 0 is either Z or an order of an imagi- 
nary quadratic field. On the other hand, if the 
characteristic of k is not zero, then the ring of 
endomorphisms may be an order in a definite 
^quaternion algebra. 



B. Complex Multiplication of an Abelian 
Variety 

Following Kronecker’s idea, D. Hilbert posed 
in his lecture at Paris (1900) the so-called 12th 
problem: to find an analytic function whose 
special values generate Abelian extensions 
over a given algebraic number field (^196 
Hilbert). E. Hecke constructed unramified 
Abelian extensions of an imaginary biquadra- 
tic field using +Hilbert modular functions 
[10]. After this, there was no notable devel- 
opment concerning the problem until the 
theory of complex multiplication was gen- 
eralized to the case of ^Abelian varieties, which 
was made possible by progress in algebraic 
geometry, in particular A. Weil’s geometric 
theory of Abelian varieties (G. Shimura and 
Y. Taniyama [11]). The following results 
have been obtained: Consider a triple (A,X,t), 
consisting of an Abelian variety A defined over 

C, a tpolarization 3E of A, and a point t of A. 
Call two such triples {A, X, t) and (A',X', t') 
isomorphic if an isomorphism of A onto A' 
maps X onto X' and t onto t'. Then there exists 
one and only one subfield ko of C with the 
following property: In order for (A, X, t) and 

(") to be isomorphic for an automor- 
phism a of C, it is necessary and sufficient that 
a fix all elements of k^. We call /cq the field of 
moduli of (A, X, t). If A is an elliptic curve E 
and if t = 0, then kQ — Q{J{E)). In the higher- 
dimensional case, the field of moduli is gen- 
erated by special values of a ^Siegel modular 
function. 

If F is a totally imaginary number field 
that is a quadratic extension of a totally real 
field Fq of degree n, then there exists a set 
{(py, ... ,(p„] of n different isomorphisms of F 
into C such that fpi — (pj never occurs for i ^ j, 
where the bar denotes complex conjugation. 
We call (F; {(Px}) a CM-type. 

If a is an ideal of F, then L = {{(pi (a), . . . , 
9 „(a))eC"|aea} is a lattice group in the n- 
dimensional complex linear space C", and 
C"/F is analytically isomorphic to an Abelian 
variety A of dimension n in a complex pro- 
jective space. The tendomorphism ring 91(A) 
of A contains a ring that is isomorphic to 
the principal order o of F. Conversely, every 
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Abelian variety of dimension n such that 91(A) 
contains a ring isomorphic to o can be con- 
structed in this way. 

Let M be a normal extension of Q contain- 
ing F. Denote the Galois group of M by G 
and the subgroup of G corresponding to F 
by H, and put S=[J;^(p^H, where (p^ stands 
for a prolongation of the previous cp^ to G. 
Then the following three conditions are equiv- 
alent: (i) 91(A) ^o; (ii) A is simple; (iii) // = 
{yGG|Sy = S}. If H* = {SsG\5S = S}, then we 
can choose tj/„eG such that 
If F* is the subfield of M corresponding to //*, 
then (F*; {(/^^}) is also a CM-type, and F* = 
Q(ZA<PA(°f)l“^^)- Moreover, for an ideal r of 
F*, is an ideal of F. 

Now let be the group of ideals r of F* 
that are relatively prime to the norm N{m) of 
an integral ideal m of F* and for which there 
exists a number of F with 

UM-iO, N(r) = KI^ (^^Umod^m). 
Then is an fideal group modulo m in F*. 

Theorem 3. Assume that 91(A) ^o, and let X be 
an arbitrary polarization of A. Denote a point 
of A with m = {a e 0 1 At = 0} by t, and let k^ be 
the field of moduli of (A, X, t). Then k,„F* is the 
tclass field over F* corresponding to H^. 

The point t in theorem 3 always exists. In 
particular, if m = o, then t = 0. When A is an 
elliptic curve F, F is an imaginary quadratic 
field and we have F* = K, and theorem 3 coin- 
cides essentially with the content of theorems 1 
and 2. If n > 1 , F = F* holds only in special 
cases. 

The theory of complex multiplication of A is 
closely related to the tHasse zeta function of A 
(— " 450 Zeta Functions). 
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A. Algebraic Properties of Complex Numbers 

A complex number is an expression of the form 
a + ib with arbitrary real numbers a and b and 
the imaginary unit i. Writing oc = a + ib, l^ = 
c -I- id, we define a = j] if and only if a = c and 
b = d. As regards algebraic operations with 
complex numbers, we define ol + li = {a + c} + 
i{b + d), a — li — {a — c) + i{b — d), afi = {ac — bd) 

+ i{ad -h be), and for P¥=0, i.e., + d^^ 0, 

a/^ = (ac -I- bd)/(c^ + d^) + i((bc — ad)/(c^ + d^)). 
Then the addition and multiplication thus 
defined obey commutative, associative, and 
distributive laws, and complex numbers form 
a tcommutative field with 0 = 0 -hiO and 1 = 

1 -t- (0 as its zero element of addition and iden- 
tity element of multiplication, respectively. The 
set of all complex numbers is usually denoted 
by C. 



By assigning to each real number a a com- 
plex number a + iO, algebraic operations on 
real numbers are carried into those of the 
corresponding complex numbers. That is to 
say, the field R of all real numbers is mapped 
isomorphically into the field C of all complex 
numbers. By identifying a with a + iO, we are 
taking R as a subfield of C. Also, O-l- il will 
be denoted simply by i. From the previous 
definition of algebraic operations, it follows 
that Furthermore, since a — a + ib 

= (a-HiO)-H(h-hiO)(0-fil), a + ib is not a mere 
symbolic expression but can also be regarded 
as the outcome of algebraic operations on 
a, b, i in C. The real and imaginary parts of a 
complex number a — a + ib are, by definition, 
a and b, denoted by Re a and Im a, respec- 
tively. A complex number that is not a real 
number is sometimes called an imaginary 
number; in particular, a complex number cc 
with Re a = 0 is called a purely imaginary 
number. For each complex number a = a + ib, 
we define its conjugate complex number as 
a — ib, and denote it by a. We then have a + (i 
= a + )3 and ocjS =a^. The mapping a^a is an 
^automorphism of C which leaves each ele- 
ment of R invariant. Also, the following rel- 
ations hold: Rea = (a-f- a)/2 and Ima = (a — 
a)/(2(). 

Regarded as an overfield of R, C is an 
textension field of R of degree 2 obtained by 
the adjunction of i, which is a root of an irre- 
ducible equation -t- 1 =0. The important 
algebraic property of C is that it is talgebra- 
ically closed. Namely, for any polynomial f(x) 
with coefficients in C, the equation /(x) = 0 
possesses at least one root in C ('I'Gauss’s 
fundamental theorem of algebra). 

B. Topology of C 

The absolute value (or modulus) of a complex 
number cc = a + ib, denoted by |a|, is by defi- 
nition \ct\ — ,Ja^ + b^ — y/^. If a is real, then 
the absolute value of a in the sense of complex 
numbers is identical to the one in the sense of 
real numbers. It always holds that |a| ^0 and 
|a| = 0<=>a = 0. It follows further that ]a-|-)5| ^ 
|a| + |jS|, \ccli\ = \a\\p\, and |a| = |a|. 

For each pair of complex numbers a and P, 
define p(a, P) — \cc — P\. Then with p(a, )5) as the 
tdistance function, C satisfies the axioms for a 
tmctric space, and in particular, lim„_ot, oc„ = 
ocq o lim p (a„ , Xq) = 0 <=> lim | a„ — otg | 0 
(lima„ = ao and Iimh„ = ho) (where a„ = a„ + ib„, 
^0 = Uq + ib(j). From this it is easily seen, as in 
the case of the set R of all real numbers, that C 
also becomes a tlocally compact and tcom- 
plete metric space. 

With respect to this topology, the four oper- 
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ations (except for division by zero) are con- 
tinuous: oc„^iXq and imply + 

«o + /^o; and 

“n//^n~*’®o//^ (where in the last ease we assume 
and PqJ^O). Thus C becomes a topo- 
logieal field. Furthermore, the assignment 
a a gives a continuous mapping C ^ C, a 
homeomorphic automorphism of C. 

C. The Complex Plane 

If in a plane to which is assigned rectangular 
coordinate axes a complex number a — a + ib 
is represented by a point {a, b), then the plane 
is called the complex (number) plane (Gauss- 
Argand plane or Gaussian plane) (Fig. 1), and 
the point representing a is called simply the 
point a. The abscissa and ordinate axes are 
called the real and imaginary axes, respec- 
tively. A point (x = a + ib can be represented by 
^polar coordinates r, 0 with the origin and the 
real axis as the pole and the generating line, 
respectively, where r = y/a^ + b^ is the absolute 
value |a| of a and 0 is the argument (or ampli- 
tude), denoted by arga, of a. The argument 
of a is uniquely determined mod27r if o/O 
and is an arbitrary real number if a==0. 




Fig. 1 

The absolute value ja| of a complex number 
a, regarded as a vector from the origin to the 
point a, is the length of this vector. For com- 
plex numbers a and /(, to the sum of the vec- 
tors a and /J corresponds the sum a-f- jf( of the 
complex numbers. A complex number a, in 
terms of its absolute value r and argument 0, 
is expressed as a = r(cos 6 -I- i sin 0), which is 
called the polar form of a. For polar forms 
the following hold: a == r(cos 0 — i sin 0} — 
r{cos{—0) + isin{—6)y, a“' =a/|a|^ = 
r“‘(cos(— 0 )-l-isin( — (?)) (a/ 0 ); and a;a 2 = 
rjr 2 (cos((li + 02 ) + is\n{0^ + 02 )), where 
and argaj = ff for 7 = 1 , 2 . This last relation 
when r, = /*2 = 1 is called De Moivre’s formula. 
The nth roots of unity in C are given by pj — 
cos 2nj /n-\-i sin 2nj/n (j = 0, 1, ... , n — 1) (Fig. 2). 

In the complex plane, the mapping a a 
corresponds to the treflection of the plane in 
the real axis, a -> a -I- to the parallel Trans- 
lation along a vector /J, a (/ 1 0 ) to the 
■^rotation through the angle argjS followed by 



a thomothetic transformation with center 
0 and constant ratio \fi\, and a ^ a~’ to 
the ^inversion with respect to the unit circle 
{a||a| = l}. 

The distance p(a, /() = |a — between a and 
/i in C coincides with their Euclidean distance, 
provided that a and p are regarded as points 
in the Euclidean plane, so that the complex 
plane is tisometric, and accordingly homeo- 
morphic, to the Euclidean plane. 




Fig. 2 



D. The Complex Sphere 

In the rest of this article, P denotes the com- 
plex plane and Z denotes the sphere of radius 
1, with 0, the origin of P, as its center. The 
points N( 0 , 0 , 1 ) and S( 0 , 0 , — 1 ) of 27 will be 
called the north and south pole, respectively 
(Fig. 3), where the 1st and 2nd coordinate axes 
are the real and imaginary axes of P, respec- 
tively, and the 3rd coordinate axis is ortho- 
gonal to P. A straight line from N through a 
point z (a complex number) in P intersects 
27 at a point Z — {x^,X 2 ,X 2 ,) different from 
N, where z = (x, -t-ix 2 )/(l — X 3 ), x, =(z-f-z)/ 
(l+|zp),X2 = (z-z)/(i(l + lz|^)), and Xa = 

(|zp — l)/(|z|^-t- 1). The mapping z^Z is called 
a stereographic projection from N, by means of 
which P and 27 — {N} become ^conformally 
equivalent to each other. Consequently z can 
be represented by a point Z of Z—{N}, and 
27 thus used is called a complex sphere (or 
Riemann sphere). Let us adjoin to the complex 
plane P a new element, denoted by 00 , called 
the point at infinity, which corresponds to the 
only exceptional point N of 27. The topology 
of the complex plane with 00 can be intro- 
duced by the corresponding topology of the 
Riemann sphere. Indeed, the family of all 
the sets (z| |z| > M} u { 00 } for M >0 forms a 
■’'local base around 00. By introducing local 
complex coordinates C = I /2 into the neigh- 
borhoods of 00 , each element of this local base 
is represented as {C| |C| in which the 
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convention C = 0 is adopted for z — co. The 
complex sphere thus defined can be regarded 
as a ^Riemann surface (i.e., a 1 -dimensional 
1'complex manifold). 

The complex plane (complex sphere) whose 
points are represented by a variable z or w is 
called a z-plane or a w-plane (a z-spbere or a w- 
sphere). 




Fig. 3 



E. Linear Fractional Functions 



Given complex numbers a, b, c, and d with 
ad — be / 0, we define a linear fractional func- 
tion (or simply linear function) by 



az + b 
cz + d 



( 1 ) 



As a mapping from the z-sphere into the w- 
sphere, this linear function is called a Mbbius 
transformation (linear fractional or simply 
linear transformation). The usual linear trans- 
formation, i.e., the one with c = 0 in the pre- 
sent case, is sometimes distinguished as an 
entire linear transformation. Since (1) depends 
only on the proportion a:h:c:d, we can 
assume ad — be — 1 without loss of generality. 

The transformation (1) is tholomorphic and 
■•■univalent on the whole z-sphere with only 
one exceptional tpole at —die {co if c = 0) of 
order 1, and the inverse of (I) is also a linear 
fractional function. The set of all linear trans- 
formations forms a tgroup with composition 
of transformations as the group operation. 

One of its subgroups is the ^modular group. 

Linear transformations carry any circle 
of the complex plane (or of the Riemann 
sphere) into a circle of the same plane (or of the 
same sphere) if we adopt the convention that 
straight lines are a special kind of circle. (In 
the case of a Riemann sphere, no such conven- 
tion is necessary.) Given on a plane a circle 
with center o and radius r and two points p 
and p' on a half-line issuing from o satisfying 
op-op' = r^, the points p and p' are called 
symmetric points (or reflection points) with 
respect to the circle. The transformation 
is called the inversion with respect to this cir- 
cle. In the complex plane, let z and z' be sym- 
metric points with respect to a circle C. Sup- 
pose that by a linear transformation, z, z' and 



C are carried to points w, w' and a circle D, 
respeetively; then w and w' become symmetric 
with respect to the circle D (principle of reflec- 
tion). Thus symmetricity is invariant under 
linear transformations. Also the tanharmonic 
ratio of any four points Zi,z 2 , z^, and z^, 

(z 1 , Z2 ; Z3 , Z4) = (z 1 - Z3)/(z , - Z4) : (Z2 - Z3)/ 

(z2 — Z4), is invariant under a linear transfor- 
mation; i.e., (Zi,Z2;Z3,Z4) = (Wj,VV2; W3, W4) 
holds, where Wj is the image of Zj under a linear 
transformation (y = 1 , 2, 3, 4). 



F. Normal Forms of Linear Transformations 

There exist fixed points of the transformation 
(1) on the Riemann sphere, i.e., points satisfy- 
ing z = {az + b)j{ez + d). The number of fixed 
points is 2 or 1, except when w = z. If the trans- 
formation has two fixed points, they will be 
denoted here by p and q. The natural conven- 
tion p — q is adopted if the transformation has 
one fixed point. If c = 0, then p or q is 00, and 
furthermore if c = a — d = 0, then p and q are 
both 00. 

For unequal finite p and q, (1) can be re- 
written in the following normal form: 



■P 

— , a = ^ 1 , 

■q a — eq 



in which, according as arga = 0, |a| = 1, or 
otherwise, (1) is called a hyperbolic (Fig. 4), 
elliptic (Fig. 5), or loxodromic transformation, 
respectively. This classification can be applied 
also for finite p and infinite q, i.e., to the trans- 
formation w — p — a(z — p). Furthermore, for 
p = q 00,(1) is rewritten in the following 
form: 



1 






a — ep 



Fig. 4 

Hyperbolic transformation. 



Fig. 5 

Elliptic transformation. 
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In this case (1) is called a parabolic transfor- 
mation (Fig. 6). For p = q~co, i.e., if w = z + /I, 

(1) is also called parabolic. We can easily de- 
termine to which class (1) belongs from the 
discriminant D = {a + df -4 oi the quadratic 
equation cz^ — {a — d)z — b~0 obtained from 
z = (az + h)j{cz + d) with ad — bc=\ by multi- 
plying both sides hy cz + d. If a + d is real, 
then according as D>0, <0, or =0, (1) is 
hyperbolic, elliptic, or parabolic, respectively, 
and if a -I- d is not real, then the transforma- 
tion is loxodromic. 



Fig. 6 

Parabolic transformation. 




Let D and D' be two arbitrary circular disks. 
Then there always exists a linear transforma- 
tion which gives a one-to-one ^conformal 
mapping from D onto D'. Conversely, any 
mapping with this property is given only by 
linear transformations (provided that the half- 
plane having a straight line together with the 
point at infinity as its boundary is regarded as 
a closed disk), which are uniquely determined 
by giving three points a, b, c from the bound- 
ary of D and as their corresponding points, 
three arbitrary points a', b', c' from the bound- 
ary of D'. 



G. The Poincare Metric 

Since conformal mappings from the domain 
|z| < 1 onto |w| < 1 are given by the trans- 
formations w = e(z-Z(j)/(l —Zqz) (|£| = 1, 

|zo| < 1) (-^ Appendix A, Table 13), for corre- 
sponding z and w it holds that 

\dw\ \dz\ 

Th^“Th7F' 

|dz|/(l — \z\^) is called Poincare’s differential 
invariant With a metric having ds = |dz|/(l — 
|z|^) as its dine element, the unit disk |z| < 1 
becomes a tnon-Euclidean space in the sense 
of Lobachevskii, and the metric is called the 
Poincare metric. Furthermore, since the trans- 
formations (2) leave the length of curves invar- 
iant, they can be regarded as ^motions in this 
space, where the tgeodesic through two points 
z, and Z2 is the circular arc orthogonal to the 
unit circle. If we denote the intersections of the 
arc with the unit circle by Zj and z^., then the 
tnon-Euclidean distance between two points 



z, and Z2 along the geodesic is given by 
(l/2)log(Z],Z2;z3,Z4), provided that the points 
Z4, ^1 , Z 2 , Z3 are arranged on the arc in this 
order (^ 285 Non-Euclidean Geometry). 
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Computers 

A. History 

Since the beginning of civilization, people 
have utilized tools for aiding computation. In 
Japan, bamboo computing rods were used in 
the 7th century; before the close of the 16th 
century, the abacus was imported from China 
(^ 230 Japanese Mathematics (Wasan)). The 
first calculator capable of performing the four 
arithmetic operations automatically was de- 
signed by W. Schickard of Tubingen Univer- 
sity (1623). After that, B. Pascal independently 
made his famous adding machine; this was 
improved by G. W. Leibniz so that it was able 
to execute multiplications and divisions. 

Most calculators manufactored today are 
electronic; however, many of these machines 
are not “programmable” and hence require at 
every step of computation a manual operation 
for specifying the machine operations to be 
executed. On the other hand, modern auto- 
matic computers can execute automatically a 
sequence of operations according to a given 
program without any manual intervention. 

The automatic computer was conceived by 
the English mathematician C. Babbage in the 
19th century, but mechanical engineering at 
that time was not advanced enough to allow 
the construction of such a computer. His idea 
was first realized by the relay computers Z3 of 
K. Zuse (1941) and Mark I of Harvard Uni- 
versity (1944). In 1947, the first electronic 
computer, ENIAC, appeared, in which vacuum 
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tubes were utilized instead of mechanical com- 
ponents. Since then, the capabilities of com- 
puters have increased rapidly and it can be 
said that we now live in the “computer age.” 

B. Principles of Modern Computers 

Information processing in a computer is based 
on communication among its constituents by 
electric signals. In digital computers, informa- 
tion is encoded as a sequence of binary num- 
bers 0, 1, whereas continuous values are al- 
lowed in I'analog computers. The minimum 
quantity of information in digital computers 
is therefore a binary digit, called a bit. Since 
continuous values fluctuate on account of 
electric noise in the circuits, digital computers 
have the advantage of maintaining high preci- 
sion during computation. 

The binary numbers 0, 1 are represented in 
practice by two distinct electric signals: two 
distinct voltages, two distinct phases of alter- 
nating current, the existence or nonexistence 
of a pulse, etc. A system of circuits is called 
synchronous when it contains a clock, a gener- 
ator of periodic pulses, which synchronizes 
the transmission and transformation of in- 
formation. In an asynchronous system, circuits 
execute each step of information processing 
independently and advance to the next step 
after verifying the termination of the preceding 
step. 

In ENIAC, a program was inserted into the 
computer by carefully connecting many con- 
trol lines on plugboards. Since this caused 
much trouble in inserting the different pro- 
grams needed for different jobs, J. von Neu- 
mann proposed encoding programs in se- 
quences of binary digits and storing these 
programs in the memory unit of the computer. 
His principle, the stored program principle, was 
realized by EDSAC (1949) and has since been 
widely used. 

An automatic computer in general con- 
sists of the following five units: the arithme- 
tic unit, memory, control, input, and output 
(Fig. I). These units are interconnected by 
wires that exchange information in the eourse 
of computation. 




Fig. 1 

Construction of electronic computers. 



The arithmetic unit consists mainly of sev- 
eral memory registers and operational circuits 
associated with them. Each register stores a 
binary number of n bits (usually 16^n^64). 
An important building block of the opera- 
tional circuit is the basic adder of 1-bit num- 
bers. A parallel adder of n-bit numbers can 
be obtained by connecting n copies of the 
basic adder. Alternatively, a sequential adder 
can be composed of a single basic adder which 
is utilized repeatedly to sum up bit by bit two 
binary numbers from the lowest bit. Subtrac- 
tion is usually carried out by adding the com- 
plement of the subtrahend. Multiplication is 
realized by shifting the multiplicand to the 
left and adding it to the intermediate sum; 
this addition may be omitted, depending on 
the relevent bit of the multiplier. In division, 
we shift the dividend to the left and if possible 
subtract the divisor from it. By recording at 
each step whether or not the subtraction is 
possible, we obtain the quotient. 

The memory unit stores the instructions 
and given data as well as the necessary data 
obtained in the course of computation. It is 
divided into many registers, each of which 
contains a number and is referred to by means 
of a serial number called an address. At present, 
high-speed memory units are usually made 
up of magnetic cores or integrated circuits. As 
auxiliary large-scale memory, there are also 
magnetic drums, disks, and magnetic tapes, 
etc. Other elements under investigation are 
extreme-low-temperature elements, chemical 
elements, optical elements, etc. 

The control unit repeats the following oper- 
ations consecutively: (1) takes an instruction 
from the memory location indicated by the 
sequential control counter, (2) gets a data 
word from the memory according to the ad- 
dress part of the instruction, (3) decodes the 
function part of the instruction and sends 
control signals to appropriate circuits, and (4) 
increases the content of the sequential control 
counter by 1 and, after receiving end signals 
from the arithmetic unit, returns to step (1). 

For these purposes the control unit contains a 
counter, a decoder, an encoder to send control 
signals, and a register to store the instruction 
to be executed. 

The arithmetic, control, and memory units 
form the central processor of the computer. In 
contrast to the central processor, the input and 
output units are called peripheral devices. An 
input device receives the necessary information 
(a program and data); a card reader reads 
punched cards, and a teletypewriter sends 
signals directly into the computer. An output 
device presents the results obtained by the 
computer. The results are usually printed by a 
teletypewriter or line printer. Magnetic tape 
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units can be considered as auxiliary input- 
output devices, since tape reels are removable. 
The tape units can accept prepared input data 
as well as record the results, the output. 

These devices are operated according to 
given input-output instructions under the 
control of the central processor. However, 
since every input-output device contains me- 
chanical components and is extremely slow in 
comparison with the central processor, a large- 
scale computer is often accompanied by sat- 
ellite computers which undertake the control 
of input-output devices. 

C. Instructions and Programming 

In stored-program operation, a computer 
performs a sequence of calculations according 
to given instructions. There are various types 
of instructions, but those most frequently used 
are single-address instructions, each of which 
contains a single-address part designating an 
operand. 

A program is a finite sequence of instruc- 
tions arranged suitably for the required com- 
putation. Programming, or making a program, 
is therefore the task of decomposing the re- 
quired computation into elementary steps each 
of which corresponds to an instruction. 

Every instruction is represented in a com- 
puter by a number, a numeric code, which is 
determined in a definite way by the construc- 
tion of the control unit. Before starting com- 
putation, instructions thus encoded are stored 
in the memory. In this sense, a program is a 
sequence of numbers. This sequence is called a 
machine-language program. 

A program is usually divided into several 
blocks, called subprograms or subroutines. 
Some subroutines are made in advance, espe- 
cially those for frequently required jobs such as 
evaluating elementary functions and manipulat- 
ing input-output devices, etc. The system of 
these ready-made programs is called software, 
in contrast to the hardware (i.e., mechanico- 
electronic equipment) of the computer. Quite 
often basic routines are microprogrammed, i.e., 
written in a simple code and stored in a fast 
read-only memory (ROM). In this case the set 
of built-in programs is called firmware. The 
handliness of a computer depends mainly on 
close matching of the software and hardware. 

Programs are usually written in certain 
forms called external languages which are easy 
to master. A problem-oriented language is an 
advanced external language in which ordinary 
arithmetic expressions are available with slight 
modifications. Programs written in these lan- 
guages are translated into machine languages 
by program input routines. The translator for 
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a problem-oriented language is called the 

compiler. 

A compiler accepts several macroinstruc- 
tions. Moreover, it is equipped with the fol- 
lowing facilities: (i) the ability to translate 
arithmetic expressions into machine language; 
(ii) the ability to generate linkages to various 
ready-made subroutines according to certain 
simple indications; (iii) the ability to auto- 
matically allocate programs, subroutines, and 
data in the memory; (iv) the ability to check 
automatically the syntactic correctness of 
programs. Thus it accepts an external form 
such as 

if x^O then printreal (SQRT(x)) 
else printstring (‘negative’). 

Another important translator is the as- 
sembler, which translates mnemonic codes of 
instructions {add for addition, etc.) into their 
numeric codes according to a given table. It 
allows us to utilize symbols for specifying 
addresses. It also converts decimal numbers 
into binary and generates certain segments of 
the program from rather simple indications. 
Compilers and assemblers are important con- 
stituents of software. 

The large-scale high-speed computers that 
have recently appeared have made software 
systems inevitably more complex. There is 
now software, so-called monitors or operating 
systems, that supervises the uninterrupted 
processing of many programs. Some operating 
systems coordinate several assemblers and 
compilers so that several languages can be 
mixed in writing a program. Examples of 
other important software are mathematical 
(numerical) software and database manage- 
ment systems. 



D. Mathematical Models of Computers 

An operational circuit in the arithmetic unit is 
usually constructed from basic elements (log- 
ical gates), each of which performs a certain 
operation on the binary signals. Thus the 
construction of a circuit from basic elements 
is represented by a eomposition of a logical 
function defined over the set (0, 1} from a 
given set of basic functions. Post [3] consid- 
ered functional composition without feed- 
back loops and established a general crite- 
rion for a given set of logical functions to be 
complete, i.e., to be capable of generating the 
whole set of logical functions. Strictly, how- 
ever, a logical gate takes a definite time delay 
to perform its operation. Therefore Kudryav- 
tsev [4] proposed as a model of a logical gate 
a pair {f d) of a logical function / and a non- 
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negative integer d: f represents the operation 
of the gate and d represents its delay. He de- 
fined the feedback-free composition of such 
functions with delays and gave a completeness 
criterion for a set of logical functions with 
delays. His study has been extended in various 
directions by Loomis [5], Nozaki [6], and 
Rosenberg [7]. 

When a circuit contains memory elements 
or feedback loops, its function is suitably 
represented by an tautomaton. Hence the 
design and analysis of circuits with memory 
elements, state-minimization, and equivalence 
checking [8], decomposition into simpler 
components [9, 10], verification of complete- 
ness [11], etc., can be studied in terms of auto- 
mata (^31 Automata). A computer itself can 
be considered to be a finite automaton, since 
it has a finite number of memory elements and 
its behavior is completely determined by the 
content of the memory (its internal state) and 
the inputs. However, since a modern computer 
has an enormous memory and contains re- 
placeable parts, such as magnetic tapes, it is 
more adequately represented by an infinite 
model, such as a tTuring machine. 

Turing machines are capable of simulating 
many intellectual activities governed by for- 
mal rules. It is believed that any well-defined 
finite algorithm can be simulated by a Turing 
machine (Church’s thesis.) Thus any compu- 
ter can be simulated by a Turing machine, 
and hence it is unable to resolve those deci- 
sion problems which are unsol vable for Tur- 
ing machines. For instance, no program can 
decide in finite steps whether a given pro- 
gram written in a computer language, say 
FORTRAN or PASCAL, eventually stops or 
not. On the other hand, a modern computer 
can simulate a universal Turing machine pro- 
vided that its memory can be extended un- 
boundedly by supplying magnetic tapes. 

Hence any well-defined finite algorithm can 
be simulated by a computer unless the limi- 
tation of memory capacity hampers its 
accomplishment. 



E. Mathematical Theory of Programming 
Languages 

Along with the development of software, pro- 
gramming techniques have gradually accumu- 
lated. For instance, we now have an almost 
satisfactory method of translating arithmetic 
expressions into machine language. However, 
we still lack a general theory to cover effec- 
tively a wide array of programming problems. 
The design of an adequate metalanguage 



is still an important problem, if the word 
“adequate” implies complete description of 
syntax and semantics of problem-oriented 
languages. An interesting problem is to define 
and suitably classify grammars with respect to 
their capabilities of forming languages. Such 
research is an important branch of mathema- 
tical linguistics. N. Chomsky [12] has inves- 
tigated this problem, starting from research on 
natural languages, and has given formal defini- 
tions of the grammars (—^31 Automata D). 

tContext-free grammars (type-2 languages) 
play an important role in the theory of soft- 
ware as well as in Chomsky’s syntax, since 
they are powerful enough to describe the pa- 
renthesis structure and simple and easy to 
manipulate. Some variants of context-free 
grammar have been proposed for attaining 
high efficiency in the automatic appraisal of 
programs [13, 14] or for increasing the pro- 
grams’ generative power [15]. 

F. Branches of Information Science 

Many fields related to computers now make 
up the information sciences or informatiques. 
Important mathematical theories born or 
developed in the information sciences are: 

(1) Design and analysis of hardware devices: 
tBoolean algebra [16], switching theory [17], 
and theory of tautomata [18, 19]. 

(2) Design and analysis of programming lan- 
guage; theory of formal languages [14,21]. 

(3) Design and analysis of algorithms; tnumer- 
ical analysis, theory of tcomplexity of compu- 
tation, theory of Mata processing. 

(4) Mathematical foundation of programming; 
logical verification of correctness and equiva- 
lence of programs [22], trecursive function 
theory, tdecision problems. 

Applications of the information sciences are 
found in diversified fields, such as statistics, 
operations research, mathematical psychology, 
econometrics, jurimetrics, and behaviorimetrics. 
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Conformal Geometry 

A. Mobius Geometry 

We represent an n-dimensional sphere S” as 
the tquadric hypersurface S":xl + xl + ...+ 
— 2xoXgj^, — 0 in an (« -I- l)-dimensional real 
^projective space P"^\ where the (xj are 
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Conformal Geometry 

thomogeneous coordinates in P"^f We denote 
by M(n) the group of all tprojective transfor- 
mations of that leave S" invariant. Then 
the transformation group M(n) acts on S". The 
pair (S", M{n)) is called the conformal geom- 
etry or Mobius geometry. We call S” an n- 
dimensional conformal space, a transformation 
belonging to M{n) a Mobius transformation, and 
M{n) the Mobius transformation group. Every 
point of the projective space corresponds 

to a hypersphere on S”. For example, if a point 
A lies outside of S", then the intersection of S” 
and the tpolar hyperplane of A with respect to 
S" is an {n— l)-dimensional sphere and 
the point A corresponds to this real hyper- 
sphere Similarly, if a point A lies on S", it 
corresponds to a point hypersphere, and if a 
point A lies inside of S", then A corresponds 
to an imaginary hypersphere. We sometimes 
identify the point A with the corresponding 
hypersphere. For any two points A = {aJ and 
B = {bJ, we put AB — a^bi +^nb„ 

—(a^b^ + a^bQ) and call it the inner product of 
the two hyperspheres A and B. The angle 9 
between two intersecting real hyperspheres A 
and B is defined by cos 9 = AB/{^/^ ■ 

This angle is invariant under the Mobius 
transformation. 

In the projective space we take a 
tframe {Aq, Ai,...,A„, A^) that satisfies the 
conditions 

Al = AQAj^AiA^ = Ai = (), A^A^=-\, 

U=l,2, ...,n, 

where {gij) is a positive definite matrix. We see 
that Aq and A^^ are points on S" and each ^4, is 
a real hypersphere passing through these two 
points. Every hypersphere X of S" can be 
written as a linear combination of the Ai'.X — 
UqAq + UiA^ + ... + u„A„ + u^A^. That is, X 
is represented by tprojective coordinates (uj 
with respect to the above frame. We call these 
homogeneous coordinates (uj (n + 2)- 
hyperspherical coordinates of the hypersphere 
X. If we use these coordinates, the inner prod- 
uct of two hyperspheres X = {uj and Y = (r J is 
given by j=l0^JMirj-(uol^ac + Woo^’o)• 

The Mobius transformation group M{n) is a 
topological group with two tconnected com- 
ponents. If we denote by M(,(«) the maximal 
connected subgroup of M{n) and by H the 
subgroup of M(n) that leaves invariant a real * 
hypersphere of S", then the set E of all real 
hyperspheres of S" can be identified with the 
^homogeneous space Mo(«)/H. The group H 
also consists of two connected components. 

If we denote by Hq the maximal connected 
subgroup of H, the homogeneous space E = 
MQ{n)jHQ is a two-fold tcovering space of £. 
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For each real hyper sphere /I g£, an element 
AeE over A is called an oriented real hyper- 
sphere. The Mobius transformation group con- 
tains as its subgroups ones that are isomor- 
phic to the group of ^congruent transforma- 
tions of a Euclidean space and ones that are 
isomorphic to the group of congruent trans- 
formations of a non-Euclidean space. That is, 
the subgroup of M(n) that leaves a point 
hypersphere invariant is isomorphic to the 
group generated by congruent transformations 
and thomotheties of the Euclidean space E". 
The subgroup of tindex 2 (the factor group by 
a ^cyclic subgroup of border 2) of the subgroup 
of M(n) that leaves invariant a real (imaginary) 
hypersphere is isomorphic to the group of 
congruent transformation of n-dimensional 
hyperbolic (elliptic) tnon-Euclidean space. 

In the n-dimensional Euclidean space 
consider a hypersphere of radius r with center 
0. For each point P of F", mark a point Q on 
the tray OP such that OP ■ OQ = r^. We call 
the point transformation that sends P to g an 
inversion with respect to the hypersphere. A 
^symmetry with respect to a hyperplane, con- 
sidered as an extreme case of inversions, is 
also called an inversion. We adjoin a point at 
infinity to the space E” to construct an n- 
dimensional sphere S". Each inversion can be 
extended to a transformation of S", which we 
also call an inversion. Then each Mobius 
transformation is generated by a finite number 
of inversions. By a Mobius transformation 
of S", each hypersphere is transformed to a 
hypersphere. Any angle between two curves 
that intersect at a point of £" is invariant 
under Mobius transformations. Conversely, 
if n ^ 3, each local transformation of £" that 
leaves invariant the angle of each pair of inter- 
secting curves is a trestriction of a Mobius 
transformation. However, for n = 2 this is not 
true in general; any transformation that leaves 
angles invariant is called a tconformal map- 
ping. Any Mobius transformation z^w on 
the tcomplex sphere = C U { oo } can be ex- 

pressed by an equation of the form w={oiz + p)/ 
(yz -I- ^) or w = (az -I- /^)/(yz -|- ^), where a, jS, y, 
and S are complex numbers such that — 

0 and z denotes the ^complex conjugate of z. 

B. Laguerre Geometry 

Let F be an oriented smooth curve in a Euclid- 
ean plane £^. The tangent line at a point p 
of F is supplied with an orientation that is 
induced by the orientation of the curve F in 
an obvious manner. The oriented line I thus 
obtained is called the oriented tangent line 
of F at p. 



Let S be the set of oriented lines in £^ and T 
be a given set of oriented smooth curves in £^. 
Consider a bijection y of the direct product S 
xT to itself satisfying the following condition: 
If / is an oriented tangent line of a curve F 
belonging to T and y sends (/, F) to (/', F'), then 
/' is an oriented tangent of the curve F'. The 
set of such bijections forms a group G. Sup- 
pose that we have leS for which there exist 
two curves F, and F2 in T such that I is a 
common tangent line of F^ and £2 at Pi and 
P2, respectively. Further suppose that the 
element yeG sends (/, F,) to (/', F)') (i =1,2). 

Then the element y of G is called an equilong 
transformation if the following conditions (i) 
and (ii) are satisfied: (i) I' is tangent to F/ at 
points Pi (i= 1,2), (ii) the distance between p, 
and P2 is equal to the distance between p\ and 
p'2. The set of equilong transformations forms 
a subgroup H of G. In particular, if T is the set 
of oriented circles (including point circles), the 
elements of H are called Laguerre transfor- 
mations. In an obvious manner, we can divide 
the set of oriented circles into two classes, 
those with “positive” and those with “nega- 
tive” orientations. With an oriented circle F 
we associate the pair ^(F) = (P, r), where P is 
the origin of the circle and r is a real number 
whose absolute value is equal to the radius of 
the circle and whose signature coincides with 
that of the orientation of F. The mapping 
(p:T-yE^ X E^ thus defined is called Lie’s 
minimal projection. An example of a Laguerre 
transformation, called a dilatation, is given by 
a bijection 7 of S x T to itself satisfying the 
following condition: Let y{l, F) = (f, F'); then I 
is parallel to I, the distance between / and I' is 
a given number, and <p(F') = (P,r-l-c), where 
(p(F) = (F,r) and c is a given constant. We note 
here that the action of a Laguerre transfor- 
mation 7 :(/, F)^(/', F') is determined by its 
action on S (7 acts on S by 7(/) = /'). Another 
example of a Laguerre transformation 7, called 
a Laguerre inversion, is determined by means 
of a given oriented circle O and a line p that is 
not tangent to 0; given an oriented line /, its 
image /' under the action of 7 is determined 
as follows (here we describe the case where / 
is not parallel to p). There exists a uniquely 
determined oriented tangent line g of the circle 
0 parallel to /. Now we have a uniquely deter- 
mined oriented tangent line g' of O that passes 
through the point of intersection of the lines g 
and p such that g' # g. The image I' is the line 
parallel to g' and passing through the point of 
intersection of the lines / and p (Fig. 1). Each 
Laguerre transformation can be written as a 
product of a finite number of Laguerre inver- 
sions. We denote the group of Laguerre trans- 
formations by L. The pair (L, S) is, by defini- 
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tion, a model of Laguerre geometry. Notions 
such as Laguerre inversions, dilatations, and 
transformations can be generalized to cases of 
higher dimension by utilizing oriented hyper- 
spheres and oriented hyperplanes. 




C. Sphere Geometry 

Let S be the set of oriented circles (including 
point circles and oriented lines) in the Eucli- 
dean plane E^. Two oriented circles Cj and 
C 2 are said to be in contact with each other 
if they have a point p and an oriented tan- 
gent line passing through p in common. (An 
oriented circle C and an oriented line I are in 
contact with each other if and only if I is an 
oriented tangent line of C.) In this case, we call 
the pair (Cj , C 2 ) a contact pair. A bijection y of 
S to itself is called a Lie transformation if it 
sends any contact pair to another. (The Lie 
transformation is a special case of the tcontact 
transformations.) An inversion with respect to 
a circle determines in an obvious manner a Lie 
transformation, which is also called an inver- 
sion. We denote the group of Lie transforma- 
tions by G. Any element y of G can be written 
as the product of a finite number of inversions 
and Laguerre inversions, and G contains the 
group of Mobius transformations and the 
group of Laguerre transformations as sub- 
groups. The pair (G, S) is called a model of 
circle geometry. The notion of circle geometry 
can be generalized to that of hypersphere geom- 
etry for the case of higher dimensions. Speci- 
fically, when we replace by E^ and circles 
by spheres, we have sphere geometry. 

Let L be a complex 3-dimensionaI space, 
and let M, N be the sets of oriented lines and 
oriented spheres in V, respectively. Then M, N 
have the structure of 4-dimensional complex 
manifolds that are homeomorphic to each 
other. The homeomorphism is given by the 
Lie line-sphere transformation that induces a 
bijection from the set of pairs of intersecting 
oriented lines onto the set of pairs of oriented 
spheres that are in contact with each other. 
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D. Group-Theoretic Considerations 

Here we discuss the preceding three kinds of 
geometries from the group-theoretic point of 
view (^ 137 Erlangen Program). Let us con- 
sider the quadratic form Q defined by Q{x) — 
-xl + xl + xl + xl-xl'm & real projective 
space where x = (xq,Xi,X 2 ,X 3 ,X 4 ) are 
homogeneous coordinates. We denote by G 
the set of all projective transformations of P'*' 
that leave Q invariant. The group G consists of 
the set of matrices A of order 5 such that det A 
= 1 and Q{Ax) = Q{x) holds for all x in P“^; we 
denote by the set of all points x in P"*^ that 
satisfy Q(x) = 0. Then G acts transitively on L^. 
Hence if we denote by the set of all ele- 
ments of G that fix a point a in L^, for example 
a = (— 1, 1,0,0,0), we may assume that = 
G/Hg. The circle geometry that belongs to 
the group of Lie transformations of circles is 
exactly the geometry of the homogeneous 
space G/H^. The group Gj of all transforma- 
tions of G that leave the hyperplane X 4 = 0 
invariant acts transitively on fl {x 4 = 0} (G* 
is isomorphic to M(2)). The geometry of the 
homogeneous space G^/Ha H Gj, is plane con- 
formal geometry. Next, the group G^ of all 
transformations of G that leave invariant Xq 
-I- Xi = 0 coincides with and acts transitively 

on L^fl{xo-l-Xi =0}. The geometry of the 
homogeneous space L^D{xo-l-Xi = 0} (on 
which G„ acts transitively) is plane Laguerre 
geometry. In this sense, the circle geometry 
that belongs to the group of Lie transforma- 
tions contains the other circle geometries as 
subgeometries. 

We now describe how plane Laguerre geom- 
etry can be realized as the geometry of the 
space L^n{xo + X] =0}. Let E^ be a plane in a 
Euclidean space E^. We fix a Cartesian co- 
ordinate system {yo,yi, ^ 2 ) so that E^ is 
given by _Vo = 0- To each point y of E^ we can 
associate an oriented circle in with center 
{0,yi,y2X radius |pol^ and positive (negative) 
orientation if Po is positive (negative). If y lies 
on E^, the corresponding circle is the point 
circle y itself. Now let us consider the group G^ 
of all affine transformations of E^ whose rota- 
tion parts leave the quadratic form Q'{y) — 

— yo + yi + yl invariant. (G' is an isometry 
with respect to the tmetric defined by Q'.) Each 
element of G^ induces a transformation of the 
set of oriented circles in E^ (including the 
oriented lines and point circles) onto itself. 

The mappings of E^ into defined by Xq = 

(1 + Q'{yW, =(i - e'(y))/2, ^2 =yi , X 3 = P2. 
X 4 = yo is a one-to-one correspondence of E^ 
onto the subset of such that Xo + x, 

This correspondence induces an isomorphism 
of G' onto G„. In Laguerre geometry, there are 
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no essential distinctions between points and 
oriented circles on E^; and the group acts 
on a 3-dimensional space of oriented circles 
(including point circles). 

Reference 

[1] W. Blaschke, Vorlesungen iiber Differen- 
tialgeometrie III, Springer, 1929. 
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Conformal Mappings 

A. General Remarks 

Let a function w= f{z) that maps a ^domain D 
on the ^complex z-sphere homeomorphically 
onto a domain A on the complex w-sphere 
satisfy the following two conditions: (1) Every 
curve C^\z{t) 1) that starts at any point 

Zq in D and possesses a tangent there has an 
image curve C„,:w = w(t)= /(z(t)) (O^t^ 1) that 
also possesses a tangent at the image point Wq 
= /(zq). (2) The angle between any two curves 
and CP* possessing tangents at Zq is equal 
to the angle between their image curves 
and where the direction of the angle is 
also taken into account. Then the mapping 
from D onto A is said to be conformal, and A 
is called conformally equivalent to £). It has 
been proved that w= /(z) is necessarily a func- 
tion ^analytic in D (D. Men’shov, 1931). 

Consequently, the theory of conformal 
mapping is a branch of the theory of analytic 
functions. That a function w=/(z) maps a 
domain D conformally onto a domain A 
means that it is a tmeromorphic function 
^univalent in D and its range is A. Then /'(z) ^ 
0 holds at every (finite) point in D, and the 
ratio of the lengths of the segments between 
two points Wq = w(0), w(t) on and between 
two points Zo = z(0), z(t) on tends to a fixed 
nonvanishing limit |/'(zo)i indepen- 

dently of the choice of C^. Hence if Zj and Z 2 
lie on and cP*, respectively, and Wj and 
Wj are their image points lying on the image 
curves C*/’ and C';^\ respectively, then the 
two triangles AzjZqZ 2 and AwiWoWj are nearly 
similar in the positive sense, provided that Zj 
and Z 2 are near enough to zq. This justifies 
the word “conformal,” which means “of the 
same form” (-- Appendix A, Table 13). 

B. Conformal Mapping onto the Unit Disk 

A fundamental theorem in the theory of con- 
formal mapping is Riemann’s mapping theo- 



rem, which states that any +simply connected 
domain D with at least two boundary points 
can be mapped conformally onto the interior 
A of the unit circle. This theorem is equivalent 
to the assertion of the existence of tGreen’s 
function of D and can be proved in various 
ways. B. Riemann (1851) gave a proof, based 
on an idea of C. F. Gauss, by assuming the 
existence of a solution for a variational prob- 
lem minimizing the ^Dirichlet integral. The 
logical incompleteness implied by this as- 
sumption was later removed by D. Hilbert 
and others. The proof that is now regarded 
as simplest is due to L. Fejer and F. Riesz’s 
method (T. Rado, 1922, 1923), which applies 
^normal family theory. On the other hand, 
the osculating process due to P. Koebe (1912) 
is a purely constructive method of proving 
existence that is also applicable to the case 
of tmultiply connected domains. The map- 
ping function w—f{z) in Riemann’s mapping 
theorem is uniquely determined under the 
normalization condition /(zq) = 0, arg/'(zo) = 
9o at a point Zq in D, where 9q is a given angle. 

The other types of simply connected do- 
mains are the Riemann sphere less one point 
and the sphere itself In both cases every con- 
formal mapping of these domains is a linear 
transformation. A simply connected domain 
is called hyperbolic, parabolic, or elliptic if 
it is conformally equivalent to the unit disk, 
the complex plane, or the Riemann sphere, 
respectively. 

Let w = /(z) map a simply connected domain 
D on the z-sphere conformally onto a simply 
connected domain A on the w-sphere. If a 
sequence {zj,} in D tends to a boundary point 
C of D, then the corresponding sequence {w„} 
(^v=/(2v)) has no taccumulation point in A 
and does not necessarily tend to a boundary 
point of A. 

If for any sequence (zj tending to C, {/(^v)} 
tends to a unique point co on the boundary of 
A, it is said that /(z) possesses a boundary value 
CO at C- To investigate the behavior of {wJ on 
a hyperbolic domain D, we can assume, by 
using a suitable mapping if necessary, that D is 
bounded. Then the problem is reduced, in view 
of Riemann’s mapping theorem, to the case 
where D is a bounded simply connected 
domain and A is the unit disk |w| < 1. 



C. Correspondence between Boundaries 

Concerning the correspondence between 
boundaries under the conformal mapping w = 
/(z) of a bounded simply connected domain 
D onto the unit disk |wl < 1, we have the fol- 
lowing three theorems: 
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(1) To any taccessible boundary point Zq of 
D there corresponds a unique point on |w| = 1, 
and to any distinct accessible boundary points 
Zc, and z^^ of D there correspond distinct 
points on the unit circumference. Furthermore, 
the set of all points on |w| = 1 that correspond 
to accessible boundary points of D has tangu- 
lar measure equal to 2n. 

(2) There is a one-to-one correspondence 
between tboundary elements of D and points 
on |vwl = 1 (C. Caratheodory). 

(3) Let w—f{z) map the interior D of a tjor- 
dan curve C conformally onto the unit disk 
A: |w| < 1. Then it possesses a boundary value, 
say f{Q, at every point C on C that satisfies 
1/(01 = 1. Hence /(z) is continuous on the 
closed domain D — DUC and maps D bijec- 
tively onto the closed disk A: |w| ^ 1. Similarly, 
the inverse function z = <p(w) has an analogous 
property and maps A bijectively and continu- 
ously onto D; that is, a conformal mapping of 
the interior D of a Jordan curve onto the unit 
disk A can be extended into a homeomor- 
phism of the closure 5 to A (Caratheodory). 

In this case, if the Jordan curve C contains 
a tregular analytic arc F, then the mapping 
function w— /(z) can be prolonged analytically 
beyond T (except at the endpoints of T). Hence 
the mapping w= /(z) is conformal at interior 
points of r. 

Problems on the correspondence of angles 
at the boundary are closely related to tangular 
derivatives. These problems have been at- 
tacked by Caratheodory, S. Warschawski, J. 
Wolff, and others. 



D. Schwarz-Christoffel Transformation 



The problem of determining the form of an 
analytic function that maps the interior of a 
circle or a half-plane conformally onto the 
interior of a polygon was first dealt with by 
H. A. Schwarz and E. B. Christoffel (1869) 

(-^ Appendix A, Table 13). 

Let P be a polygon in the complex w-plane 
with vertices and interior 

angles at Then a function w=f{z) that 

maps a circular disk or a half-plane in the z- 
plane conformally onto the interior of P is 
given by 



m=c 



f] (fl^-z)"'* Uz + C 



/i=i 



Here and C, C are constants de- 

pending on the position and magnitude of the 
polygon P. If instead of a disk we consider a 
half-plane in the z-plane, and if one of the is 
the point at infinity, then, modifying this for- 
mula by deleting the factor of the integrand 
corresponding to = oo, we obtain a formula 



for w= /(z). This representation is called the 
Schwarz-Christoffel transformation formula. 
This formula was originally derived by Chris- 
toffel to solve a problem of 2-dimensional 
distribution of stationary temperature. It 
then found extensive application to several 
problems of conformal mapping concerning 
polygonal domains and to problems of deter- 
mining force lines or stream lines and equi- 
potential lines in 2-dimensional electrostatics 
or hydromechanics. 

An analogous formula is also derived for a 
function that maps the interior of a circle or a 
half-plane conformally onto the exterior of a 
polygon. In connection with these formulas, a 
third-order differential equation that is satis- 
fied by a function mapping the interior of a 
circle onto a domain bounded by a circular 
polygon is found useful in the theory of auto- 
morphic functions. Moreover, a representation 
analogous to the Schwarz-Christoffel trans- 
formation formula is obtained for a function 
that maps the interior of a circle onto a curvi- 
linear polygonal domain bounded by arcs of 
logarithmic spirals with the origin as asymp- 
totic point. 



E. Conformal Mapping of Multiply Connected 
Domains 

It is also important to consider problems 
concerning one-to-one conformal mapping of 
a multiply connected domain on the z-sphere 
onto a suitable multiply connected domain 
on the w-sphere. The two domains D and T) 
are then homeomorphic, but the converse is 
not true; i.e., there does not necessarily exist a 
one-to-one conformal mapping between D and 
X) even when they are homeomorphic. Now let 
D and X be multiply connected domains on 
the z- and w-planes, respectively, both possess- 
ing at least three boundary points. The tuni- 
versal covering surfaces D and X of D and X 
are hyperbolic and they can be mapped onto 
the unit disks. The groups of their tcovering 
transformations form the tpuchsian groups G 
and ©. Then in order for D to be mapped 
one-to-one and conformally onto X, it is neces- 
sary and sufficient that the group © be trans- 
formed into the group G by a suitable linear 
transformation. 

To a domain of finite tconnectivity having 
only continua for its boundary components, 
we can associate conformal invariants (namely, 
moduli) expressed by one real parameter in the 
doubly connected case and by 3n — 6 real 
parameters in the n{ > 2)-connected case. A 
one-to-one conformal mapping is possible 
only within a class of domains having the 
same invariants (^ 416 Teichmiiller Spaces). 
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While a circular disk is taken as a canonical 
domain in the simply connected case, an tan- 
nulus is often taken as a canonical domain in 
the doubly connected case. In the latter case, 
the logarithm of the ratio ( > 1) of the radii of 
two concentric boundary circles is usually 
called the modulus. There are various types of 
n(^2)-connected canonical domains, for in- 
stance, the whole plane, a circular disk or 
annulus slit along concentric circular arcs or 
radial segments, a parallel slit plane, etc. The 
possibility of a one-to-one conformal mapping 
of a given domain onto a canonical domain of 
such a type was proved by Hilbert, Koebe, and 
others in a tpotential-theoretic way and by E. 
Rengel, R. de Possel, H. Grunsky, and others 
in a purely function-theoretic way. Such ca- 
nonical domains are characterized by some 
extremal properties. For example, the horizon- 
tal parallel slit mapping function Pq{z,Zq) = 
(z-Zo)“' +u[Po] (z-Zq)+ ...,ZosD, has 
the extremal property that it is the unique 
function maximizing Rea[/] within the 
family of univalent functions /(z) = (z — Zo)“* + 
a [/] (z — Zo) + ■ ■ • in D. Fora general domain 
D, Pq(z, Zq), called the extremal horizontal slit 
mapping, is defined as the limit function of the 
sequence of horizontal slit mappings Po ’(z, Zq) 
of D„, Z(,eD„. Here is a canonical ex- 

haustion of D. It has the same extremal prop- 
erty as in the case of finite connectivity. The 
boundary components of the image domain of 
D under Pq{z,Zq) consist of a horizontal slit 
and a point. The parallel slit mapping p^(z, Zq) 
in the direction of 0 is defined similarly to the 
one that maximizes Rec“^‘®n[/] [8]. 

For a domain of infinite connectivity, by 
accumulation of boundary components, a 
pointlike boundary component can be mapped 
onto a continuum. A boundary component y 
of a domain D is called weak if its image f(y) is 
a point under every conformal mapping / on 
D. y is called strong if /(y) always consists of 
more than one point. A boundary component 
y that is neither weak nor strong is called 
unstable (L. Sario, J. Analyse Math., 5 (1956)). 
Criteria of weakness, etc., by means of ex- 
tremal length are given in [8]. Moreover, in 
terms of extremal length, a generalization of a 
boundary element is given by Oikawa and 
Suita [9]. On the other hand, the existence 
of one-to-one and conformal mapping of a 
domain bounded by a finite number of curves 
onto the whole plane with mutually disjoint 
circular disks removed was proved by Koebe 
and later derived by J. Douglas and R. Cou- 
rant as a particular case of the existence of a 
solution of tpiateau’s problem [10]. M. Schif- 
fer showed that the mapping function is a 
solution of an extremal problem involving 



Fredholm eigenvalues [11]. L. Bieberbach and 
H. Grunsky showed the possibility of mapping 
an n-connected domain onto an n-sheeted 
disk. Concerning doubly connected domains, 
detailed investigations were made by O. Teich- 
miiller, Y. Komatu, and others. For a domain 
D bounded by n analytic Jordan curves, as 
an extension of the Schwarz lemma, Ahlfors 
showed that there exists a unique extremal 
function /o(z) maximizing Re/'(zo), ZqgD, 
within the family of analytic functions /(z) 
satisfying |/(z)| < 1 in £) and that /q(z) maps D 
onto the n-sheeted unit disk (1947). /q(z) is 
called the Ahlfors function at Zq. For a general 
domain the uniqueness of the extremal func- 
tion in nontrivial cases was established first by 
S. Havinson (1961) and later by L. Carleson 
(1967) and S. Fisher (1969). 

Most tkernel functions of a plane domain D 
have connections with conformal mappings. 
For example, the Bergman kernel U(z,Q of 
exact differentials is equal to (2n)~^ (Pq(z, Q 
--p’niiiz, 0). The adjoint kernel V(z, C) is de- 
fined by (2n)~^(p'(j(z,C) + p’„/2(^^0)- M. Schiffer 
showed that the integral of V(z,C), S(z,Q — 

V(z, C)dz, is univalent and maps D onto a 
domain bounded by n analytic convex curves 
if D is bounded by n Jordan curves [13]. 

For a general domain D, this is true in the 
sense that each boundary component of the 
image domain under S(z, C) is a convex set (K. 
Oikawa and N. Suita, Kodai Math. Sem. Rep., 
16 (1964)). It is known that S(z,Q maximizes 
the area of the complementary set of the image 
domain under /(z) within the family of univa- 
lent functions /(z) = (7i:(z — Q)“' -|-6i(z — ()-!- 

The maximum value multiplied by 47T is called 
the span of the domain D; it is equal to a[po] 
“^[Prt/z]- For iFe ^Szego kernel function 
k(z, C) of a domain D bounded by n analytic 
curves, the adjoint kernel l(z,() is defined, and 
the Ahlfors function /q(z) at C is expressed by 
k{z, C)/l(z, 0 (P- Garabedian, Trans. Amer. 

Math. Sac., 67 (1949)). 

F. Universal Constants 

Among various universal constants appearing 
in the theory of conformal mapping, Bloch’s 
constant is especially famous. A. Bloch (1924) 
showed that a covering surface over the w- 
plane obtained from a mapping w = F{z) — 
z + ... that is one-to-one, conformal, and 
holomorphic in |z| < 1 always contains a tuni- 
valcnt (schlicht) disk whose radius B is a posi- 
tive number independent of the function F 
(Bloch’s theorem). The supremum © of such 
constants B is called Bloch’s constant. The true 
value of ® is yet unknown, but estimates have 
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been given by Ahlfors (Trans. Amer. Math. 
Soc., 43 (1938)) and by Ahlfors and Grunsky 
(Math. Z., 42 (1937)) in the form 







ni/3) 

r(i/4) 



/r(ii/i2)y/2 

Vni/12); 

= 0.4719.... 



M. Heins showed that the equality sign can be 
deleted from the left-hand side (Nagoya Math. 
J., 21 (1962)) and C. Pommerenke gave a 
simpler proof for the same fact (J. London 
Math. Soc., 2 (1970)). It is conjectured that the 
correct value of © is equal to the upper bound 
of Ahlfors and Grunsky. Landau’s constant 2 
corresponding to the case where the image 
disks are not necessarily univalent satisfies 
0.5 <£<0.55. The lower and upper bounds 
are due to Ahlfors (cited above) and Landau 
(Math. Z., 30 (1929)). Pommerenke showed 
0.5 < (1970, cited above). For the family of 

univalent functions these constants coincide, 
and the value is called the schlicht Bloch 
constant, denoted by 91. Its lower and upper 
bounds are known to be 0.5705 (J. Jenkins, J. 
Math. Mech., 10 (1961)) and 0.6565 (R. Robin- 
son, Bull. Amer. Math. Soc., 41 (1935)). (For 
distortion theorems and coefficient problems 
438 Univalent and Multivalent Functions.) 
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A. General Remarks 

Suppose that we are given two straight lines I 
and m intersecting at V (but not orthogonally) 
in the 3-dimensional Euclidean space E^. By 
rotating the line m around /, we obtain a sur- 
face 5. We call this surface ^ a circular cone 
with vertex V and axis /; a straight line on the 
surface passing through V is called a generat- 
ing line of 5- 

A section C of 5 by a plane n not passing 
through V (C = 71 n 3^) is called a conic section 
(or simply a conic). This C is a plane curve on 
the plane n. The point set — F consists of 
two tconnected components 5i and ^2 - Let tt,- 
(i = 1, 2, 3) be planes not passing through V. If 
the conic section fl g is tbounded, then 

Cj is contained either in or in ^2 and is 
tconnected. We call such a an ellipse. When 
C2 = 712 n g is not bounded but is connected, 
then Tt 2 is parallel to one of the generating 
lines of g, and Cj is contained either in or 
in ^ 2 - We call such a C2 a parabola. When 71:3 
intersects both of 5i and 52> i^^n C3 =713 flF 
has two connected components and is not 
bounded. We call such a C3 a hyperbola. These 
three types exhaust all possible types of conic 
sections. In particular, if the plane n is per- 
pendicular to the axis I, then C = tt fl 5 be- 
comes a circle. Thus a circle is a special kind 
of ellipse. 



B. Foci and Directrices 

Let C = TT n be an ellipse. The Euclidean 

space is divided by n into two thalf-spaces 
£f , £| (two “sides” of tt). If we put H Ej* 

= 5ii> 5i l^^2 = 5 i 2» we can construct a 
sphere S that is contained in Ef, tangent to 
1 along a circle K, and tangent to tt at a 
point F. Similarly, we can construct a sphere 
S' that is in El, tangent to along a circle 
K', and tangent to tt at a point F'. We call F, 
F' the foci of the ellipse (Fig. 1). 
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Let K, k' be the planes containing K, K'. 
Straight lines d = KC\n, d' — k' Dn are called 
directrices of C. Unless C is a circle, we have 
F ^ F', and k, n (and k', tt ) actually intersect; 
hence d, d' exist. When C = ti fl 5 is a parabola 
or a hyperbola (Figs. 2 and 3), we can similarly 
define foci (a parabola has only one focus, F, 
while a hyperbola has two foci, F, F') and 
directrices (a parabola has only one directrix, 
d, while a hyperbola has two directrices, d, d'). 



V 




Fig. 3 



Let X be a point on the plane n, let £>^(2^) 
be the distance between the point X and a 
focus F, and let D^{X) be the distance between 
X and a directrix d. Then the curve C is the 
locus of the points X satisfying the condition 
Dp{X) = e ■ DJX), where e is a constant. We call 



e the eccentricity of the conic section C. Ac- 
cording as C is an ellipse, a parabola, or a 
hyperbola, we have e< 1, e= 1, or e> 1. A 
circle is an ellipse whose eccentricity is zero. 

An ellipse is also characterized as the locus of 
X such that FX + F’X = 2a; a hyperbola is the 
locus of X such that 1 FX — F'X \ — 2a, where a 
is a positive constant. When there are two foci, 
the straight line FF' is perpendicular to direc- 
trices d and d’. 

C. Canonical Forms of Equations 

When C is a hyperbola or an ellipse that is not 
a circle, C has two foci, F and F'. In this case, 
the midpoint 0 of the segment FF' is the cen- 
ter of symmetry of C (when C is a circle, its 
center O is, of course, the center of symmetry 
of C). We call 0 the center of C; an ellipse or 
a hyperbola is called a central conic. If we 
choose a rectangular coordinate system (x, y) 
having O as the origin and FF' as x-axis, then 
the equation of C can be expressed in the 
form 

x^/a^±yyb^ = l, a,b>0. (1) 

According as C is an ellipse or a hyperbola, 
we take the -I- or — of the double sign. If 
C is an ellipse, we have a>h. Furthermore, 
e~y/a^ — b^/a if C is an ellipse and e — 

-\-h^ja if C is a hyperbola. We also have 
F = (ae,0) and F' = ( — ae,0); the equations of 
directrices are x — ± a/ e. 

On the other hand, if C is a parabola, the 
straight line that is perpendicular to the direc- 
trix d and passes through F becomes the axis 
of symmetry of C. We call this straight line the 
axis of C; the intersection 0 of the axis and C 
is called the vertex of C. If we choose a rectan- 
gular coordinate system (x,y) having 0 as the 
origin and having the axis of C as the x-axis, 
the equation of C can be expressed in the form 

= 4ax, a>0. (2) 

We call (1) and (2) the canonical (or stan- 
dard) forms of the equation of C. We call the 
associated coordinate system the canonical 
coordinate system. Suppose that C is an ellipse 
(hence a>b). Let A, A' be points of intersec- 
tion of the x-axis and the ellipse and B, B' be 
the points of intersection of the y-axis and the 
ellipse. We call A A' the major axis of C and 
BB' the minor axis of C. If C is a hyperbola 
and (x, y) is the canonical coordinate system, 
we call the x-axis the transverse axis and the y- 
axis the conjugate axis. If C is a central conic, 
the X- and y-axes of the canonical coordinate 
system are called the principal axes; if C is a 
parabola, the x-axis is sometimes called the 
principal axis. 
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D. Properties of Ellipses 

An ellipse may be considered the image of a 
circle under a ^parallel projection. Conse- 
quently, the section of a circular cylinder by a 
plane is an ellipse. Also, if we are given a circle 
C and a fixed diameter D of C, an ellipse is 
obtained as the locus of the points X lying on 
lines PM, which are perpendicular to D, with 
PeC, MeD, satisfying the condition that the 
ratio PM'.XM is constant. 

Suppose that we are given two concentric 
circles having the center at the origin O and 
with radii a, b. Let P, Q be points of intersec- 
tion of moving half-lines through O and the 
two circles. Then the locus of points X of 
intersection of the ordinates {lines parallel to 
the y-axis) passing through P and the ab- 
scissae (lines parallel to the x-axis) passing 
through Q is an ellipse (Fig. 4). Suppose that 
the equation of an ellipse is given by x^/a^ + 
y^/b^ = 1 , with a > h. Then the lengths of its 
major axis and minor axis are 2a and 2b, 
respectively. 




Fig. 4 

Given an ellipse and its center O, the circle 
with center 0 and diameter equal to the major 
axis of the ellipse is called the auxiliary circle 
of the ellipse. Given an ellipse C and its focus 
F, the auxiliary circle of C is the locus of the 
points X satisfying the condition that the line 
FX is perpendicular to a tangent line to C 
passing through X. Suppose that A is a point 
on an ellipse with foci F, F'. Let TT' be the 
line tangent to the ellipse at X {X lies between 
T and T'). Then the angle L TXF' is equal to 
L T’XF (Fig. 5). Consequently, the rays start- 
ing from one focus of an ellipse and “reflected” 
by the ellipse converge on the other focus of 
the ellipse. Also, the product of the distances 
from two foci of an ellipse to an arbitrary tan- 
gent is constant and is equal to b^. 




Fig. 5 
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The ellipse C : x^/a^ + y^/b^ = 1 is expressed 
parametrically in the form 

x — acosd, y = bsind. (3) 

We call the parameter 0 the eccentric angle of 
a point (x, y) on C. Consequently, C is a tJor- 
dan curve and divides the plane into two parts, 
the inside and the outside. The inside is the set 
of points (x, y) satisfying x^/a^ + y^/b^ < 1, and 
the outside is the set of points (x, y) satisfying 
x^/a^ -I- y^jb^ > 1. The inside is a ^convex set. 
From a point Q outside C, two tangents to C 
can be drawn. The locus of points Q such that 
these two tangents are orthogonal is the circle 
x^ + y^ — a^ + b^. We call this circle the direct 
circle. The area of the “sector” OAX formed by 
two points A (a, 0), X (a cos 6, b sin 0) {0 > 0) and 
the origin O is ab0/2 = {ab/2) Arccos(x/a); the 
length of the arc /I A of the ellipse is repre- 
sented by the value of the telliptic integral 

nj y/\ — cos^l) dd = aE{n/2 — 0, e). 

In particular, the area inside an ellipse is equal 
to nab, and the whole length of the ellipse is 
4aE{0,e). 

With respect to a polar coordinate system 
(r, 0) having the focus F(ae, 0) as the origin and 
the ray directed positively along the x-axis as 
the initial line, the equation of the ellipse C is 



l-(-/cos(p’ a 

Here / is equal to half of the length of the 
chord that is perpendicular to the major axis 
and passes through the focus. (This chord is 
called the latus rectum of the ellipse.) Suppose 
that F is a fixed point and that A is a moving 
particle attracted toward F by a ^central force 
inversely proportional to the square of the 
length of FA. Suppose further that A begins 
with an initial velocity whose direction is 
tangent to the ellipse C with focus F. Then A 
always moves on C, and the areal velocity 
described by the radius FA is constant 
(Kepler’s second law). 



E. Properties of Hyperbolas 

Two straight lines x^/a^ — y^/b^ ~0, that is, 
y/x = + b/a, are ^asymptotes of the hyperbola 
C : x^ja^ — y^/b^ = 1. The hyperbola C'.x^ja^ — 
y^(b^ = — 1 is called the conjugate hyperbola 
of C. When a = b, the asymptotes are ortho- 
gonal to each other, and C, C are congruent. 
In this case, we call C a rectangular hyperbola 
(or equilateral hyperbola). When we draw 
parallels to asymptotes from a point A on C, 
the area of the parallelogram formed by these 
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lines and two asymptotes is constant. (In par- 
ticular, when C is a rectangular hyperbola, the 
equation of C becomes xy = k^/2 if we take 
two asymptotes as coordinate axes. The seg- 
ment cut off by the asymptotes on the tangent 
to C at is divided equally at X. In the case 
of the hyperbola as well, the product of the 
distances from two foci to an arbitrary tangent 
is constant and is equal to b^, and the angle 
between two straight lines joining two foci to a 
point on C is divided equally by the tangent 
at X. 

A hyperbola C is represented parametrically 
by 

x = asec0, y — b tan 6. (3') 

In this case also, we call 6 the eccentric angle 
of (x, y). If we use the thyperbolic functions 
and the parameter u, the equation of a hyper- 
bola can be written as 



x = ucosh«, y = bsinhu 



(3") 



instead of (3'). The area of a “sector” OAX 
formed by two points A{a,0), ^(.x,y) on the 
hyperbola and the origin O is, in this case. 



abu 



=— Arc cosh 
2 



X 

a 




The length of the arc AX of the hyperbola is 
given by the elliptic integral 



-dx. 



With respect to a polar coordinate system 
(r, (p) having the focus F{ae, 0) as origin and the 
ray directed positively along the x-axis as the 
initial line, the equation of the hyperbola C 
becomes 



1— ecosf/) a 

where I is equal to half the length of the chord 
passing through the focus and perpendicular 
to the principal axis. (This chord, too, is called 

the latus rectum.) 



F. Properties of Parabolas 

The curve described by a particle attracted by 
“gravitation” in a fixed direction and affected 
by no other force is a parabola (G. Galilei). 
The tangent at a point X on a parabola makes 
equal angles with the straight line joining F 
and X and the direction of the principal axis 
(Fig. 6). Consequently, if the rays starting from 
the focus of a parabola are “reflected” by the 
parabola, they all become rays parallel to the 
principal axis. Let X' be 




Fig. 6 



the point of intersection of the tangent at 
^Uo,yo) and the x-axis, X" the point of inter- 
section of the normal at X and the x-axis, and 
Xq the foot of the perpendicular from X to the 
x-axis. Then FX = FX\ AFXX' is an isosceles 
triangle, and XX' is divided equally by the y- 
axis. Consequently, the locus of the foot of a 
perpendicular from F to a tangent is the y- 
axis. Also, the length of subtangent X'Xq = 

2xq, and the length of subnormal X"Xo = 2a = 
20F. Conversely, a curve whose length of 
subnormal is constant is a parabola. The locus 
of the midpoints of parallel chords of a para- 
bola is a straight line parallel to the principal 
axis. 

With respect to a polar coordinate system 
having the focus as origin and the ray directed 
positively along the x-axis as the initial line, 
the equation of a parabola is 

r = , I — 2a. (4") 

1 — cos (p 



The area bounded by a chord BC and an arc 
BC of a parabola (Fig. 7) is equal to 4/3 the 
area of AABC, where A is the point of contact 
on the tangent of the parabola parallel to BC 
(Archimedes). Also, the length of the arc OX of 
parabola (2) is 



yo+y/yl+^^ 






X having the coordinates (xo,yo)- 




G, Conjugate Diameters 

The diameter is a straight line passing through 
the center of a central conic. The locus of the 
midpoints of the chords parallel to a diameter 
d is another diameter d', called conjugate to d. 




297 



78 J 

Conic Sections 



Then the diameter conjugate to d' is d. The x- 
axis and y-axis of a canonical coordinate sys* 
tern form a set of conjugate diameters. Let 
2a', 2b' be the lengths of the segments (some- 
times called conjugate diameters) cut off from 
d, d’ by the curve or by the curve and the con- 
jugate one for hyperbolas and by m, the angle 
between d and d'. Then the following relations 
hold (as to the double signs ± , we take + in 
the case of an ellipse and — in the case of a 
hyperbola); a'^ ±b'^ = a^ ±b^, a'b' sinco = ab. 
The product of the slopes of d, d' is equal to 
±b^/a^. With respect to an oblique coordinate 
system having d and d' as axes, the equation of 
the curve is x^/fl'^±yVh'^ = l- 



H. Confocal Conic Sections 

The set of ellipses and hyperbolas having two 
fixed points F, F' as foci is called the family of 
confocal central conics with foci F and F' (Fig. 
8). The family of confocal central conics con- 
taining the ellipse x^ja^ Fy^jb^ = 1 is repre- 
sented parametrically by 
2 2 

a^ + 2 b^ + 2 

There exist only one ellipse and only one 
hyperbola that pass through a point inside 
each quadrant (for example, a point (xo,yo), Xq 
> 0, yo >0, inside the first quadrant) and be- 
long to the family of curves. The ellipses and 
hyperbolas of the same family cut each other 
orthogonally. Thus parameters corresponding 
to ellipses and hyperbolas belonging to a 
family of confocal central conics define an 
orthogonal curvilinear coordinate system, 
called an ^elliptic coordinate system (-^ 90 
Coordinates). 




Fig. 8 



The set of parabolas having a fixed point F 
as focus and a straight line passing through F 
as axis is called a family of confocal parabolas 
(Fig. 9). The family of confocal parabolas 
containing y^ — 4ax is the set of curves 

— 4{a + 2){x + X). 

Such families also give rise to orthogonal 
curvilinear coordinate systems. 




Fig. 9 



I. Curves of the Second Order 

With respect to a rectangular coordinate sys- 
tem, a curve represented by an equation with 
real coefficients of the second degree with two 
variables x, y, 

ax^ F 2hxy F by^ F 2gx F2fyFc = 0, (5) 

where {a, h, b) ^ (0, 0, 0), is called a curve of the 
second order. A curve of the second order is 
either an empty set, one point, one or two 
straight lines, or a conic section. For equation 
(5), we put 
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and call D the discriminant of the curve of the 
second order. If Dq^O, D^O, and the curve is 
not an empty set, then the curve is a central 
conic. If Dq > 0, then the curve is an ellipse or 
an empty set. If Dq < 0, then the curve is a 
hyperbola. If Dq = 0, Dt^O, then the curve is a 
parabola. If D — O, Dq> 0, then the curve con- 
sists of one point. If Dq<0, then the 
curve is two intersecting straight lines. If £> = 

Dq = 0, then the curve is an empty set, one 
straight line, or two parallel straight lines. 

J. Poles and Polars 

Let F(x, y) = ax^ F 2hxy -I- by^ F 2gx F 2fy + c 
— 0 be the equation of a conic C and (xq, yo) 
the coordinates of a point P on the plane. A 
straight line P* having the equation 

axqX + /j(xoy + xyo) FbyQyF g{x F Xq) 

+/(y+yo)+c=o 

is called the polar of P with respect to C (Fig. 
10). When the polar of a point P is I, we call P 
the pole of I and denote it by /*. In general, /* is 
uniquely determined by I, and P** = P, t** — 

1. If QeP*, then PeQ*. If P'el, then l*eP'*. 
When a straight line passing through P inter- 
sects C at A", Y and intersects P* at P', then P, 
P' are tharmonic conjugate with respect to 
X, Y. In particular, if PeP*, then PeC, and 
P* becomes the tangent of C at P. Given a 
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triangle APQR on the plane of C, we call 
the triangle with sides P*, Q*, R* the polar 
triangle of APQR. Let Q*r\R*^ F, R* fl P* 

= Q', and P*f\Q* — R'. Then the three straight 
lines PU P', QUQ', RU R' meet at a point (M. 
Chasles). When the polar triangle of APQR 
coincides with itself, then APQR is called a 
self-polar triangle. The polar of a focus is a 
directrix. 




Fig. 10 



K, Curves of the Second Class 

When the coefficients u, u, w of a straight line 
ux-hvy + w = 0 satisfy an equation with real 
coefficients of the second order, 

Au^-h 2Huv + Bv^ + 2Guw + 2Fvw + Cw^ = 0, 

(S') 

where {A, H, the curve enveloped 

by these straight lines is called a curve of the 
second class. Let A be the discriminant defined 
analogously to D in (6) by using A, H, B,... 
instead of a, h, b, .... A curve of the second class 
{with A^O) is essentially the same as a curve of 
the second order with In order for the 
curve (5') with A^^O to coincide with the curve 
(5) with D^0,h is necessary and sufficient that 
A, B, C, F, G, H be proportional to the tco- 
factors of a, b, c, fg,h in the determinant D 
given by (6). If A = 0, then (5') represents either 
the empty set, or a point (regarded as the set of 
straight lines passing through the point), or 
two points. 

From a projective point of view, a curve 
of the second order is defined as a locus of 
the point of intersection lfU' = X of corre- 
sponding lines I and /' when two tpencils of 
lines A{l,m ,... ), A'{1', m',...) passing through 
two different centers A and A' are in corre- 
spondence under a ^projective mapping / (J. 
Steiner) (Fig. 11). From this it can be proved 
that three points of intersection of three pairs 
of opposite sides {AB, DE), {BC, EF), (CD, FA) 
of a hexagon inscribed in a curve of the second 
order are on the same straight line (Pascal’s 
theorem. Fig. 12). In particular, if the curve 
of the second order in this theorem consists 
of two straight lines, the theorem coincides 
with Pappus’s theorem (Fig. 13). We call this 



straight line / the Pascal line of ABCDEF. 
Given a set of six points A, B, C, D, £, F on a 
curve of the second order, by considering all 
possible combinations of the points, we get 60 
Pascal lines. A configuration consisting of 
these 60 lines is called Pascal’s conflguration, 
and has been studied by Steiner, Kirkman, 
and others. As a tdual to Pascal’s theorem, 
Brianchon’s theorem holds: Three diagonals of 
a hexagon with a curve of the second class 
inscribed meet at a point (Fig. 14). 




Fig. 11 Fig. 12 



F 




Fig. 13 Fig. 14 

References 

[1] G. Salmon, A treatise on conic sections, 
Longmans, Green, sixth edition, 1879 (Chelsea 
1962). 

[2] FI. F. Baker, Principles of geometry II. 
Plane geometry, Cambridge Univ. Press, 1922. 

[3] M. Protter and C. Morrey, Jr., Calculus 
with analytic geometry, Addison-Wesley, 1962. 



79 ( 11 . 20 ) 

Connectedness 

A. General Remarks 

A ^topological space X is said to be connected 
if there are no proper closed subsets A and B 
of X such that Ar\B = 0 and AUB = X (C. 
Jordan, Cours d’ Analyse I, 1983). A subset S of 
X is connected if S considered as a ^subspace of 
X is connected. 

If a subset S of 26 is connected, then the 
^closure S is also connected. Let be a 
family of connected subsets of X such that 
either A^f] A^^ 0 or A„f] A 0 for any 
pair A^ and Ap. Then the union [j^A^ is con- 
nected. The continuous image of a connected 
set is connected. The '’'product space Tl^X^ of 
a family of connected spaces {26a} is also con- 
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nected. Let [A^] be a family of connected 
subsets of X and /Iq be a connected subset of 
X. If AqH 0 for every A^, then the union 
Aq{J [JxA^ is also connected. For a point p of 
a topological space X, the union of all connec- 
ted subsets containing p is connected and is 
called the connected component of p (F. Haus- 
dorff, 1927). 

The n-dimensional Euclidean space £" 
(n^O) and the n-dimensional unit sphere S" 
(n> 1) are connected, whereas S^, consisting 
of two points, is not connected. £"\{0} is 
connected for n^2, whereas £^\{0} is not 
connected. This fact implies that is not 
homeomorphic to £" (n^2). A connected 
open subset of a topological space X is called 
a domain (or region) in X. 

A topological space X is said to be locally 
connected at a point p if for every open set U 
containing p, there is an open set V containing 
p and contained in the connected component 
of p in ( 7 . A is said to be locally connected if it 
is locally connected at each point of X. A 
space X is locally connected if and only if 
every connected component of every open 
subset is open in X. 

There are connected spaces that are not 
locally connected. For example, the comb 
space {(x,_v)gR^ | x= 1/n and O^y ^ 1 {n = 
1,2,3,...) or 1 and y = 0} is not locally 

connected at (0, 1) (see Fig. 1). 




Comb space. 



B. Arcwise Connectedness 

Two points a and b of a topological space X 
are said to be joined by an arc in X if there is a 
continuous map f{t) of the closed interval I = 
[0, 1] into X such that /(0) = a and f{\) = b. 

A topological space X is said to be arcwise 
connected (or path-connected) if every two 
points of X are joined by an arc in X. For 
a point p of X, the union of all arcwise- 
connected subsets in X containing p is arcwise 
connected and is called the arcwise-connected 
component (or path-component) of p. A topo- 
logical space X is said to be locally arcwise 
connected at a point p if for every open subset 
U containing p, there is an open subset V 
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containing p such that every two points in V 
are Joined by an arc in U. X is said to be lo- 
cally arcwise connected if it is locally arcwise 
connected at each point of X. 

An arcwise-connected space is connected, 
but the converse is not true. The tsinusoid 
{ (x, y) 6 I y = sin 1/x and 0<x<l orx = 0 
and — 1 :^y^ 1} is connected but not arcwise 
connected. A complete metric space is arc- 
wise connected if it is connected and locally 
connected. 

C. Simple Connectedness and ^-Connectedness 

Denote by S" the n-dimensional unit sphere 
and by the (n -t- 1 )-dimensional unit disk. 

A topological space X is said to be n-connected 
if every continuous map / from S'" to X is 
extendable over (m = 0, 1, 2, . . . , n). 0- 
Connectedness is equivalent to arcwise con- 
nectedness. A 1 -connected space is also called 
simply connected. (For simple connectedness 
and n-ply connectedness of plane domains 
333 Plane Domains) A topological space X is 
said to be locally n-connected at a point p if for 
every open subset U containing p, there is an 
open subset V containing p and contained in 
U such that any continuous map/:S'"-> V is 
extendable to a continuous map/:D"'^^-> 

1/ (m — 0, 1, 2, . . . , m). is said to be locally 
n-connected if it is locally n-connected at 
each point of 2f. A space X is said to be cu- 
connected (or locally co-connected) if X is n- 
connected (or locally n-connected) for every n. 

S" is (n — l)-connected but not n-connected. 
Similarly for If X is n-connected 

(n^ 1), then the tsuspension SX is (n+ 1)- 
connected and the tloop space is (n — 1)- 
connected. The Hawaiian earring {(x,y)e 
R^|(x- l/n)^-l-y^ = 1/n^ (n= 1,2, 3, ... )} is 
arcwise connected but not locally simply con- 
nected. The tcone over the Hawaiian earring 
is simply connected, but not locally simply 
connected (Fig. 2). 





Fig. 2 

Hawaiian earring and its cone. 

A topological space X is said to be contrac- 
tible if the identity map 1 ^ is +homotopic to a 
constant map to a point Xq of X (K. Bor- 
suk, Fund. Math., 24, 1935). A topological 
space X is said to be locally contractible at a 
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point p of X, if for every open subset V con- 
taining p, there is an open subset V containing 
p and contained in U such that the inclusion 
i:V cz U is homotopic to a constant map to a 
point in L/. X is said to be locally contractible 
if it is locally contractible at each point of X. 

A contractible or locally contractible space 
is co-connected or locally co-connected, res- 
pectively. The n-dimensional simplex, the n- 
dimensional disk £)", and the n-dimensional 
Euclidean space £" are contractible. More 
generally, a ■''convex set in £" is contractible. 
^Topological manifolds and tpolyhedra are 
locally contractible. 



D. Continua and Discontinua 

A topological space is said to be totally dis- 
connected if each connected component con- 
sists of one point. Divide the closed interval / 

= [0, 1] into three equal parts, and let /j i and 
/j2 be the closed intervals obtained from / by 
removing the middle open interval. As the next 
step, divide 7^ and 7,2 into three equal parts, 
respectively, and remove the middle open 
intervals. Inductively we obtain 2"^^ closed 
intervals 7„ , (i — 1 , . . . , 2" ^ ) from 2" closed 

intervals j by removing open intervals lying 
in their middles. Let IJf", 7„ , and C 
— C‘"'. Then C is called the Cantor dis- 

continuum or simply the Cantor set or the 
ternary set (G. Cantor, Math. Ann., 21 (1883)) 
(Fig. 3). 




Cantor discontinuum. 

C is a subset of 7 consisting of points with 
coordinates t — (n,/3) + (02/3^) -h . . . + {nJ3‘) 

+ ..., where n, = 0 or 2. As a topological space 
C is homeomorphic to the Cartesian product 
of countably many copies of the discrete space 
7) = (0, 1 }. C has the power of the continuum c 
and is a compact, totally disconnected, tperfect 
set. The Cartesian product of infinitely many 
(countably or not) copies of D is called the 
general Cantor set. The continuous image of a 
general Cantor set is called a dyadic com- 
pactum. Compact metric spaces and compact 
groups are examples of dyadic compacta. 

A continuum is by definition a connected 
compact metric space consisting of more than 
one point. A metric space X is said to be well- 
chained if for every two points a, b and fi>0 
there are points x,, X2, ..., x„_, such that 
d(Xj,x, •+,)<£ {xo — a,x„ = b). A well-chained 
compact metric space is a continum. 

Let 7C be a continuum containing two 



points a and b. Then K is said to be irreducible 
between a and b if there is no proper subcon- 
tinuum of K containing a and b (L. Zoretti, 
Ann. Sci. Ecole Norm. Sup., 26 (1909)) (-^ 93 
Curves). 

A continuum K is said to be indecomposable 
if there are no proper subcontinua K^, K 2 
such that K = K, U7C2 (L. E. J. Brouwer, Math. 
Ann., 66 (1910)). Simple examples of inde- 
composable continua have been given by 

A. Denjoy, C. R. Acad. Sci. Paris, 151 (1910); 

K. Yoneyama, Tdhoku Math. J., 12 (1917); and 

B. Knaster, Fund. Math., 3 (1922). 

C. Kuratowski conjectured the following: 

If a plane continuum K is homogeneous (that 
is, for any two points a,beK there exists a 
homeomorphism h:K-^K with h{a) = b) then 
K is homeomorphic to the circle. A counter- 
example for this conjecture has been found by 
R. H. Bing and E. E. Moise. It is an indecom- 
posable, homogeneous, plane continuum and 
is called the pseudo-arc [8]. 
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Connections 

A. History 

The geometric notion of connections orig- 
inated with T. Levi-Civita’s parallelism [Rend. 
Circ. Mat. Palermo, 42 (1917)) and was later 
generalized to the notion of connections of 
differentiable fiber bundles. Notions such as 
affine connections, Riemannian connections, 
projective connections, and conformal connec- 
tions can be described in terms of bundles 
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constructed from the tangent bundles of dif- 
ferentiable manifolds. They are also standard 
examples of the Cartan connections formu- 
lated by E. Cartan and C. Ehresmann. 

B. Connections in Principal Bundles 

Let P = {P, 7T, M, G) be a differentiable tprin- 
cipal fiber bundle. (For the sake of conve- 
nience, we assume that differentiability always 
means that of class C**.) The total space P and 
the base space M are ^differentiable manifolds, 
and the projection tc is a differentiable map- 
ping. The tstructure group G is a +Lie group 
and acts on P from the right as a transforma- 
tion group. On each fiber, G acts transitively 
without fixed points. For elements a, x in G, P, 
we write Ra(x) = xa. The mappings induced on 
^tangent vector spaces by and n will be 
denoted by the same letters, namely R^: T^{P) 
-^T^aiPX ^■TxiP)^Tn{x)i^)- The tangent vec- 
tor space T^(P) at each point x of P is mapped 
by the projection n onto the tangent vector 
space Tp{M) at the point p — tc(x) of M. The 
kernel of this mapping is denoted by V^iP), 
and each vector in k^(P) is said to be vertical. 
The kernel V^{P) is the totality of elements of 
T^(P) that are tangent to the fiber. 

C. Connections 

We say that a connection is given in P if for 
each point xeP, a subspace of the tangent 
space 7^(P) is given in such a way that the 
following three conditions are satisfied; (i) 

Tx(P) = V^{P) + (direct sum); (ii) RJQJ = 
QxaiQ is invariant under G); and (iii) the map- 
ping is differentiable. A vector in 

is said to be horizontal. 

Now suppose that X is an arbitrary tvector 
field on P. By condition (i), the value of X 
at each point x of P can be expressed uniquely 
as X^= Y^ + Z^, where Y^e V^{P) and Z^eQ^. 
The vector fields Y and Z defined by and 
(xgP) are called the vertical and horizontal 
components of X, respectively. Condition (iii) 
implies that if X is a differentiable vector field, 
then its horizontal and vertical components 
are also differentiable vector fields. Let X be a 
vector field on the base space M. Since n de- 
fines an isomorphism of and Tp(M) (p = 
7i:(x)), we have a unique vector field X* on P 
such that (a) 7r(X*) = X and (b) X*eQ^. We 
call X* the lift of X, and it is invariant under 
G by condition (ii). 

Suppose that a connection is given in P. If C 
is a piecewise differentiable curve in the base 
space M, we can define a mapping cp that 
maps the fiber over the initial point p of C 
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onto the fiber over the endpoint g of C as 
follows: Take an arbitrary point x on the fiber 
at p. Then we have a unique curve C* in P 
starting at x such that (a) 7 t(C*) = C, and (b) 
each tangent vector to C* is horizontal. (C* is 
called a lift of C that starts at x.) The endpoint 
y of the curve C* belongs to the fiber over q. 
We set (p{x) = y. Because C*a = Ra{C*), the 
mapping (p commutes with transformations of 
G. We call this mapping ip the parallel dis- 
placement or parallel translation along the 
curve C. 



D. Holonomy Groups 

Fix a point p in the base space. If C is a closed 
curve in M starting from p, the parallel dis- 
placement along C maps the fiber over p onto 
itself. So if we fix a point x on the fiber over p, 
X is transformed by the parallel displacement 
to a point xa (aeG). Thus each closed curve C 
starting from p determines an element a{x, C) 
of G. If C varies over the set of closed curves 
that start from p, the totality of such elements 
of G forms a subgroup of G. This subgroup is 
called the holonomy group of the connection 
defined over P with the reference point x. If M 
is connected, holonomy groups with different 
reference points are conjugate. In the above, if 
we choose as the closed curves C starting from 
p only those curves that are null-homotopic, 
the elements a(x, C) form a subgroup of the 
holonomy group. This is called the restricted 
holonomy group. The holonomy group is a tLie 
subgroup of the structure group, and its con- 
nected component containing the identity 
coincides with the restricted holonomy group. 
Holonomy groups are useful in the study of 
the behavior of connections. 



E. Connection Forms 

Let g be the Lie algebra (-^ 249 Lie Groups) of 
the structure group of G of a principal fiber 
bundle P = (P,n,M, G). For each A in g, the 1- 
parameter subgroup exp rT(— oo<t<oo)of G 
defines a tone-parameter group Kexp,^ of trans- 
formations on P, and it determines a vector 
field T* on P (^ 105 Differentiable Mani- 
folds). Each element of the vector field A* 
is vertical at each point x on P, and the A* 
(Teg) at x generate V^{P). Moreover, for each 
element a of G we have Pa(T*) = (ad(a“‘)T)*. 

For a connection in P, we define the connec- 
tion form CO on P with values in g by the fol- 
lowing: (i) (dJ^A*) — A (Teg), and (ii) cOj,(X) = 0 
(XeQJ. The connection form co thus defined 
satisfies (iii) P^(co) = ad(a~')co (ueG), where 
R*(co) is the tdilTerential form induced by the 
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transformation from the differential form 
(o. Conversely, given a 1-form o) with values in 
g that satisfies conditions (i) and (ii), we can 
define a connection in P by defining vectors X 
such that CL)(2f) = 0 as the horizontal, and its 
connection form coincides with to. Thus giving 
a connection in P is equivalent to giving a 
connection form in P. 

In particular, when a principal fiber bundle 
P is trivial, i.e., when P = M x G, we can iden- 
tify the tangent vector space T^{P) at a point x 
= {p, g) of P with the direct sum of Tp{M) and 
Tg{G). If we set Qjc—Tp{M), then Q defines a 
connection in P = M x G. Such a connection is 
called flat. When a conneetion can always be 
expressed as above locally, it is called locally 
flat. Since each principal fiber bundle is locally 
a product fiber bundle, we see that loeally 
there exists a connection. If the base space M 
is tparacompact, we can show the existence of 
connections on P. 

F. Extension and Restriction of Connections 

When a principal fiber bundle P — {P,k,M, G) 
has a treduced fiber bundle P', we shall con- 
sider the relation between the connections of P 
and of P'. Let G' be a Lie subgroup of G and g' 
its Lie algebra. We shall denote by j both the 
injection of G' into G and also the injection of 
g' into g. If there exist a differentiable principal 
fiber bundle P' = (P', n', M, G') and a differenti- 
able embedding / of P' into P such that nof 
= n’ and f oR„ — RJ^a^ o / (ue G') are satisfied, 
then (P',/) is said to be a reduced fiber bundle 
of P. Then we have for each 

A eg! and xeP'. 

Suppose that a connection is given in P'; we 
denote the horizontal space at the point x of 
P by Q'x- At the point f{x) of P, we take 
as the horizontal space and transform it by 
right translations of G. Thus we obtain a con- 
nection on P. Let oj' and to be the correspond- 
ing connection forms. Then we have joco' = 
f*(co) on P'. Conversely, suppose that we 
are given a connection in P with the connec- 
tion form CO. If the induced form f*{to) on 
P' has values always in ;(g'), we can write 
/*{(!))= jo to' , and to' defines a connection in 
P'. In this case the conneetion in P is called 
an extension of the connection in P', and the 
connection in P' is called the restriction of the 
connection in P. 

G. Curvature Forms 

Suppose that a principal fiber bundle P = 

(P, n, M, G) has a connection. Let F be a finite- 
dimensional vector space and a be a differen- 
tial form of degree k on P with values in F. We 



define the covariant differential Da of a by 

where the X^ are vector fields on P and h de- 
notes the projection to the horizontal compo- 
nent. Da is a differential form of degree k+l 
on P with values in F. 

Let p: G^GL(F) be a trepresentation of a 
Lie group G on F. A differential form a on P 
with values in F is called a pseudotensorial 
form of type p if a satisfies R*{a) = p{a~^)a 
{aeG). In particular, if a pseudotensorial form 
a satisfies ;(A*)a = 0 for any Aeg 105 Dif- 
ferentiable Manifolds Q), it is called a tensorial 
form of type p. For each representation p of G, 
we can construct an associated vector bundle 
E over M with fiber F. A tensorial form of 
type p is identified with a differential form on 
M with values in E. If a is a pseudotensorial 
form of type p, then Da is a tensorial form of 
type p. 

For a connection form to on P, the co- 
variant differential Dto — Q of to is called the 
curvature form of the connection. Since to is a 
pseudotensorial form of type ad, D is a ten- 
sorial form of type ad. For the connection 
form we have the structure equation dto = 

— [co, cu] -t- Q [4, 6]. Let X and Y be vector 
fields on M, and let X* and Y* be their lifts, 
respectively. Then we have cy{[A*, T*]) = 
n(A*, Y*), which shows that the curvature 
form Q for X*, T* gives the vertical compo- 
nent of [A*, 7*]. 

It is known that the following three con- 
ditions for a connection are equivalent: (i) The 
connection is locally flat, (ii) The curvature 
form vanishes, (iii) The restricted holonomy 
group is trivial (i.e., the identity group). 

The following two theorems are fundamental: 

(1) Suppose that a connection is given in a 
principal fiber bundle P = (P, n, M, G). Then 
the structure group of P can be reduced to the 
holonomy group [4,6]. In fact, for xeP, let 
P(x) be the set of points y in P that can be 
connected to x by a piecewise horizontal curve 
in P. Then P(x) gives a reduced fiber bundle of 
P, and the connection in P is an extension of a 
connection in P(x) [4,6]. 

(2) The Lie algebra of the holonomy group 
with a reference point x in P coincides with the 
vector subspace of g spanned by {Dy(A, 7)| 
yeP{x),X, YeTyiP)} [4,6]. 

The curvature form O is used to express the 
^characteristic classes of the bundle P [1,2] 

56 Characteristic Classes). 

In some cases, a connection in the principal 
fiber bundle induces a connection in an tas- 
sociated fiber bundle. In particular, when G is 
GL(n, R) or GL{n, C), we can define a connec- 
tion in any associated vector bundle. The 
notion of connections in vector bundles can be 
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defined more algebraically (M. F. Atiyah, 
Trans. Amer. Math. Soc., 85 (1957)) and can 
also be defined as a kind of differential oper- 
ator on vector bundles [8]. 



H. Affine Connections 

Let M be a differentiable manifold of dimen- 
sion n and P be the tbundle of tangent n- 
frames over M. Then P has the structure 
group GL(n,R), and it is the principal bundle 
associated with the tangent vector bundle of 
M, which consists of all tangent vectors of M. 
A connection in the bundle of tangent n- 
frames is called an affine connection (or linear 
connection) on M. An affine connection on M 
defines (as well as the curvature form Q) a new 
form 0 called the torsion form on P, which is 
given as follows: Let F be an «-dimensional 
vector space with a fixed basis ((^^, (^ 2 » • ■ • ^ ^n)- 
Since the bundle of tangent n-frames P is 
the set of all bases (i.e., n-frames) (e j , . . . , cj 
in Tp{M) at each point p of M, every point 
x = (ci, ...,c„) of P is given as a mapping x 
of F onto Tp{M) (p = n(x)) defined by 
We define differential form 0 of degree 1 with 
values in F on P by 0^(A) = x"^(7t^(A)) (Ze 
7^(P)). 9 is called a canonical 1-form of the 
bundle of tangent n-frames of the manifold M 
and has the following property: Any diffeo- 
morphism (p of M onto itself induces a bundle 
automorphism 0 of P onto itself, and ^ pre- 
serves 6, that is, ^*{6) = 6. Conversely, we can 
show that any bundle automorphism of P that 
preserves 6 is induced by a diffeomorphism of 
the base space M. 

For an affine connection on M, we define 
the torsion form 0 by © = DO. © is a differen- 
tial form of degree 2 on P with values in F and 
satisfies Rg& = a~^ -0 (aeGL(n, R)). Further- 
more, we have the structure equation for 0, 
de^io), <?] + © [2,4, 6], 

For each element ^ in F, there exists a 
unique horizontal vector field B(^) on P such 
that 9(B(^)) = ^. B(^) is called the basic vector 
field corresponding to At each point xeP, 

• • • > form a basis of Q^. Let 
(A^,..., A^j (m = n^) be a basis of g = gl(«, R). 
Then at each point xeP, {{Af)^, ... ,{A*)^, 
B((^i)^, . . . , B(^„)^} is a basis of the tangent 
vector space 7^(P). Thus the bundle P of 
frames is a tparallelizable manifold. The pro- 
jection to M of any integral curve of a basic 
vector field is a geodesic, which is defined in 
Section I [4]. 

An affine connection on M gives a parallel 
displacement of the tangent vector space of M 
as follows: Let C — (0 < t ^ 1 ) be a curve in M 

and C* - X, be a lift of C to P. The parallel 
displacement of the tangent n-frame Xq at po 
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along the curve C is x„ and the mapping x,o 
Xq * : Tp^{M)^Tp^{M) is called the parallel dis- 
placement of the tangent space Tp^(M) onto 
Tp {M) along C. It is easily seen that the map- 
ping is independent of the choice of lifts. 



I. Covariant Differentials 

Let C = { p,} (0 ^ t < 1) be a differentiable curve 
in M. If we have a vector in Tp {M) for each 
t and the correspondence 5^ is differenti- 
able, then { 1^} is called a vector field along the 
curve C. For { we set 

Y,' {¥,+,)- X), 

«->0 

where (pt ^ is the parallel displacement of 
Tp^[M) onto Tp +ft(M) along the curve C. The 
vector field {1^'} along C thus obtained is 
called the covariant derivative of {1^}. {1^} is 
parallel along C; that is, Y, = (Pq ,{ Yq) if and 
only if 1^' = 0. In particular, if the tangent 
vectors to a curve C are parallel along C itself, 
then C is said to be a geodesic. 

Let X and Y be vector fields on a manifold 
M with an affine connection. The covariant 
derivative Vy Y of the vector field Y in the 
direction of the vector field X is defined as 
follows: Let Po be a point in M, C= (pj 
( — £ ^ t <s) be an integral curve of X through 
Po, and {(pj be the parallel displacement along 
C. We set 

(V,y),„=lim(l/t)te-'(y„)-V 

Then Vy Y is also a vector field on M. 

The mapping (X, satisfies the 

following three conditions: (i) Y is linear 
with respect to X and Y; (ii) VjxY=f- Vy T; 
and (iii) Vx{fY)=f-VxY+{Xf)-Y, where /is 
a differentiable function on M. Conversely, if a 
mapping satisfying conditions (i)-(iii) above is 
given, then there exists a unique affine connec- 
tion on M whose covariant derivative coin- 
cides with the given mapping [4,6]. 

Fix a vector field Y. Then the mapping X 
Vy Y defines a ttensor field of type (1, 1). This 
tensor field is called the covariant differential 
of Y and is denoted by VT. Now fix a vector 
field X. Then the mapping T can be 

naturally extended to tensor fields of arbitrary 
type, and it commutes with the tcontraction of 
the tensors. For a tensor field K this is denoted 
by Furthermore, the mapping 2f-> 

V;fX is called the covariant differential of K 
and is denoted by VK. We call V^K the co- 
variant derivative of K in the direction of X. A 
tensor field K is invariant under parallel dis- 
placements if and only if VK = 0 (^ 417 Ten- 
sor Calculus). 
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J. Curvature Tensors and Torsion Tensors 

For an affine connection on M, the curvature 
tensor R and the torsion tensor T are defined 
by 

RiX, Y)(Z) = V^(V,Z) - VyiV.Z) - V[;,, y,iZl 

T{X = Y-VxX~ IX, y], 

where X, Y, and Z are vector fields on M, and 
R and T are tensors of types (1, 3) and (1,2), 
respectively. Also, in terms of the curvature 
form Q and the torsion form © on the bundle 
of tangent n-frames P over M, they can be 
defined by 

i?p(A,y)(Z) = x-i Q,(A*, y*)-3c(Z), 

Tp(X, y) = x-‘(®x(^*, F*)), 

where 7r(x) = p, X, YeTp{M), and X*, Y* are 
lifts of X, Y, respectively. The curvature tensor 
and the torsion tensor satisfy the relations 
R{X, y)= -R{Y,X), T{X, y)= - T{Y,X). 
Moreover, Bianchi’s identities hold: 

®(K(A, Y)iZ))^B{T{T{X, Y),Z) 

+ (V;,T)(y,Z)) 
and 

^{(X,R)(Y,Z) + RiTiX, y),Z)) = 0, 

where ® denotes the sum of terms that are 
obtained by cyclic permutations of X, Y, Z 
[4]. For instance, in the case of a Riemannian 
connection (Section K), we have T=0, and 
Bianchi’s identities reduce to 

R(X, Y) (Z) + R{Y,Z) (A) + R(Z, A) ( F) = 0, 

(V;,R)(y,Z) + (VyR)(Z,A)+(V^R)(A, 7)^=0. 

We now consider a system of coordinates 
(x\ . . . , x") in an n-dimensional linear space M 
= R". The vector fields (Aj , . . . , X„) (AT; = d/dx‘) 
form a basis for vector fields on M. If we set 
V;f A~0, we get an affine connection qn R". 
For such a connection we have R = 0, T= 0, 
and any straight line in R" is a geodesic with 
respect to this connection. The connection is 
called the canonical affine connection on R". 

An affine connection on a manifold M satisfies 
R = 0 and T= 0 if and only if the connection 
on M is locally isomorphic to the canonical 
affine connection of R". 

Let (p be a diffeomorphism of a manifold M 
with an affine connection onto itself. We call <p 
an affine transformation of M if the induced 
automorphism (p on the bundle of tangent n- 
frames preserves the connection. In terms of 
covariant differentials, this condition is equiva- 
lent to the condition \^xM^) = <p{V^Y){ot 
any vector fields X, Y. An affine transforma- 
tion of the canonical affine connection on R" 



is an ordinary affine transformation. For an 
affine connection on a manifold M, the set of 
all affine transformations forms a Lie group 
and acts on M as a Lie transformation group 
[4,5]. 

Let M be a manifold with an affine connec- 
tion. M is called an affine locally symmetric 
space if VR =0 and VT = 0 are satisfied. These 
conditions are satisfied if and only if at each 
point p of M, there exist a neighborhood U of 
p and an affine transformation (p of U such 
that (p^ — 1 and p is the isolated fixed point of 
(p. If for each point p of M there exists an 
affine transformation of M such that (p^ = l 
and p is an isolated fixed point, then M is 
called an affine symmetric space. A symmetric 
Riemannian space is a special case of this type 
(-^413 Symmetric Spaces). 

At each point p in a manifold M with an 
affine connection, we can choose local coordi- 
nates (x\ ... ,x") such that x'(p) = 0 and the 
curve x‘ = fl‘t ( — <5) is a geodesic for each 
with L(n')^ = l- Such local coordi- 
nates are called geodesic coordinates at p [4]. 
With respect to geodesic coordinates, we have 
(Vx,(A,))p=^0(5/dx‘ = X,). 



K. Riemannian Connections 

When a Riemannian metric g (-^ 364 Rieman- 
nian Manifolds) is given on a manifold M, it 
defines a metric on the tangent space Tp(M) at 
each point p of M, and we can take ortho- 
normal bases in Tp(M). The set P' of all ortho- 
normal bases of tangent spaces is a subset of 
the bundle P of tangent n-frames of M and 
forms a subbundle of P; its structure group is 
the ^orthogonal group 0{n), and P' gives a 
reduction of the bundle of tangent n-frames. 
Conversely, when a reduction of the bundle 
of frames to 0{n) is given, we can define a 
Riemannian metric on M such that the re- 
duced bundle consists of all orthonormal 
frames. 

For a Riemannian metric g on M, there 
exists a unique affine connection on M such 
that (i) Xg — 0, and (ii) the torsion tensor T 
vanishes [4]. This connection is called the 
Riemannian connection corresponding to g. 
The first condition is equivalent to the invar- 
iance of the Riemannian metric g under paral- 
lel displacement. Thus the affine connection 
transforms orthonormal bases on M to ortho- 
normal bases and induces a connection in the 
bundle F. It is known that the restricted holo- 
nomy group of any Riemannian connection is 
a closed subgroup of 0(n) [4]. An affine con- 
nection on a manifold M is called a metric 
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connection if it preserves a Riemannian metric 
g on M, i.e., if it satisfies the condition (i). 

L. Representations in Local Coordinates 



(5) A curve x‘ = x'{t) is a geodesic if and only 
if 



d^x‘ 

~di^ 



^ . dx^dx 



... ,n 



(1) Let ...,x") be a local coordinate system 
in a manifold M and consider vector fields 
Xi — djdXj. For an affine connection on M, 
the covariant derivative can be expressed as 



(— ^ 178 Geodesics, 417 Tensor Calculus). 

M. Cartan Connections 






The rjk are called coefficients of the affine 
connection with respect to the local coordinate 
system (x\ ... ,x"). We denote by the coeffi- 
cients of connection with respect to another 
local coordinate system ...,y"). Then on 
the intersection of their coordinate neighbor- 
hoods, we have 



dy‘ fdx^dx^ 5^x® \ 

p = V I p -I 1 

^^.ydx^^ydy^ dy’^ VV/' 



Conversely, if the Tj^ are given in each local 
coordinate system of M and satisfy this rela- 
tion on each intersection of their coordinate 
neighborhoods, then there exists a unique 
affine connection such that the coefficients of 
the connection are given by Fj^. 

(2) The coefficients of the Riemannian con- 
nection corresponding to a Riemannian metric 
g = 'L 9 ijdx'dx^ on a manifold M are given by 



2Y 



dx'* 



^9ik 


\ 


dx^ 


dx‘ j 



and are called the Christoffel symbols. 

(3) With respect to each local coordinate 
system (x*, ... , x"), we express the torsion ten- 
sor T and the curvature tensor R of an affine 
connection by 



T=^7}idx^0dx'^0Xi, 

ijk 

R = Yj R}fc(dx'^(x)tix*® dx'0 

ijkl 

The components Tj[ and are given by 



T^i pi pi 

^jk — A ife ~ A kp 

Rjw = (SryBx‘ - drydx') 
+I(r,“rL-r.”;r;j. 

m 

(4) Let K = ; be a tensor field of type 

(r, s). Then the covariant differential VX = 






a=l \ I 








Let M be a differentiable manifold of dimen- 
sion n. Consider a homogeneous spaee F = 
G/G' of the same dimension n, where G is a Lie 
group and G' is a closed subgroup of G (-^ 199 
Homogeneous Spaces). Let B = {B, M,F, G) be 
a fiber bundle over M with fiber F and struc- 
ture group G, and P — {P, M, G) be the prin- 
cipal fiber bundle associated with B. Suppose 
that there exists a cross section / over M to B. 
Then the strueture group of P can be reduced 
to G'. We denote this reduced fiber bundle by 
P' — {F, M, G’) and the injection of P' into P 
by J (-^ i47 Fiber Bundles). 

Suppose that a connection is given in P. Its 
connection form ca is a differential form of 
degree 1 on P with values in g, and the in- 
duced form 0 )' =j*{o}) is also a differential form 
of degree 1 on P' with values in g. We call the 
connection in P a Cartan connection on M 
with the fiber F ~ G/G' if at each point x of 
P', ca^ gives an isomorphism of T^iP') onto g 
as linear spaces. Such a connection in P is 
equivalently defined as a 1-form ca' on P' with 
values in g satisfying the following three con- 
ditions: (i) oi'{A*) — A (A eg' (Lie algebra of 
G')); (ii) R*(ca') = ad(a"^)ca' (aeG'); and (iii) ca^ 
gives an isomorphism of Fc(P') onto g at each 
point xeF. For such ca', we can take a con- 
nection form ca in P such that ca'=j*(ca); ca 
defines a Cartan connection. 



N. Soudures 

A cross section / over M to B gives a vector 
bundle T{B) on M defined as follows: For 
each point p of M, the projection B-*M de- 
fines a mapping 7}(pj(B)-»Tp(M). The kernel of 
this mapping is denoted by VJ■^p^{B). Then 
r(B)=u p Fy(pj(B) forms a vector bundle over 
M, and the dimension of its fibers is equal to 
n = dim F. 

A Cartan connection in P gives a bundle 
isomorphism between T'{B) and the tangent 
vector bundle T{M) of M as follows: Let x be 
an arbitrary point in F, and put p — n{x). The 
projection tz: P'^M induces an isomorphism 
of T^(P')/F,(P') onto Tp(M). On the other hand, 
ca'x gives an isomorphism of TJ,F)jVJJ*') onto 
g/g'. As a point in P', x gives a mapping of F 
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= G/G' onto the fiber in B over p and sends 
the point {G'} in F to f(p). By this map- 
ping, Tq{F) — g/g' is mapped isomorphically 
onto VJ■^p){B). Combining these isomorphisms, 
we get an isomorphism between Tp{M) and 
that is independent of the choice of 
xeP' over p. The set of such isomorphisms for 
peM defines a bundle isomorphism of T{M) 
and T'{B). If a fiber bundle B over M has an 
isomorphism such as above through a cross 
section, then B is said to have a soudure [3], 

Conversely, if a fiber bundle B over M has a 
soudure with respect to a cross section /, then 
there exists a Cartan connection in P such that 
the soudure given by the connection coincides 
with the original one [3], There are many 
Cartan connections in P with the given sou- 
dure. However, when F = G/G' is a symmetric 
space of compact type such that G is noncom- 
pact and contains the identity component of 
the group of isometries, we can determine 
uniquely the so-called normal Cartan connec- 
tion among the Cartan connections which 
gives rise to the given soudure [10]. 

For the tangent vector bundle T(M) of M, 
the fiber F is an n-dimensional linear space 
and can be expressed as F — G/G', where G is 
the taffine transformation group of F and G' 

= GL{n, R). Then T{M) has the 0-section over 
M, and there exists a natural soudure. Fur- 
thermore, an affine connection on M canoni- 
cally induces a Cartan connection on M with 
the fiber F = G/G' [3]. 

For a Cartan connection on M, we can 
introduce the notion of development of a curve 
in M into the fiber and also the notion of 
completeness [3]. 



O. Projective Connections 

Let = Gi/Gj and F 2 = G2/G2 be homoge- 
neous spaces with dim Fj = dim F 2 = n. Sup- 
pose that Gi is a Lie subgroup of G2 and G\ is 
contained in G2 by the injection. Then we have 
a canonical injection F, ->F2 (F^ is an open 
subset of F2 by the assumption). 

Suppose that a fiber bundle with fiber F^ 
over M has a cross section /j . Using /i , we 
can construct a bundle B2 with fiber F2 over M 
which also has a cross section /2. The prin- 
cipal bundle of B2 is given by extending the 
structure groups from the principal bundle 
of Bj . We can show that if B^ has a soudure 
with respect to the cross section /i, then B2 
also has a soudure with respect to the cross sec- 
tion f 2 - A Cartan connection in the principal 
fiber bundle P^ associated with Bj that is com- 
patible with a given soudure on Bj induces 
a Cartan connection in the principal fiber 
bundle P 2 associated with Bj, which then 



induces a soudure on B 2 . The latter is called a 
Cartan connection induced from the former. 

Let F, be an n-dimensional linear space and 
F2 be the real tprojective space of dimension n. 
Then the affine transformation group of Fj 
can be embedded into the projective transfor- 
mation group of F2. Thus the tangent vector 
bundle of a manifold M induces a fiber bundle 
over M with the n-dimensional projective 
space as its fiber. A Cartan connection in this 
fiber bundle is called a projective connection on 
M. By the argument in this section, we see that 
every affine connection on M induces a projec- 
tive connection on M. 

Given two affine connections on M, we 
denote by V and V' their corresponding covar- 
iant differentials. The two affine connections 
on M induce the same projective connection 
on M if and only if there exists a differential 
form (p of degree 1 on M such that F— 

VxY = ip(X)Y ^ (f>(Y)X for any vector fields X, 
F[7]. If a diffeomorphism cp of M preserves 
the projective connection induced by an affine 
connection in M, then cp maps geodesics of M 
into geodesics. 



P. Conformal Connections 



Let Fj be an n-dimensional Euclidean space 
and F2 an n-dimensional sphere (a tconformal 
space). We can embed the group of tisometries 
of Fi canonically into the group of tconformal 
transformations of Fj. A Riemannian metric of 
the tangent vector bundle of a manifold M of 
dimension n gives a fiber bundle over M with 
fiber F2. A Cartan connection in this fiber 
bundle is called a conformal connection on M; 
a Riemannian connection on M induces a 
conformal connection on M. 

Two Riemannian metrics 02 on M in- 
duce the same conformal connection on M if 
and only if there exists a positive function / 
on M such that gj =fy\- Thus for a Rieman- 
nian manifold M with metric tensor 0 1 , a dif- 
feomorphism (p of M such that <p*{g)=fgi 
leaves invariant the conformal connection 
induced by 0i . Such a cp is called a conformal 
transformation of M with respect to the given 
Riemannian metric 0^. 

For a Riemannian manifold M with metric 
tensor g, we define Weyl’s conformal curvature 
tensor W by 

fYjki = Rjki — Rji5k -I- 0,'fcR; — GjiRk) 



R 

(n-l)(n-2) 






where the RJki and Rjk are components of the 
curvature tensor and Ricci tensor, respectively, 
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and R is the scalar curvature (-^ 364 Rieman- 
nian Manifolds, 417 Tensor Calculus). When 
dimM^3, the conformal connection induced 
by 3 on M is locally flat if and only if the 
conformal curvature tensor vanishes [7]. 

Q. Yang-Mills G-Connection 

Let P = (P, n, M, G) be a differentiable principal 
fiber bundle over a compact oriented Rieman- 
nian manifold M with group G. Then fiber 
bundles Gp = Px^G and Qp = P x asso- 
ciated with P are induced naturally from the 
group conjugation c: G->Aut(G) and the ad- 
joint representation Ad:G^Aut(g), respec- 
tively. A (local) section of Gp is called a (local) 
gauge transformation of P. The set of all global 
gauge transformations, which is denoted by 
^p, has a group structure. 

A locally faithful representation p of G to 
an n-dimensional complex vector space F with 
a fixed basis (ii,..., <^„) defines a differenti- 
able complex vector bundle E — PXpF asso- 
ciated with P. Every point x of P is identified 
with a linear mapping x of P onto the fiber 
defined by x; where denotes the 
equivalence class of (x, ^i]GP x F, 

In a manner similar to the case of an affine 
connection, a connection in P with connection 
form 0 ) gives a notion of parallel displacement 
of E as follows: Let c = p, (0 ^ < 1) be a curve 
in M and c* = x, be a lift of c to P. The map- 
ping x,oxo^ is called the parallel 

displacement of Ep^ onto £^^ along c. 

Let ^ be a vector field on M and <p be a 
differentiable section of E. The covariant de- 
rivative Vjf (p of ^ in the direction of X is de- 
fined as follows: Let Po be a point of M, c = p, 

( — — e) be an integral curve of X through 
Po and c* = Xf be a lift of c to P. We set 

1 _ 

( Vx (p)p^ = lim - (x, o xo {(pp) - (Pp J. 

Then is also a differentiable section of £. 

The mapping {X, (p)->Vy(p satisfies the fol- 
lowing conditions: (i) V^(p is linear with respect 
to X and (p; (ii) and (iii) Vxih) = 

{Xf)(p +/Vj^^(p, where / is a differentiable func- 
tion on M. From these conditions it is seen 
that the mapping A'-^Vjj-cp for a fixed section 
cp of £ defines a differential form of degree 
one with values in £, denoted by V<p. V is 
called a G-connection on £ (induced from the 
connection in P). A linear operator d'':r(A^(g) 
£)->r(A'’^^ ® P), defined by (f(a®(p) = d<x® 
(p-\-{~\y a A for a differential form a of 
degree p and a differentiable section <p of £ is 
called a covariant exterior differentiation. 

Here L(A'’ (x) £) denotes the set of all differ- 
ential forms of degree p with values in £. 
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The curvature form rX of V is defined by 
y ) Cp = Vx ( Vy <p) - Vy ( Vx (p) - Vjx , yi <P, 
where X and Y are vector fields on M and 
^ is a differentiable section of £. In terms of 
the curvature form Q of the connection in P, 
it can be defined also by 

RliX, Y)(p = xoQ^{X*, Y*)-x-^{(p), 

where X and Y are in TpM and xen^'^ip) and 
X*, y* are lifts of X, Y to P, respectively, and 
(p is an element of Ep. Since 12 is a tensorial 
form of type ad and the differential of 
p induces a faithful representation of g to 
gl(F), R^ can be regarded as a differential 
form of degree two with values in the bundle 

gp — P X Adg- 

The curvature form R^ satisfies Bianchi’s 
identity: — 0, where is a covariant 

exterior differentiation with respect to the G- 
connection on g^ canonically induced from 
the connection in P. 

We denote by ^ the set of all G-connections 
on £. Now let G be a compact semisimple Lie 
group. A functional defined by V 

-+5^(V) = — ijiif tr(R^ A * RX) is called the 
action integral (Yang-Mills functional) of V, 
where * is Hodge’s star operator, given by the 
fixed orientation of M. 

The group ^p acts on ^ by o V of 

for f o^p and then the curvature form is trans- 
formed by this action as R'^^Ad(/“*)R'^. 

Thus ^ is '^p-invariant. 

A connection V is called a Yang- Mills G- 
connection if V is a critical point of The 
Euler-Lagrange equation of is given by 
d'^*R^ = 0 by the aid of the formal adjoint 
operator d'^* of d'^. This equation, called the 
Yang-Mills equation, is a system of non- 
linear second-order elliptic partial differential 
equations. 

When M is a 4-dimensional vector space 
with Minkowskian metric and G is the Abelian 
group G(l), the Yang-Mills equation coincides 
with Maxwell’s equations for an electromag- 
netic field. Thus the Yang-Mills equation for a 
non-Abelian group G is a natural extension of 
Maxwell’s equations. In fact, the theory of 
Yang-Mills connections has its origin in the 
field theory of physics [11]. 

In the case of dim M = 4, special Yang- 
Mills G-connections occur. A G-connection 
V satisfying the condition *R'^ = R'^ (resp. * 

R^ = — R^) as a differential form of degree 
two is called a self-dual (resp. anti-self-dual) 
G-connection. From Bianchi’s identity and 
the expression of d'^*:d'^* = —*od^o*,iX fol- 
lows that every self-dual (anti-self-dual) G- 
connection gives a solution to the Yang-Mills 
equation. Since the first Pontryagin number 
Pi (£) is given by Pj (£) = - (1/47 t^) |m tr(R^ A 
R^) by virtue of the Chern-Weil theorem, the 
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action integral satisfies ^{V)^2n^\Pi(E)\ for 
every V in and the equality holds if and 
only if V is self-dual or anti-self-dual. 

Explicit forms have been obtained for (anti-) 
self-dual connections over the 4-sphere by 
many interesting methods [12-15], And 
it has been shown that moduli space of self- 
dual G-connections (i.e., the set of all solu- 
tions to the Yang-Mills equation modulo 
over has the structure of a HausdorlT mani- 
fold of dimension Pi(gp) — dimG for every 
principal bundle P with group G [16, 17], It is 
not yet known whether there exists a Yang- 
Mills G-connection over whose holonomy 
group is an open subgroup of G and which 
is neither self-dual nor anti-self-dual [18]. 

The following is one of the few known facts 
concerning the properties of Yang-Mills G- 
connections: If a Yang-Mills G-connection, 

G = SU(2), SU(3), or 0(3), over S'*" is weakly 
stable, i.e., if the second variation of 6^ is posi- 
tive semidefinite, then it is self-dual or anti- 
self-dual [19]. 
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81 ( 1 . 11 ) 

Constructive Ordinal 
Numbers 

A. General Remarks 

To extend the theory of trecursive functions to 
transfinite ordinal numbers, A. Church and S. 
C. Kleene [1] considered the set of effectively 
accessible ordinal numbers and defined the 
concept of constructive ordinal numbers as 
explained later in this article. Their work 
became the basis of fruitful research by Kleene, 
W. Markwald, C. Spector, and others [2-5]. 

A constructive ordinal number was origin- 
ally introduced as an “expression” in a tformal 
system utilizing the /l-notation. Since such a 
system is “effective,” we can arithmetize it 
utilizing tGodel numbers and assume from the 
outset that each ordinal number is represent- 
able by a natural number. The notations, 
terminology, and theorems mentioned in this 
article are mainly those for constructive 
ordinal numbers of the tsecond number class. 

B. Definition and Fundamental Properties 

We call a set of natural numbers satisfying 
conditions (I) and (II) a system of notations for 
ordinal numbers, and an ordinal number a 
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constructive ordinal number when it is represent- 
able by a natural number belonging to such 
a system of notations: (I) No natural number 
represents two distinct ordinal numbers. (II) 
There are three tpartial recursive functions 
K{x), P(x), and Q{x, n) defined as follows: (i) for 
any natural number x representing X, K{x) 
takes the value 0, 1, or 2 according as X is 
zero, an ^isolated ordinal number, or a dimit 
ordinal number, respectively; (ii) when X is the 
ordinal number Timmediately after an ordinal 
number Y, P{x) represents Y for any natural 
number x representing X; (iii) when X is a. 
limit ordinal number, for any natural number 
X representing X there exists an increasing 
sequence { Y„} of ordinal numbers such that 

= lim„ Y„ and Q{x,n) represents Y„ for each 
natural number n. 

The system called Sj by Kleene is the most 
useful and convenient among systems of nota- 
tions for ordinal numbers. Let Hq be a tprimi- 
tive recursive function of the variable n defined 
by 0o= 1, (n-i- l)o = 2"<^. The fundamental 
notion aeO and relation a < of the system 
S 3 are introduced by the following inductive 
definition: (1) 1 eO; (2) if yeO, then 2 ^eO and 
y <t»2^; (3) if a sequence {y„} of natural num- 
bers has the property that for each n, y„eO 
and y„ < (,y„+i , and if y is a Godel number 
that defines y„ recursively as a function of 
Hq (i.e., y„^{y} (n^) (-^ 356 Recursive 
Functions)), then 3 • 5^ e 0, and for each n, 
y„<o3-5^; (4) if x, y, zeO, x<oy, and 
y < qZ, then x < qz; (5) ae 0, a < gb hold only 
when they follow from (l)-(4). 

Now, for brevity, we write gb for 
(a < gb) V (a = b). The following propositions 
hold for S3: (1) If a<gb, then 6^1; (2) If 
a<gb, then a,beO; (3) If a<g2^, then a^gy; 
(4) If fl <q 3- 5^ then there is a natural number 
n such that a^gy„, where y„^{y}{ng); (5) If 
aeO, then (6) If aeO, then for any 

^number-theoretic function a such that a(0) = 
a and Vn(a(n) =^l^ix{n+\)< g(x{n)), there 
is a k such that a(/c)= 1; (7) For each a, “| 

{a < gu); (8) If c e 0, a ^ gc, and b ^ gC, then 
a<gb or a — b or b<ga. 

Each member a of 0 represents an ordinal 
number |a| as follows: |1| = 0; |2^|==|y|-f 1 for 
yeO; |3-5^| = lim„|y„| for 3-5^eO, where y„ = 
{y}(«o)- Let & be a member of O. Then \a\ < 

\b\ when a<gb; and conversely, for each a< 

\b\, there is a number a such that \a\=a 
and a < gb. Hence the set {a | a < gb) is a 
t well-ordered set with respect to <0, and its 
border type is \b\. The least number i greater 
than |u| for every member a of 0 is the least 
ordinal number that is not constructive. It is 
denoted by (Church and Kleene denoted 
it by oil). There is a subsystem of S3 that is well 
ordered with respect to ^ ^ contains a 
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unique notation for each constructive ordinal 
number a. For such a subsystem we can take 
a IT I set K such that K is trecursive in O (S. 
Feferman and Spector, R. O. Gandy) (^ 356 
Recursive Functions H). 

C. Constructive Ordinals and the Kleene 
Hierarchy 

Let R(x,y) be a ^predicate on natural numbers. 
We write x^^y for any natural numbers x, y 
for which R(x,y) holds. We consider only the 
case where is a ^linear ordering on the set 
Du = {x| 3y(R(x,y) vR(y,x))}. If is a well- 
ordered set with respect to we denote its 
torder type by |R|. (1) For each (constructive) 
ordinal number there is a tgeneral 

recursive (more strictly, tprimitive recursive) 
predicate R such that | R| = a (Markwald, 
Spector, Kleene [3]). (2) Conversely, if R is a 
^hyperarithmetic predicate (e.g., R is general 
recursive), then |R| (Markwald, Spec- 
tor). The following theorems are the most 
fruitful ones in the theory of constructive 
ordinal numbers, and they fully support the 
validity of Kleene’s idea of tanalytic hierarchy. 
(3) The set O is Ul (^ 356 Recursive Func- 
tions H), and so is the predicate a<gb. Name- 
ly, for 0, there is a primitive recursive predi- 
cate R{a, X, a) such that 

aeOo V(x3xR(a, x, a) 

(Kleene [3]). (4) For each ordinal number a< 
the set {u I a ^ |a] } is a hyperarithmetic 
set (Spector). (5) 0 is a tcomplete set for 77/. 
That is, for any 77j set E, there is a primitive 
recursive function cp such that aeE<^(p{a)eO 
(Kleene [3]). Accordingly, O is not a set 
(-^ 356 Recursive Functions H). 

D. Relativization and Extension 

Given a (number-theoretic) predicate Q of one 
variable (or a set of natural numbers), we can 
trelativize to Q the notion of constructive 
ordinal numbers. The least ordinal that is not 
constructive relative to Q is denoted by cof. 

The relativization to Q of the fundamental 
notion aeO and relation a <gb of the system 
S 3 of notations are denoted by a eO^ and a < 
g^b, respectively. Then we can relativize the 
results of the preceding paragraphs to Q. For 
example, as the relativization of (3), we have 
the following: There is a predicate R^{a, x, a) 
which is primitive recursive tuniformly in Q 
such that 

aeO^<^ Va3xR ^{a, x, a). 

When Q is hyperarithmetic, we have no gen- 
eralization of the constructive ordinal numbers 
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by relativizing them to Q, that is, co? = co i 
holds (Spector). Now by relativizing to O the 
concept of constructive ordinal numbers, we 
obtain a proper extension of it (aii^ < and 

then, performing such extensions successively, 
we have a (transfinite) sequence O, O^, 

0*^°, ... . On the other hand, we can extend the 
constructive ordinal numbers to those corre- 
sponding to any like higher number class, 
beyond the second, in which partial recursive 
functions are used at limit levels to provide an 
“accessibility” mapping from previously de- 
fined number classes. There are several exten- 
sions done by Church and Kleene, H. C. 

Wang, D. L. Kreider and H. Rogers, Jr., H. 
Putnam, W. Richter, A. Kino and G. Takeuti, 
and others. Richter [9] has shown that these 
two ways of extending the constructive 
ordinals are equivalent, provided the sets of 
notations for the higher number classes satisfy 
certain natural conditions. Specifically, the 
ordinals of the Addison and Kleene [6] con- 
structive third number class are exactly the 
ordinals less than and the set 02 ^ of nota- 
tions for those is recursively isomorphic to 
0° (Richter). 

E. Constructive Ordinals in Higher Number 
Classes 

Putnum [8] has defined a system C of nota- 
tions for constructive ordinals of the Cantor 
higher number classes, improving that of 
Kreider and Rogers [7], as follows: is the 

set of notations for the ordinal number a; if 
for some a, xeN^, let |x| = /X(^[x6 AJ, which 
is called a hyperconstructive ordinal; and 
let C„ = U { 1 < a). Define i to be a Godel 
number of the identity function, and n to be 
an index in if 3" • 5'e C^, for some t. There 
are four cases: 

Case 1. a^O. Then A„ = {1}. 

Case 2. a = ^-|- 1, where Np is already defined. 
Then N^ = {2'‘\xGNp}. 

Case 3. a is a limit ordinal such that A^ is 
already defined for all C < a, and there exists 
an ordinal /?<« such that 3“-5‘eNp for some 
a, and a partial recursive function / giving an 
order-preserving +cofinal mapping (o.p.c.m.) 
from Cp into C,. Then is taken to be the set 
of all numbers 3“ -5" such that 3“-5‘eA^ (where 
(i is any ordinal with the above property) and 
{«} is an o.p.c.m. from Cp into C^. 

Case 4. a is a limit ordinal such that A^ is 
already defined for all ^ < oc, there is no /? as in 
Case 3, but there is a number a e which is 
not an index in C^. Let /)o be the least // for 
which such a's belong to Np. Then A^ = 
{3‘'-5"|aG A^ and {«} is an o.p.c.m. from 
into Q. 



We have that xeN^ only as required by Cases 
1-4. Finally, let be the least ordinal num- 
ber which is not hyperconstructive, and let 

Extending Kleene’s Hy for yeO (^ 356 
Recursive Functions H), Putnum has defined a 
hierarchy for xeC, and shown that hyper- 
constructive ordinal numbers are uniqueness 
ordinals, i.e., for x, yeC, if |x| = |y|, then 
and Hy are of the same '•'degree of (recursive) 
unsolvability. Ej is the type-2 object (^ 356 
Recursive Functions F) introduced by T. 

Tugue such that E,(a) = 0 if VyS3x[a(jS(x)) = 0]; 
otherwise, E,(a)=l. It is known that cuc = 

CO*'! , where cu*"' is the least ordinal which is 
not the order type of any well-ordering recur- 
sive in Ej (Richter [10]). 
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Contact Transformations 

A. General Remarks 

A transformation of 2n + 1 variables z, x-, p,- 
(j=l,2,...,n). 
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Z = Z{z,Xi,...,x„,pi,P2,...,p„), 
Xj^Xj(z,Xi,X2,.--,x„,Pi,P2,...,p„), 

j=\,2,...,n, 

Pj = Pj{z,X-^,X2,...,X„,Pi,P2,...,P„), 

J=l,2,...,n, (1) 

is called a contact transformation in the («+!)- 
dimensional space with the coordinate 
system (z, , . . . , x„) if the ttotal differential 

equation 

dz — pidXi^~P2dx2 — ---—p„dx„ — 0 ( 2 ) 

is invariant under the transformation, i.e., if 
the equality 

dZ-P^dXi-P2dX2-...-P„dX„ 

^p{dz-pidxi-p 2 dx 2 - ...-p„dx„) (3) 

holds identically for a suitable nonzero func- 
tion p of z,Xj,pj. Here we assume that (1) has 
an inverse transformation. Using Lagrange’s 
bracket 



n 



J-1 



dpj \dXjJ 



—\ = — + p — 
dXjJ dXj ^dz’ 



8pj \dxjj ) 



we see that (1) is a contact transformation if 
and only if \_X^, Z J = Z] = [P^, PJ = 0, 

IPj, X^ =p8jk, \_Pj, Z] = pPj, where <5,^ is tKro- 
necker’s delta. 

From this fact it follows that the composite 
of two contact transformations and the inverse 
transformation of a contact transformation are 
also contact transformations. Since the iden- 
tity transformation Z = z,Xj — Xj, Pj = pj is a 
contact transformation, the set of all contact 
transformations forms an infinite-dimensional 
ttopological group. Given a set of scalars 
Pi , . . . , p„, a pair consisting of a point (z, Xj) 
and an n-dimensional hyperplane z* — z = 

Z"=i Pj{xf — Xj) in an (n -f l)-dimensional space 
is called a hypersurface element, and the set 
of hypersurface elements satisfying (2) is called 
a union of hypersurface elements. Using these 
concepts, we can state that a transformation 
(1) of coordinates z,Xj,pj (j= 1,2, ... ,n) is a 
contact transformation if it transforms each 
union of hypersurface elements into another 
one. Consequently, if two n-dimensional 
hypersurfaces are tangent at a point (z, x^) in 
the (n + l)-dimensional space, their images 
under a contact transformation, which are 
again two n-dimensional hypersurfaces, are 
tangent at the image point of (z, Xj). The name 
“contact transformation” is derived from this 
fact. 

For instance, the tcorrelation with respect 
to a hypersurface of the second order gives a 
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contact transformation. In fact, from the re- 
lation between tpoles and tpolar lines with 
respect to the parabola x^ - 1 - 23 ; = 0 in a plane, 
we have Legendre’s transformation X = —p, 
Y^xp-y, P= -X (p= -x). 

In general, an invertible transformation 
defined by the three relations H(x,y, X, y) = 0, 
dQ/dX + PdQ/dY^ 0, dQ/dx + pdCl/dy - 0 
derived from a function Q(x, y, X, T) is a con- 
tact transformation. The function Q is called 
the generating function of this transformation. 
In this transformation, to each point (xo,yo) 
there corresponds a curve Q(xq, yo> = 

0. These results are valid also in the case of 
several variables. For instance, in an (n-t- 1)- 
dimensional space, a transformation Z — z — 
XiPi - ... -x,p,; Zi -Pi, ..., A:, = p„ A:,+i = 
^v+] » • • • > x„, P^ x^ , . . . , Py X,,, 

Py+i =Pv+i , . . . , P„ = p„ represents a contact 
transformation. Here v is an integer between 
1 and n. In the case n = 2,v = 2, this transfor- 
mation reduces to a Legendre transforma- 
tion; and in the case n = 2, v= 1, it is called 
Ampere’s transformation Appendix A, 
Table 15.IV). 



B. Canonical Transformations 

A transformation of 2n variables Xj, pj {j — 

1,2, ...,n) 

Xj = Xjix ^ , X 2 , . . . , x„. Pi , P 2 , . . . , pJ, 
Pj^Pj{Xi,X 2 , . . . , x„, p, , P 2 , . . . , p„), 

j=l,2,...,n, (4) 

is called a canonical transformation if the ^dif- 
ferential form Y^j^^iPjdXj — PjdXj) is texact in 
Xj, Pj, i.e., if there exists a function U of x, p 
such that 

tiPjdXj-pjdXj) = dU. (5) 

j=i 

Let (1) denote a contact transformation, and 
let X denote a new variable which is different 
from zero. Set Xq = z, Pq = 2, Xj — Xj, Pj — —Xpj 
(j=l,2, ...,n) and define a transformation of 
(2 h 4 - 2) variables by 

Xo{x,p)^Z{z,x, p), 

Pq{x, p) = Po/p{z, X, p), Xj{x, p) = Xj{x, p), 

Pj(x, p) = - Po Pj(z, X, p)/p{z, X, p), 

)=l,2,...,n, (6) 

where x, p, x, and p denote points (xj, X 2 , 

(pi,p 2 , ...,p„), (xo,Xi,...,x„), and (po, 

Pi , . . . , p„), respectively. Then the identity (3) 
when multiplied by 2 on both sides becomes 

PfydXfj -\- Pi dX^ -j- . . . -L P„ dX„ 

-P(jdxQ + Pi dXj + ...+p„dx„. 



( 7 ) 
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Thus the transformation (6) represents a 
canonical transformation ([/ = constant). 
Moreover, from the definition the relation 
(6) is homogeneous, i.e., for all ^¥^0, 

Xj{x, fip) = Xj{x, p), Pjix, pp) = pPjix, p), 

; = 0 , 1 , 

Conversely, let a transformation of {2n + 2) 
variables 

Xj = Xj{x,p), Pj = Pj(x,pl 

y-0, (8) 

be a homogeneous canonical transformation. 

Define a transformation of {2n+ 1) variables 
z,Xj,pj{j=^\,2,...,n) by 

Z(z,x,p) = Xq(z,Xi^, ... ,x„,2, —Xpi,..., —Xp„), 
l/p{z,x,p) 

=\po(z,Xi, ...,x„,A, ~Xpi,..., -Xp„), 

A 

Xj(z,x,p)-^Xj(z,Xi,...,x„,X, -Xpi,...,-Xp„), 
Pj{z, x,p) 

= —^Pjiz,Xi, ...,x„,X, —Xpi, ..., —Xp„), 

A 

j^l,2,...,n. (9) 

Since the necessary and sufficient condition for 
(8) to be a homogeneous canonical transfor- 
mation is that (7) hold, it follows that 

~{dZ-PidXi-...-P„dX„) 

P 

= A(dz~Pi dXi — ...—p„dx„). 

Thus the transformation (9) represents a con- 
tact transformation. Therefore the most general 
contact transformation of (2n-l- 1) variables 
and the most general homogeneous canonical 
transformation are identical concepts, differing 
only in the choice of notation. 

The necessary and sufficient condition for 
(4) to be a canonical transformation is ex- 
pressed by the relations 

(Xj,X,)^0, (Xj,P,) = Sj„ (Pj,P,) = 0, 

J,k = l,2,...,n, (10) 



From this fact it follows that the set of all 
canonical transformations forms an infinite- 
dimensional topological group. 

Suppose that we can take Xj, Xj, ,x„, Xi, 
X 2 , ...,X„ as 2n independent variables. Denote 
by Oi(x, X) the function U of (5) represented 
by X, X. By means of this Qj, 2n functions 
which give a canonical transformation (4) are 
derived from the relations 



Pj=^~ 



dX; 






5Qi 



j=l,2, ...,n. 



Conversely, for Q(x,X) such that detl - — ;; — 

\dxjdX^ 

/ 0, a transformation determined by the 
relations 



Pj^~ 



dn 

dxt ' 



P = 



dQ 

dx/ 



j — 1,2, ... ,n, 



represents a canonical transformation. For 
a general canonical transformation, x^, X 2 , 

. . . , x„, Xi , X 2 , . . . , X„ are not necessarily 2n 
independent variables. But for a canonical 
transformation there exists n new variables 
X,-^ ,Xi^,...,Xi^,Pj^,..., Pj^_^ such that x j , 

X2,...,x„, X,-j, X(^, ...,Xi^, Pj^, Pj^, ...,Pj^_^ 

are 2n independent variables, where (ij, 1 * 2 , 

■ • ■ , ik\ iJi Jz’ ■ • • Jn-k) is a partition of the set 
(1, 2, . . . , n) to two disjoint subsets. Denote 
by Q 2 the function C — 2?= iXj. /*,■.. Then the 
transformation (4) is derived from relations 



^^2 . I 



X = 1=1,2,. ..,n-k, 

j, ’ 

P.^^, 1=1,2,... ,k. 

" ax,; 



These functions Qj, Q2 are called the generat- 
ing functions of a canonical transformation. 

Consider a transformation of 2n variables 
depending smoothly on a parameter t: 



Xj = Xj(t,Xi,X2,...,X„,Pi,P2,...,P„), 
Pj^Pj{t,Xi,X2,...,X„,Pi,P2,.--,P„), 

j=l,2,...,n. (11) 



where (•, ) is Poisson’s bracket, i.e., for a pair 
of functions /(x, p), g{x, p). 



n 



(/,0)=I 

j=i 



dxjdpj dxjdpjj 



Denote Poisson’s bracket in the coordinate 
system X,,X 2 , ...,X„, P^,P 2 , by (•, •)', 
and for f(x,p) and g{x,p) denote their trans- 
formations by /'(X, P) and g'{X, P). Then the 
relation (10) is equivalent to 



if,9) = if\g')' for all / and gf. 



If (11) represents a canonical transformation 
of 2n variables for each t, then Xj and Pj, 
j = l,2, ...,n, satisfy. tHamilton’s canonical 
equations 



dXj 

dt 



dH 



(t,X,P), 




dH 

'dX, 



it,X,P), 



j=l,2,...,n, (12) 



for some ^Hamiltonian function H{t,x,p). 
Conversely, for any solution X,, Pj of Hamil- 
ton’s canonical equations (12) depending on 
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parameters x^,X 2 , Pi,P 2 > ••• if 
transformation (1 1) is a canonical transforma- 
tion for t = 0, (11) represents a canonical trans- 
formation for each t. By a canonical trans- 
formation (4), the solutions of Hamilton’s 
canonical equations 



dXj dH 

dt dpj 



(t, x,p). 




8H 

dXj 



it,x,p), 



1,2, (13) 



are transformed to the solutions of 



dXj_dK 
dt dPj 



it,X,P), 





it,x,P), 



7=1,2,. (14) 



where the new Hamiltonian function is given 
by K{t,X,P)^H{t, x,p). 



C. Applications to the Integration of 
Differential Equations 

Contact transformations have applications to 
the integration of differential equations since 
they transform each union of surface elements 
into another one. 

As an example, we shall describe an outline 
of their application to a partial differential 
equation of the first order 

F{x, y, z,p,q) = 0; 

p = dz/dx, q = dz/dy. (15) 

For this purpose, we regard (15) as an equation 
defining unions of surface elements and trans- 
form it into a simpler equation by means of a 
contact transformation. If the transformed 
equation can be solved, then the solution of 
the original equation can be obtained by 
means of the inverse transformation. Now, let 
z — co(x,y,a,b) be a tcomplete solution of (15). 
Then ( 1 5) is reduced to Z — c = 0 by the trans- 
formation generated by the function 



q(x, y, z; a) from the solution G(x, y, z, p,q) = a 
of (16) and f = 0, then the total differential 
equation dz = pdx -I- qdy is tcompletely inte- 
grable, and the ^general solution of this equa- 
tion is a complete solution of (15). Also, if we 
know two independent solutions G(x, y, z, p, q) 
= a, H{x, y, z,p,q) = b of {1 6) such that [G, H] 
= 0, we can obtain a complete integral of (15) 
by eliminating p, q among the three equa- 
tions F — 0, G = a, H = b. This method is called 
the tLagrangc-Charpit method, which is ap- 
plicable also to the equation F(z, x j , . . . , x„; 

Pi > ■ • • j Pn) = 0 (^ 322 Partial Differential Equa- 
tions (Methods of Integration)). 



D. Applications to Analytical Dynamics 



Consider Hamilton’s canonical equations (13). 
An example of system (13) is the equation of 
motion for a dynamical system which is de- 
rived from i5jL(xj,Xj)dt — 0. Here the Xj denote 
generalized coordinates, pj — dL/dxj and FI = 

T]=tPjXj-L. 

Let S(xi,X 2 , ... ,x„, Xi,X 2 , ■■■,X„) be a 
solution depending on parameters AjjXj, 
...,2f„ of the tHamilton-Jacobi equation 



Hi t,xi,x 2 , ...,x„. 



ds 

dxi 



dS 

dx. 



dX„ 



= K{Xi,X2,. 
such that det 



d^S 

dXjdX^ 



(17) 

7^0. Then by a canon- 



ical transformation of a generating function S, 
i.e., by a transformation determined by the 
relations 

SS ^ dS . ^ ^ 

the system (13) is transformed to (14). Here 
K{t,X,P)^HUx,-^^ = K{X). 



Q = Z — z + (o{x, y,X,Y) — c = 0, 



Then (14) becomes 



where c is a constant. In this equation the 
solution = a, Y—b,Z — c,aP + liQ = 0{a,b, 
c, a, are constants) plays an important role, 
and this line element will be called a charac- 
teristic line element. The characteristic line 
element satisfies equations that can be trans- 
formed by means of the inverse transformation 
into Charpit’s subsidiary equations for (15): 

dx dy dz 

dF/dp dF/dq pdF/dp + qdF/dq 

= . (16) 

dFIdx + pdFtdz dFjdy + qdFjdz 

Consequently, if we have p = p{x, y, z; a), q — 



dX: ^ dP; ^ ^ 

— ^ = 0, ~^= (A), 7=l,2,...,n, 

dt dt dXj 

and the solutions are 



Xj(t) = X/0), P/t) = P/0)-t 



dK 

Ix, 



X=X(0) 



In particular, a transformation that makes 
K = 0 is called a transformation to an equilib- 
rium system. Thus solving Hamilton’s canon- 
ical equations reduces to finding a tgeneral 
solution of the Hamilton- Jacobi equation (17). 

When the variables of the Hamilton-Jacobi 
equation (17) are totally separable we can 
obtain a general solution of (17) by tquadra- 
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tures. Then one can integrate by quadratures 
the system of equations (13). For many im- 
portant problems exact solutions are obtained 
by this method 271 Mechanics). 

Furthermore, every mapping in an optical 
system is a canonical transformation; in optics, 
the quantity corresponding to S is called an 
eikonal(^ 180 Geometric Optics). 



If {h„} (n = 0, 1, ... ) and {c„} (n= 1,2, ... ) are 
infinite sequences, the expression 




^2 



...+ 
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Continued Fractions 

A. The Notion of Continued Fractions 

Let {b„} {n — 0,...,m) and {c„} (n= 1, ...,m) be 
finite sequences of elements in a tfield F. A 
fraction of the form 



^0 + 



bi + 



b2 + ... 



Cl 

Cl 



...+- 



^m-l + 



is called an infinite continued fraction. By 

analogy with the finite case, it is expressed by 



, , ''2 '-n 

On 3 

hi + &2 T • • • + 3" . . . 

orby(^o+ r 
b„ 



For an infinite continued fraction, the quantity 



— ^0 + , ,7 

hi + •••+&„ 



is called its nth convergent, ho is called the 
initial term, and c„ and b„ (n^ 1) are called the 
partial numerator and partial denominator, 
respectively. If F is a ttopological field (e.g., 
the real or complex number field) and the se- 
quence {/c„} of its elements converges, then the 
infinite continued fraction is said to converge, 
and the limit is called its value. 

A finite or infinite continued fraction in 
which c„ (n ^ 1) are all equal to 1, h^, is a ra- 
tional integer, and h„ (u ^ 1) are all positive 
rational integers is called a simple continued 
fraction. It is expressed by [ho,hi, ...]. h„ is 
often called the nth partial quotient. In the 
following paragraphs we shall mostly discuss 
simple continued fractions. 

For a real number x we mean by [x] the 
greatest integer not exeeeding x. [ ] is called 
the Gauss symbol. Let cu be any given real 
number, and put 



is called a finite continued fraction. It expresses 
an element in the field F unless devision by 0 
occurs in the process of reduction. Symboli- 
cally, it is also written in the forms 



ho3- 



G G c^ 
hi 3-h2-l l-h„,_i +b„ 



ho 3- 77 — f 77 — f ’ 
ihi |h2 



^m — \ \ \ 



, ^1 : ^2 ■ -Cm-l 

ho + — + — +---+T 

hi bj h„,_i 




Cj C2 

bn, , 

hj h2 h„_i h„J 
etc., or more briefly, 



1 

fu-ho3 , ho = [(u]; 

CO, 



1 

co„ = h„H , h„ = [co„], n=l,2, .... 

Then an expansion of co into a simple con- 
tinued fraction 



CO — hQ -|- 



1 1 

hj -|- ••• -|- h„ -!-••• 



is obtained. If co is irrational, this expansion is 
determined uniquely. Conversely, any infinite 
simple continued fraction converges to an 
irrational number. If co is rational, the process 
is interrupted at a finite step (co„, = h„), result- 
ing in 
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An alternative representation of a rational 
number by a simple continued fraction is given 
by replacing above hy 
Examples of infinite simple continued frac- 
tions are 

e^>P + l_ 1 1 

g2/p_j ’ 

where p is a natural number (J. H. Lambert), 
and 



^111 111 

c — 2 H — — — — — — 

l-l- 2 -|-l-|-----|- 1 -f- 2n 1 + • • • 

(L. Euler). 



B. Convergents 



Let the nth convergent of a simple continued 
fraction be expressed in the form of an irreduc- 
ible fraction 



Qn 





n^O, 



and for convenience put f _2 = 0 , P_i = 1 , 

0-2 = 1, 6-1 = 0. Then we ha ve the recurrence 
relations 



P„ = b„P„_^+P„ 



Qn — ^nQn-l +0n-2> 



whence follows 



n^O, 



n>-\. 

Any simple continued fraction represents a 
real number oj which satisfies 



quence {P 2 n/G 2 n} (resp. {E 2 „+i/ 02 „+i}) is 
monotonically increasing (decreasing). Ap- 
proximation by convergents is shown in the 
relations 



CO — 



Qn 






1 



0n + l Qn 

Qn ^ n%Qn.,Qn 



s 

1 


< 


Pnr 

CO 


1 0«l 




6n-l 



P ® (-1)" 

"=lim-^ = ho+X ' 



In particular, Ico — P„/0„| < 1/0^. On the other 
hand, if P, 0 are relatively prime positive 
integers with jco — P/0| < 1/20^, then P/0 is a 
convergent to co (A. M. Legendre). There are 
several results concerning the measure of 
approximation by convergents. Any co satisfies 
|co-P„/0„l < l/(y50^) for infinitely many n, 
while there exists an co which satisfies |cu — 
E„/0„| < iHXQl) for only a finite number of n 
provided that 2>y5(e.g., 00 = ^5, A. Hur- 
witz); at least one of two adjacent convergents 
satisfies the inequality |oj — P„/0„| < 1/(20^) (K. 
T. Vahlen); at least one of three consecutive 
convergents satisfies the inequality |cu — P„/0„| 
< 1 /(a/ 50^) (E. Borel); |w-P/0| < 1/^802 
has infinitely many rational solutions P/0 
whenever co is not ^equivalent to l)/2 

(Hurwitz). One calls an irrational number co 
badly approximable if there is a constant c = 
c(co)>0 such that |m — P/0| >c/0^ for every 
rational P/0, co is badly approximable pre- 
cisely if the partial quotients of the continued 
fraction expansion of co are bounded. In partic- 
ular, real quadratic irrationals have bounded 
partial quotients (-^ Section C; 182 Geometry 
of Numbers F). 



® -f P ,-i)/(co„+i Q„ + 0„-i) 

in terms of the notation defined in this and the 
previous section. 

In particular, let ao be an irrational number. 
Then each of the fractions 



Pj,^> P„-2 + feP„-i 

0 ^ 0„-2 + ^ 0 «- l ’ 






which are inserted between two conver- 
gents P„- 2 /Q „-2 and PJQ„ = {P„-2 + b„P„-r)/ 
iQn-2 + b„Q„-i), is called an intermediate con- 
vergent, while the original convergent P„/0„ is 
called a principal convergent. 

If a fraction P/0 approximating an irra- 
tional number co satisfies | co — P/Q \<\oo — p/q\ 
for any other fraction p/q with q^Q, then it is 
said to give the best approximation. The frac- 
tion giving the best approximation of co is 
always a principal or intermediate convergent 
P[f V0n** of CO with k > bJ2 or k = bJ2, Q„ > 

0n-if^«- 

The convergents satisfy the relation P„/0„ — 
Pn-JQn-^ =(-l)"“^ 0 « 0 « i; hence the se- 



C. Quadratic Irrationals 

If an infinite simple continued fraction 
[bo, hi , • . . ] satisfies = b^+, (v = 

0 , 1, 2, . . . ), it is called a periodic continued 
fraction and is denoted by the symbol 
[bo,bi, ...,b„,, ...,b„+fc_i]. According as m = 0 
or m > 1 , the periodic continued fraction is said 
to be pure or mixed, and the sequence 
b^+i , . . . , is called a period. In order for 

the continued fraction of co to be periodic, it is 
necessary and sufficient that co be a quadratic 
irrational number, i.e., a root of ax^ + bx + c 
= 0 with rational integral coefficients a, b, c 
and nonsquare discriminant b^ — 4ac (J. L. 
Lagrange). In order for co to be represented by 
a purely periodic continued fraction, it is neces- 
sary and sufficient that co be an irreducible 
quadratic irrational number, i.e., that co satisfy 
co> 1 and 0>co'> — 1 , where co' is the conju- 
gate root of CO (E. Galois). In order for co to be 
equal to the square root of a nonsquare ra- 
tional number, it is necessary and sufficient 
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that its continued fraction be of the form 
lho,bi, ... 2b(j] (Legendre). 



D. Application to Pell’s Equation 

Let ax~hy=\ ((a, 6) = 1 ) be a t Diophan- 
tine equation of the first degree, and a/b — 
lbo,bi,...,bJ = PJQ„. Since 
Pm-iQm = i — ^)"'~^’ a solution of the equation 
is given by Xo = (-l)'"“^C!„, yo = (-l)'"“‘P„. 
The general solution is then represented in the 
form XQ + bt, yo + (it (reZ). This method of 
obtaining a solution is essentially the same 
as the method which uses the tEuclidean 
algorithm. 

Pell’s equation — Dy^ = 1 (D is a non- 
square integer > 1) was solved by Lagrange in 
terms of continued fractions. If the length of 
the period of D is k, all positive solutions of 
Pell’s equation are given by x = P 2 v)i-i. y = 

Qzvk 1 if ^ is odd, and by x = Pv/t-i. y = 
if k is even (v = 1, 2, . . . ), where P„/Q„ denotes 
the nth convergent of the continued fraction 
expansion of Incidentally, x — P^ 2 v i)k-i, 

I' = G( 2 v-i)ic-i (v= 1, 2, . . . ) are the positive 
solutions of x^ — Dy^ — —\ provided that k 
is odd. There are no solutions of x^ — Dy^ — 

± 1 other than x„,y^,(v -1,2,...) given by 
(xj -i-^/Dyi)'' = x„ + ,yD y^, where x,, is 
the least positive solution. For instance, the 
least positive solution of x^ — 21 ly^ = 1 is x = 
278,354,373,650, y= 19,162,705,353. 

Lagrange made further use of continued 
fractions in order to obtain approximate 
values of roots of algebraic equations. The 
method is especially useful for precise compu- 
tation of neighboring roots. 

The theory of continued fractions may be 
investigated geometrically making use of lat- 
tices ([2, 3]; 182 Geometry of Numbers) (F. 

Klein, G. Humbert). For instance, a measure 
of approximation of PJQ„ to co in Diophantine 
approximation is represented by the closeness 
of a lattice point (P„, Q„) to the straight line y 
= cox on the plane. 



E. Continued Fractions with Variable Terms 



There are few results on continued fractions 
with variable terms. It is noteworthy that 
from the expansion of tan z into a continued 
fraction 



z — z 

tanz=- - 

1+ 3 + 



-z — z 

“5“ + ^ + 



(Lambert), the irrationality of n and of tanz 
for rational z{^Q) can be deduced (A. 
Pringsheim). 

Among continued fractions with variable 



terms, those of the form 



[ao,a„z]f 



Uq u,z a„z 

1 + 1 +•••+ 1 -h--- 



are called normal continued fractions. Let the 

convergent of such a continued fraction be 
P„(z)/Q„(z), and for convenience put P_i(z) — 
0, Q j(z)== 1. Then we have the recurrence 
formulas 



^«(Z) = 'Pn-l (^) + 

There are the further relations 



P„(z)(2„-i (z) - P„~i(z)QJz) - ( - 1 )”z" n flv, 



00 ( l)”z" ” 

n = oQn~l{^}Qn\^) v=0 

where the latter is formal. Let the tpower series 
expansion of the nth convergent of [aQ,a„z^f 
be 



PM 

Qni^) 



CO 



= E 

v = n 



n ^ 1. 



Then b^^ = b„^ (0^ v^m^n). If has a 

power series expansion about the origin, then 



[ao,a„z]f = E 
«=o 

If the supremum ^ of is finite, then 

converges uniformly for |z|<(l/4) g, 
and hence it represents an ^analytic function 
which is holomorphic in |z| < (1/4)0. 
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Continuous Functions 

A. General Remarks 

The notion of continuity is defined for a map- 
ping or a function f:X^Y from a topolog- 
ical space A to a topological space Y 425 
Topological Spaces G). In the present article, 
however, we are concerned mainly with the 
case where both X and Y are tmetric spaces 
with the distances px and py, respectively. The 
most usual case is A = R" (Euclidean space), 
T=R (real numbers). 

A function /: A -» T is said to be continuous 
at a point XoS A if for every positive number 
we can select a suitable positive number d 
(depending on £ and also on Xo) sueh that 
P;f(x,Xo)<<5 implies py(/(x),/(xo))<£. This is 
equivalent to the condition that x->Xq implies 
/(x)^/(xo) (^ 87 Convergence). We call / 
continuous (on A) if it is continuous at every 
point Xo of A. If for every positive number e, 
we can select a suitable positive number S 
independent of x and y such that p;^^(x, y) < 5 
implies py(f(x),/(y))<£ for all x, ye A, we call 
/ uniformly continuous on A. 

The tsupremum <x)((5) of py(f(x),f(y)) for 
X, ye A satisfying px(x,y)<S is called the 
modulus of continuity of the function / in A. 
Uniform continuity means that a>(6)->0 for 
<5-^0. 

If co(5X for suitable constants M, a> 

0, that is, if the inequality py(/(x),/(y))^ 
M(px(x,y)y holds for x, ye A, then / is said 
to satisfy the Holder condition of order a, also 
known as the Lipschitz condition of order a. 

If 1, this condition is called simply the 
Lipschitz condition. A function satisfying one 
of these conditions is uniformly continuous. 
The family of functions satisfying the Lip- 
schitz condition of order a is sometimes de- 
noted by Lip a. 

In general, the composite function go/: 

X -^Z is continuous if both functions f:X^Y 
and g .Y^Z are continuous. If the ranges of f, 
g are both the real field R (or the complex field 
C, or more generally a ttopological field), then 
/±p, /p are continuous if / and g are contin- 
uous; and fig is continuous provided that 
g{x)^0. If R is the range of both / and g, then 
min(/,p) and max(/,p) are continuous when / 
and g are continuous. If A is tconnected (for 
example, an interval / in R) and if / is contin- 
uous, the image /(A) is also connected. 

B. Continuity from One Side 

In this section, we always assume that the 
domain A is an interval / in R and / is a func- 
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tion from / to a metric space Y. A point Xq 
of A is called a discontinuity (point) of the 
first kind of / if both limits lim^|j,^/(x) and 
Iim;<,|j,p/(x) exist in Y and are different. Then 
we say also that / has a jump (or gap) at Xq. If 
these two limits exist and have the same value, 
then / is continuous at Xq. 

We say that / has a discontinuity of at most 
the first kind at x^ if / is continuous at Xo or if 
Xq is a discontinuity of the first kind for f 
(Sometimes the phrase “discontinuity of the 
first kind” is used to mean “discontinuity of at 
most the first kind.”) A discontinuity point of / 
(i.e., a point at which / is not continuous) that 
is not of the first kind is called a discontinu- 
ity point of the second kind. When lim^|j^^/(x) 

= /(Xq), we call / right continuous (or con- 
tinuous from the right) at Xq. In this case, 
lim^l^^/(x) need not exist. Replacing x|Xo by 
x^Xq, we can similarly define the concept of 
being left continuous (or continuous from the 
left). If a function / has a finite number of 
discontinuity points of the first kind in the 
interval [a,/i] and is continuous at all other 
points, we call / a piecewise continuous func- 
tion in [a, 6]. 

C. Semicontinuous Functions 

In this section, we assume that the domain of 
the functions is a subset £ of a metric space A, 
and that the range is the set of real numbers 
extended to include ±oo. Let x be a point in 
the closure of E. We denote by M(x, ^) and 
m(x, 3), respectively, the supremum and the 
infimum of the values of a given function / in 
the (5-neighborhood of x. We put 

M(x) = limM(x, (5), m(x) = limm(x, 5), 

a-»o 5-0 

and call them the upper limit function and 
lower limit function, respectively. We have 
— 00 <m(.x)<M(x)^ +00. If M(xq)= /(Xq) at 
XqgE, then / is called upper semicontinuous at 
Xq. If m(xo)= /(Xo) at XqeE (i.e., if — / is upper 
semicontinuous at Xq), then / is called lower 
semicontinuous at Xq. The function with one 
of these two properties is said to be semicon- 
tinuous at Xq. 

Either of the following two conditions is 
necessary and sufficient for the function / to 
be upper semicontinuous at XqGE: (1) /(Xq) = 
+00, or for every constant 2 such that /(xq)< 
X, there exists a ^-neighborhood such that 
Mixo,3)<A. (2) For every sequence x„ of E 
converging to Xq we have limsup„^„/(x„) = 
/(Xo)- 

A function / is called upper (lower) semi- 
continuous in E if it is upper (lower) semicon- 
tinuous at every point xeE. A necessary and 
sufficient condition for the upper semicontinu- 
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ity of the function /(x) in E is that {x|/(x)<cc} 
be a trelatively open set in E for every real 
number a. We can define semicontinuity for 
functions on a topological space by using this 
latter property. 

A real-valued function /(x) is continuous at 
X(,6£ if and only if it is upper and lower semi- 
continuous at Xq and /(Xq) is finite. A function 
/(x) is continuous on E if and only if it takes 
finite real values on E and both {x|/(x)<a} 
and {x!/(x)>a} are relatively open in E for 
any real number a. When E is ^compact, an 
upper (lower) semicontinuous function on E 
attains its (supremum) infimum at a point in 
E. In particular, a continuous function on a 
compact set E is bounded and assumes its 
maximum and minimum on E (Weierstrass’s 
theorem). Furthermore, if E is connected (e.g., 
the interval / in R), it follows from the con- 
nectedness of the image f{E) that if ot, j5e 
f{E) and y lies between a and then yef{E) 
(intermediate-value theorem). 

A real- valued function /(x) on a set £ of R 
satisfies the Lipschitz condition if it is tdiffer- 
entiable and the derivative is bounded. Such a 
function is also ^absolutely continuous, con- 
tinuous, and of tbounded variation. (For the 
polynomial approximation of real continuous 
functions ^ 336 Polynomial Approximation.) 

The limit function /(x) of a monotone de- 
creasing sequence of upper semicontinuous 
functions /„(x) is also upper semicontinuous. 
The limit function /(x) of a uniformly converg- 
ing sequence of continuous functions is con- 
tinuous. (Regarding the tequicontinuous family 
of functions ^ 435 Uniform Convergence.) 



D. Baire Functions 

The limit function of a pointwise converging 
sequence of continuous functions defined on a 
metric space X is not necessarily continuous. 
R. Baire {Ann. Mat. Pura AppL, (1899)) intro- 
duced the notion of Baire functions as follows: 
He named continuous functions the functions 
of class 0. Then he called a function that is a 
pointwise limit of a sequence of continuous 
functions a function of at most class 1. A func- 
tion is said to be of class 1 if it is of at most 
class 1 and is not of class 0. He similarly de- 
fined the notion of class n for arbitrary natural 
number n. 

Further, a function is said to be of at most 
class CO if it is a pointwise limit of a sequence of 
functions of class n^ for a sequence of natural 
numbers n^. A function is said to be of class 
CO if it is of at most class co and is not of class 
n for any finite number n. In general, using 
ttransfinite induction, we can define the notion 



of functions of class for an arbitrary tordinal 
number ^ [4]. 

All these functions are called Baire functions. 
Actually, there is no function of class for an 
uncountable ordinal number If X is a tper- 
fect set in Euclidean space, then there is actu- 
ally a function defined on X of class ^ for an 
arbitrary countable ordinal number Here- 
after, we shall be concerned with this case only. 

If X has the cardinality of the tcontinuum, 
then the set of all Baire functions on X has the 
cardinality of the continuum. On the other 
hand, the cardinality of all functions on X is 
actually greater than that of the continuum. 
Hence there exist functions that are not Baire 
functions on X. A function is a Baire function 
if and only if it is tBorel measurable (H. Le- 
besgue). Therefore a necessary and sufficient 
condition for a function / to be a Baire func- 
tion is that the set (x | /(x) > a, x e X} be a Borel 
set for any real number a (^ 270 Measure 
Theory J). The limit of a countable sequence of 
Baire functions is also a Baire function. If/(x) 
and ^(x) are of at most class a on X, then the 
following functions are also of at most class 
a: l/(-^)l. /(^) ± S(^), /(^) • 9{x), and f{x)!g{x) 
(provided that g{x)i^0 on X). 

The condition that a function / is of at most 
class 1 on X is equivalent to either of the 
following two conditions: (1) For any closed 
subset F of X, the restriction /* of / to £ has 
a continuity point in F. (2) For every real 
number a, the set {x|/(x)<a,xeX} is an 
set (Baire). In a tcomplete metric space, a 
necessary and sufficient condition for a func- 
tion / to be of at most class 1 is that the set of 
continuity points be dense in X. 

For example, the Dirichlet function, which 
takes the value 1 at rational points and 0 at 
irrational points, is expressed as 

lim I lim (cos v! 7rx)^'‘ ), 

v-*oc \ Ic-^oo / 

which is of class 2. A function f{x,y) of two 
real variables that is continuous in each vari- 
able X and y separately is a function of at most 
class 1. 
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Continuous Geometry 

A. General Remarks 

The structure of a ^projective geometry is 
determined by the tlattice (lattice-ordered set) 
of subspaces of the projective space. For this 
reason, this lattice itself is sometimes called 
a projective geometry. The concept of con- 
tinuous geometry was introduced by J. von 
Neumann as an abstraction of lattice-theoretic 
properties from a special class of tlattices 
(lattice-ordered sets) which he encountered in 
his research on toperator rings in Hilbert 
spaces [1]. Continuous geometry contains 
projective geometry as a special case when the 
dimension is discrete; but more usually the 
lattices are of continuous dimension. 

A continuous geometry is a tcomplete and 
tcomplemented tmodular lattice L (^ 243 
Lattices F) that satisfies the following prop- 
erty and its dual (both called properties of 
continuity): For any element a of L and any 
subset W oi L which is twell ordered with 
respect to the ordering in L, we have a fl sup w 
= sup(an w) (wG W). The ^center Z of the 
lattice L is called the center of the continuous 
geometry L, which is said to be irreducible 
when Z has no elements other than the tleast 
element 0 and the tgreatest element /; other- 
wise L is said to be reducible. A reducible 
continuous geometry is isomorphic to a sub- 
lattice of a tdirect product of irreducible con- 
tinuous geometries. 

On any continuous geometry L, there can 
be defined a function d{x) whose values belong 
to a complete lattice-ordered linear space M 
and which satisfies the following four con- 
ditions: (1) d(x)^Q; (2) d{x)~d{y) implies the 
existence of a common complement of x and y; 
(3) d(xUy)-|-d(xny) = d(x)-|-d(y); (4) supd(w) = 
(f(supw) (w6 W) for any subset IF of L which 
is well ordered with respect to the ordering in 
L. Such a function d{x) is called a dimension 
function on L. Irreducibility of L is equivalent 
to the property that a real- valued dimension 
function can be introduced; in this case, if d{w) 
takes only a finite number of values, then L is 
a finite-dimensional projective geometry; on 
the other hand, if d{w) takes every number in 
the closed interval [0, 1], then L is called a 
continuous geometry in the strict sense. An 
example of the latter can be constructed as a 
limit of a sequence of projective geometries of 
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increasing dimensions; another example is the 
lattice of projection operators of the tvon 
Neumann algebra of type Up If a group G of 
^automorphisms of L is given, there can be 
introduced a generalized dimension function 
which is invariant under G satisfying slightly 
weaker conditions (T. Iwamura [3]). 

B. Representation of Continuous Geometry 

A ring R is called a regular ring if it has a unity 
element and if, for any element a of R, there 
exists an element x in R such that axa — a. A 
continuous geometry L is isomorphic to the 
lattice (with as its ordering) of tprincipal left 
ideals of a regular ring R provided that d{I) 

= n ■ d{x) {n ^ 4) for some natural number n 
and some element x of L. The decomposition 
of R into a tdirect sum of ideals corresponds 
to the decomposition of L into a direct prod- 
uct of lattices. The condition that L is irreduc- 
ible and finite-dimensional is equivalent to 
the condition that R is a matrix ring over a 
tskew field; when these conditions hold and L 
is considered as a projective geometry, then 
the coordinates are given by this skew field. In 
continuous geometries, join and meet are often 
denoted by the symbols for sum and product, 
respectively, and sometimes a direct product of 
continuous geometries is called their direct 
sum. Sometimes, the requirement that a con- 
tinuous geometry be complete is weakened to 
the requirement that it be tconditionally o- 
complete. 
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A. General Remarks 

The classical theory of automatic control 
mostly deals with linear feedback control 
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systems with single input and single output. 
Mathematical structures of such systems must 
be, in principle, described in terms of ordinary 
linear differential equations with constant 
coefficients. Hence control engineers use block 
diagrams to describe systems, and operational 
calculus based on tLaplace transforms to ob- 
tain response characteristics 240 Laplace 
Transform). Thus the input/output relation of 
a system is described in terms of transfer func- 
tions. The main objectives of control theory 
are to ensure system stability, to maximize 
closed-loop response characteristics by choos- 
ing the best feedback, and to obtain a desir- 
able transient response to an impulse or step 
input. One of the remarkable contributions to 
classical control theory is Nyquist’s criterion 
for stability testing of linear feedback systems 
[1]. The test consists of plotting the Nyquist 
diagram of a transfer function in the frequency 
domain (complex plane), and differs essentially 
from the tRouth-Hurwitz stability test for 
linear differential equations with constant 
coefficient. Classical control theory was almost 
complete by the end of World War II. 

Revolutionary technological innovations in 
electronics and computers and the invention of 
new control instruments and systems devel- 
oped after World War II have opened the way 
to modern control theory. Around 1960, three 
remarkable contributions were made con- 
currently; they are dynamic programming 
(proposed by R. E. Bellman [2]), Pontryagin’s 
maximum principle (L. S. Pontryagin et al. 

[3] ), and linear system theory (R. E. Kalman 

[4] ). The first two give rise to mathematical 
tools to solve optimal control problems and to 
design optimal controllers or regulators. In 
eontrast to the classical theory of control, 
optimal control problems are formulated in 
terms of the system of linear or nonlinear 
multivariable differential equations with multi- 
input forcing terms called control variables. 
This leads to the state-space approach, which 
has become ubiquitous in modern control 
theory. Linear system theory derives from the 
concepts of controllability and observability. 
The former was introduced by Pontryagin et 
al. [3] and later modified by Kalman [5], and 
the latter was introduced by Kalman [5]. 
Controllability and observability are con- 
cerned with the interrelation between internal 
states of a system and its inputs and outputs. 
Hence linear system theory, profoundly related 
to the state-space approach, remains a prin- 
cipal theme of modern control theory. More 
advanced control theories, such as adaptive 
control or learning control, are mostly de- 
scribed and formulated in the framework of 
linear system theory. 

Modern control theory has stimulated 



the development of tcybernetics, the science 
of control and communication, and con- 
trol theory is important in the information 
sciences. In fact, current control theory has 
many features that might be considered to 
belong as well to other areas, such as mathe- 
matical programming (^ 264 Mathematical 
Programming), operations research (^ 307 
Operations Research), game theory (^173 
Game Theory), prediction and filtering theory, 
digital signal processing, circuit theory, and 
computer or microprocessor technology. 

Current control theory embodied in modern 
instrumentation and eomputer technology has 
a vast range of applications. We do not con- 
sider these applications here; we confine our- 
selves to some important theories that con- 
tinue to be major topics in control theory. 



B. Linear Dynamical System 



Let the state of a physical process to be con- 
trolled be represented by a real n-dimensional 
column vector x(t)==(xi(f), ... ,x„(f))^eR", 
where ^ denotes the transpose of a vector or 
matrix. If the state is determined by a differen- 
tial system with input u(t) = (uj(t), ...,u,(t))^e 
R'' and output y(0 = (yi(0, 

d 

—x(t) = A(t)x(t) + B(t)u(t), 
dt 

y(t) = C(t)x(f), (1) 

then the process is called a linear dynamical 
system or simply a linear system. The dimen- 
sions of the coefficient matrices are given by 

/4(t)eR"^", B(^)eR"'^^ 

If A{t), B{t), and C{t) are constant matrices, the 
system described by 

d 

x =—x = Ax + Bu, 
dt 



y — Cx 



( 2 ) 



is called a linear time-invariant system. Corre- 
spondingly, the system described by equation 
(1) is called a linear time-varying system. 

Given an initial state x(to) and input func- 
tion u(t) for the state of system (1) is 
represented by 



x(t) = d)(t, to)x(to) + 



0(t, r)B(T)u(z)dz, 



where d)(t, r) is a tfundamental solution to the 
matrix differential equation 



^d>(t,z)=A(tmt,z), 



<h(T, •::) = / (nxn identity matrix). 
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Hence the output of the system is described by 
y(r) = C(t)0(t, fo)^(^o) 

't 

Jto 

It is seen from this equation that the input- 
output relation of the system is governed by 

W{t, t) = t)B(t)6 R“ ^ ^ 

which is called a weighting matrix. As a spe- 
cial case, the weighting matrix for the time- 
invariant system (2) becomes 

W{t, t) = W{t - t) = 

where is a matrix exponential function 
defined by 

e'*‘-I + tA+^A^ +~A^ + .... 

2! 3! 

The Laplace transform of the weighting matrix 
is 

G(s) = | e ^‘Wit)dt^Cisl-Ay^B, 

which is called a transfer function matrix of the 
system (2). First we note that the weighting 
matrix W(t, t) for the system (1) is invariant 
under any linear transformation x(t) = P{t)x{t) 
of the state, and also that the transfer func- 
tion matrix G(s) for the system (2) is invariant 
under jc = Px. For example, the transformation 
x = Px yields 

x = PAP~^ x + PBu, 

y = CP”‘x. (3) 

In fact, the transfer function matrix for the 
system (3) becomes equivalent to that for the 
system (2), that is, 

G{s) = CP'^{sI-PAP-T'PB = C{sI~Ay^B 
= G(s). 

As a converse to this, there arises the problem 
of whether any linear time-invariant systems 
with the same transfer function matrix are 
similar to each other. To solve this problem, it 
is necessary to introduce the concept of con- 
trollability and observability for the system (1) 
or (2). 

C. Controllability and Observability 

The concept of controllability and observa- 
bility plays a fundamental role in linear system 
theory. Roughly, controllability implies the 
possibility of steering the state from the input. 

Definition of controllability: The linear sys- 
tem (1) is said to be controllable at time if 
for any state x(to)eR" and x^ eR" there exists a 
control input u{t), te[to, fj, that transfers the 



state x(to) to the state x^ at a finite time ti > fp. 
Otherwise, the system (1) is said to be un- 
controllable at time Iq. 

The controllability theorem states: The 
system (1) is controllable at time [q if and only 
if there exists a finite ti( > tp) such that the 
matrix 

D(tp,ti)= I '0(to,t)B(t)B'^(t)0%A)dt (4) 

V ^0 

is nonsingular. Similarly, the linear time- 
invariant system (2) is controllable if and only 
if the n x (nr) matrix 

D = [B,AB,...,A'’~^B] (5) 

has rank n. The matrix D is called the con- 
trollability matrix of the system (2). 

The concept of observability is dual to that 
of controllability. It implies the possibility of 
estimating the state from the output. 

Definition of observability: The linear system 
(1) is said to be observable at tp if for any state 
x(to)eR” there exists a finite ti(>tp) such that 
the knowledge of the input u(t) and the output 
y(t) over the time interval [tp, L] suffices to 
determine the state x(tp). Otherwise, the sys- 
tem (1) is said to be unobservable at tp. 

The observability theorem states: The sys- 
tem (1) is observable at time to if and only 
if there exists a finite such that the 

matrix 



M(to,ti)= 0^it,to)C^it)Cit)0{t,to)dt ( 6 ) 

J to 

is nonsingular. Similarly, the linear time- 
invariant system (2) is observable if and only if 
the n X (nm) matrix 

M-[C\/1^C^,...,(A^)" (7) 

has rank n. The matrix M is called the ob- 
servability matrix of the system (2). 

We deal only with linear time-invariant 
systems for simplicity, although some of the 
main results to be stated here can be extended 
to linear time-varying systems. If the controlla- 
bility matrix D has rank ( < n), there exists a 
linear transformation x = Px with nonsingular 
matrix P which transforms the system (2) into 



UJ [o A,\[x,\ [oJ“’ 



so that the n^-dimensional subsystem 



x^ — A^Xc + B^u, 
y = -CcXc 



( 9 ) 



is controllable. Clearly this subsystem has the 
same transfer function matrix as the system (2). 
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If the observability matrix M has rank i<n), 
there exists a linear transformation x — Qx 
that transforms the system (2) into 




y-lCo 0] 



( 10 ) 



so that the n„-dimensional subsystem 



Xo = ^o^o + BoU, 



( 11 ) 



is observable. This subsystem also has the 
same transfer function matrix as the system (2). 
Combination of these two properties yields the 
canonical decomposition theorem (found by 
Kalman [6]), which states: There is a nonsin- 
gular transformation x = Pxhy which the 
system (2) can be transformed into the canon- 
ical form 



^co 




^co 


0 


Xco’ 




^21 


^22 


Xc'o 




0 


0 


Jc'o’ 




_0 


0 




tern that has a prescribed rational function 
matrix or (2) a linear time-varying system that 
has a prescribed weighting matrix. Here we 
consider only the former, although there are 
many interesting results for the latter as well. 

The rational function matrix G(s) is said to 
be strictly proper if and only if G(oo) = 0. A 
linear time-invariant system with a prescribed 
G(s) as a transfer function matrix is called a 
realization of G(s). A realization of G(s) with 
the least possible dimension is said to be mini- 
mal. A minimal realization should be irreduc- 
ible by definition. The fundamental theorem 
on minimal realization, which answers the 
question raised at the end of Section B, states: 
For a given strictly proper rational matrix 
G(s), any two minimal realizations of 5(s) are 
mutually similar in the sense that there exists a 
linear transformation x — Px that transforms 
one of the systems into the other. 

A minimal realization of a scalar transfer 
function 



9(s)^ 



his" ^ + ... + b„-iS + b„ 
s"-f -f- ... -t-a„_iS-l-a„ 



is given by 



(13) 



y=[Q 





Bco 








" 0 


1 


0 


0" 


+ 


B2 

0 


u, 


(12) 


x = 


0 


0 


1 


0 




_0 _ 








0 


0 


0 


1 


0 




0] pco “1 , 






— a„ 


-«n-l 


-««-2 


... 



(14) 






where the subvector x^„ of x is controllable 
and observable, x^^. is controllable but not 
observable, „ is not controllable but obser- 
vable, and Xc'o' is neither controllable nor 
observable. Furthermore, the transfer function 
matrix of the system (2) is equal to that of the 
system (12), which is described by 

G{s) = CJsI-AJ-^B,,, 

that is, the transfer function matrix of the 
system (2) depends solely on the controllable 
and observable part of the state. 

A linear system (2) is said to be reducible if 
and only if there exists a linear time-invariant 
system of smaller dimension that has the same 
transfer function matrix. Otherwise, the system 
is said to be irreducible. The canonical decom- 
position theorem implies that a linear time- 
invariant system is irreducible if and only if it 
is controllable and observable. 

D. Realization Theory 

Realization theory is concerned with determin- 
ing (I) a linear time-invariant dynamical sys- 



0 

1 

y = [b„ b„^2 bjx. 

It is easy to see that this system is controllable. 
Hence system (14) is called a controllable 
canonical realization. Kalman [4] showed that 
this realization is observable and therefore 
minimal if and only if there is no common 
factor between the denominator and the nu- 
merator of ^(s). An observable canonical re- 
alization of ^(s) can be similarly written down 
as a dual of the system (14). 

For a given strictly proper matrix G(s), the 
degree of the least common denominator of 
all entries of G(s) is called the degree of G(s), 
provided that there is no common factor be- 
tween the denominator and the numerator 
of each entry. It was shown by Kalman [7], 
together with a realization procedure, that 
the dimension of minimal realization of G(s) 
is equal to the degree of G(s). Other impor- 
tant realization procedures for rational func- 
tion matrices were proposed by B. L. Ho and 
Kalman [8], W. A. Wolovich and P. L. Falb 
[9, 10], J. Rissanen and T. Kailath [11], and 
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B. Dickinson et al. [12]. Procedures for time- 
varying system realization for prescribed 
weighting matrices were presented by Kalman 
[4], D. C. Youla [15], and L. M. Silverman 
[16], Realization problems for a given impulse 
response matrix were first discussed by C. A. 
Desoer and P. P. Varaiya [17], and a fast 
algorithm for discrete-time system realiza- 
tion was proposed by C. T. Muilis and R. A. 
Roberts [18]. 

E. State Estimation 

When a linear system is observable, it is pos- 
sible to construct a device, called a state esti- 
mator, that approximates the state vector. For 
the given system (2), define a linear dynamical 
system 

z — Dz + Ey + Gu, 

x — LiZ + L2y (15) 

with imputs yGR™ and weR'', state zgR^, and 
output xeR". If the system (2) is observable, 
rank (C) — m, and q^n — m, there exist matrices 
D, E, G, Lj, L 2 , and M that satisfy 

MA-DM = EC, 

L,M + L 2 C = I„ (n X n identity matrix), 
G=^MB. 

Then it follows that 

je(r) - x(t) = Li e""' [z(0) - Mx(0)]. 

Furthermore, it is possible to choose a matrix 
D which is asymptotically stable, that is, every 
eigenvalue of D has negative real part. Thus 
i(t)^x(t) as r->oo, and system (15) yields a 
state estimator. This result was first shown by 
D. G. Luenberger [19,20], and thus the system 
(15) is called a Luenberger observer. 

The Kalman filter proposed by Kalman [21] 
for discrete-time processes and by Kalman and 
R. S. Bucy [22] for continuous-time processes 
is an optimal state estimator for a stochastic 
system. Let 

x(t) = A{t)x{t) + 

y(0 = C(t)x(t) + u(t), (16) 

in which u{t) is a white Gaussian noise (tBrown- 
ian motion) (^ 45 Brownian Motion) such 
that 

Eu{t) = 0, £w(t)w^(s) = U(t)Sit - s), 

and v(t) is also a white Gaussian noise such 
that 



sumed that 

£x(fo) = 0, £x(to)xT(fo) = Xo, 

and that x(tQ), u(t), and v(s) are uncorrelated. 
Under these conditions, the process x(t) be- 
comes a wide-sense Gauss-Markov process 
(J. L. Doob [23]). The Kalman-Bucy filter is a 
linear dynamical system described by 

Mt)-^[A(t)-L(t)c(t)]x(t)-i-ima 

where 

L(t) = P(t)C^(t)V-^(t) 

and P(t) is a solution to the matrix Riccati 
differentia) equation 

P(t) = A(t)P(t) + P(t)A'^(t) + B(t)U([)B'^(t) 
-P(t)C^(t)V-^(t)C(t)P(t), 

P(t 0 )^X 0 . 

The Wiener filter proposed in 1949 [24] is 
a special case of the Kalman filter in which 
the signal process is stationary and of single 
input and single output. Extensions to non- 
linear filtering were investigated first by 
H. J. Kushner [25] and subsequently by 
W. M. Wonham [26], R. S. Liptzer and A. N. 
Shiryaev [27], and M. Fuzisaki et al. [28]. 

F. Optimal Control 

Consider a nonlinear dynamical system de- 
scribed by a system of differential equations 

x=/(x, u), (17) 

where xgR" and mgR''. For a given set U in 
R*", a piecewise continuous function u(t) de- 
fined over [to, f 1 ] is said to be an admissible 
control function if u(t)eU for every tG[to,ti]. 
When a functional 

J tQ 

is given as a performance index of control 
and two points x° and x^ are given in R", the 
optimal control problem consists of finding an 
admissible function u(t) that minimizes J and 
simultaneously transforms the state from x(to) 
= x*^ to x(ti) = xL To solve this, a necessary 
condition for optimality was developed by 
L. S. Pontryagin and his colleagues [3] and 
termed the maximum principle. It states: In 
order for an admissible control u{t) and a 
corresponding solution trajectory of equation 
(17) to be optimal, it is necessary that there 
exist a nonzero vector i/^(t) = (i/^i(t), 
such that (i) \f/{t) satisfies 



Ev{t) = 0, Ev{t)v'^{s)=V(t)S{t-s), 

where E and ^ denote the texpectation and the 
transpose, respectively. In addition, it is as- 



cH{ij/{t},x{t),u{t)) 

dx(t) 
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where 

X, u) = -/o(x, u) + lA «), 

(ii) the function //(i//(t),x(t),w) of the variable 
ueU attains its maximum at the point u — u{t) 
almost everywhere in [to, tj, and (iii) at the 
terminal time t, it holds that 

When /(x, u) is linear in x and u, /q(x, m)= 1, 
and ti is free, we have the time-optimal control 
problem investigated by Bellman et al. [29] 
and J. P. LaSalle [30]. When /o(x, u) is instead 
a quadratic function of x and u, we have the 
optimal regulator problem solved by Kalman 
[31]. The relation between the maximum 
principle and the tcalculus of variations (-^ 46 
Calculus of Variations) has been pointed out 
by L. D. Berkovitz [32]. Generalizations of 
the maximum principle and other results 
on optimal control problems are found in 
[33-35], 

G. Miscellany 

There are many other areas of control theory. 
For details, we refer the reader to [36] for 
adaptive control, [37] for stability theory, and 
[38] for system identification. 
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Convergence 

A. Introduction 

The notion of convergence was first intro- 
duced in the real number system to deal with 
sequences of numbers, functions, series, or 
definite integrals (-^ 379 Series; 216 Integral 
Calculus). The notion was then extended to 
the case of generalized sequences where the 
index moves over a directed set, and the terms 
are in a topological space. 

B. Convergence of Sequences of Numbers 

A sequence {a„} of numbers is said to be con- 
vergent to a number a or to converge to a, 
written \im„_^a„ = a or a„->a as n^oo, if for 
any positive number e we can choose a (suflfi- 
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ciently large) natural number Hq such that for 
every n larger than Hq the inequality \a„ — a\ 

<s holds. Then a is called the limit (or limit 
point) of the sequence {a„}. If {a„} has a limit, 
it is unique. A sequence which is not conver- 
gent is said to be divergent or to diverge. 

A set A of real numbers is said to be 
bounded from above if there is a real number b 
such that 0^6 for all UG A, bounded from 
below if there is a real number c such that 
u ^ c for all aeA, and bounded if it is bounded 
from above and below. A sequence {a„} of real 
numbers is said to be monotonically increasing 
(monotonically decreasing), written (u„],), 

if Uj ^ ^2 ^ ^ 0-„ ^ ^n+l ^ • (^1 ^ ^2 ^ ^ 

a„ ^ u„+i ^ . . . ). A monotonically increasing or 
decreasing sequence is called a monotone 
sequence. 

C. Criteria for the Convergence of Sequences 
of Numbers 

Every bounded monotone sequence of real 
numbers is convergent; its limit is sup{a„} 
(inf{a„}) (^ 355 Real Numbers B) if it is 
monotonically increasing (decreasing). For 
any bounded sequence {a„} of real num- 
bers, setting a„ = inf{a„, , . . . } and = 
sup{a„,u„+i, ... }, we have a„t, and 

Hencc lim„^^ a„ = a ( = sup {a„} ) 
and lim„^^ ) exist, a is called 

the inferior limit (or limit inferior) of {u„}, 
written lim inf„_„ a„ or lim„^^ a„, while 
is called the superior limit (or limit supe- 
rior), written limsup„_(„a„ or lim„_^,Qa„. If 
lim sup„^^ a„ = lim inf„^„ a„ = a, then lim„^^ a„ 
exists and equals a. The limit of a conver- 
gent subsequence of a sequence {a„} of num- 
bers is called an accumulation point of the 
sequence. Here we should distinguish an ac- 
cumulation point of a sequence {a„} from 
an accumulation point of {a„} viewed as 
a set (^ 425 Topological Spaces O); for ex- 
ample, if a„ = 1 for every n, then the former 
is 1 and the latter does not exist. For any 
bounded sequence of real numbers, its supe- 
rior (inferior) limit is the maximum (minimum) 
of its accumulation points. Moreover, if fi is 
the superior limit of a sequence {a„}, then for 
any positive number e, there exist only a finite 
number of n’s for which a„ is greater than j9-t-£, 
while there may exist an infinite number of n’s 
for which a„ is less than fi — e. The inferior limit 
of the sequence has a similar property. 

Suppose that we are given a sequence {a„) 
of real numbers and that there exist two se- 
quences {«„} and {t;„} such that 
lim(M„ — £„} = 0 , {m„} is monotonically increas- 
ing, and {£„} is monotonically decreasing. 

Then lim a„ exists and is equal to lim u„ = 
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iim v„ (principle of nested intervals). In partic- 
ular, if lim sup fl„ — lim inf a„, then lim a„ exists. 
The converse also holds. 

If {fl„} is convergent, then |a„ — as n, 
m-voo, and vice versa; that is, {a„} is conver- 
gent if and only if for any positive number e 
there exists a positive integer Hq such that 
I I < £ for all n, m > «o (Cauchy’s 
criterion). 

D. Infinity 

For a set A of real numbers, the expression 
sup^ = -l-cx) means that A is not bounded 
from above; inf/1 = — oo means that A is not 
bounded from below. For a sequence {a„} of 
real numbers, \ima„ = -|-oo means that for any 
real number b there exists a positive integer Hq 
such that a„>b for all n^riQ, the notation 
lima„= —00 has a similar meaning. The sym- 
bols -foo and — oo are called positive (or plus) 
infinity and negative (or minus) infinity, res- 
pectively. We say that the limit of |a„} is 
•foo (— 00 ) if lima„= foo (— oo). In these 
cases, we customarily say that {a„} diverges (or 
is divergent) to foo (— oo), or that a„ becomes 
positively (negatively) infinite as n^oo. We 
also define limsupa„= foo (liminfn„= — oo) 
to mean sup{a„} = foo (inf{a„| = — oo). A se- 
quence {a„} is said to oscillate if limsupa„> 
lim inf a„. 

We now have the following propositions 
concerning sequences of numbers: If lima„ = a 
and lim b„ = b, then lim(au„ f pb„) — aa f pb, 
\\m{a„b„) = ab, and \\m{ajb„) = a/b (provided 
that 7 ^ 0, 6 # 0). For sequences of real num- 

bers, these formulas also hold when a or h is 
infinity. In those cases we set tx-(f oo)= foo 
(a>0), a-(±oo)= f 00 (a<0), af 00 = f 00 , 
a/(f oo) = 0 for a real number a. The cases 
0-(±oo), f oof (— oo), foo/(foo)are 
excluded. 

E. Convergence of Sequences of Points in a 
Topological Space 

A sequence {a„} of points in a topological 
space (-^ 425 Topological Spaces) is said to 
converge to a point a if for any tneighborhood 
U oi a there exists a positive integer such 
that a„eU for all n^riQ. The point a is called 
a limit (or limit point) of {a„} and we write 
lim„_^a„ = a or a„-^a as n-»oo. A sequence 
is said to diverge if it does not converge to any 
point. 

In particular, the set R of all real numbers is 
a topological space in which the set of inter- 
vals (a — e, a f e) for some e > 0 is a ^base for 
the neighborhood system of a point a, so that 
the notion of limit in R explained previously is 



a special case of the same notion in a topolog- 
ical space. By adding the symbols foo and 
— oc to R, we obtain the topological space R, 
in which any set containing (x j x > a, x e R} 
({x|x<a,xeR}) for some aeR is a neighbor- 
hood of foo (— oo), where the ordering is 
defined as — oo <a< f oo (aeR). Then lima„ = 
foo (— oo) is interpreted as convergence in 
the topological space R. The elements of R are 
called extended real numbers. 

In the case where the topological space is a 
^metric space (^ 273 Metric Spaces) with 
metric p, a„-*a is equivalent to p(a„,a)^0. 

For convergence of sequences of points in a 
topological space, the following properties (S) 
hold: (S) (1). If fl„ = a for all n, then lima„ = a; 

(2) if a„^a, then for any subsequence 
{a„ }; (3) if there is a point a such that any 
subsequence {a„J of {a„} has a suitable sub- 
sequence converging to a, then a„^a. In a 
^Hausdorff space (e.g., a metric space), the 
additional property (S*) holds; If a sequence 
{a„} has a limit, this limit is unique, and can be 
denoted lima„. 

F. Limits of Functions 

Let a real-valued function /(x) of a real vari- 
able X be defined for xf a belonging to a 
neighborhood of the point a. We say that the 
limit of /(x) is fe as X tends to a, and write 
lim^^a/(x) = h or f{x)^b as x^a, if for any 
positive number ^ there exists a positive num- 
ber d such that 0/|x — a| <5 implies |/(x) — 
b\<e. Replacing Ot^Ix — a|<<5 bya<x< 
af ^ (a — (5<x<a), we define /(x)^h as 
x^flf 0 (x-»’a — 0) and say that b is the 
limit on the right (left) of /(x) as x tends to a. 
We define /(x)-> foo or fix)-*' — oo as x->a 
analogously to the case of sequences. The 
expression f{x)~*b as x-> foo means that for 
any positive number e there is a real number k 
such that |/(x) — h|<e for any x>k. There 
are similar definitions for x-+ — oo and h = 
foo. When /(x)^ f oo as x^a, we often say 
that / diverges definitely at a. 

In general, for a mapping / from a subset D 
of a topological space X into a topological 
space y, with a point a in the closure of D and 
a point b in Y, lim^_^/(x)=^h or f{x)-*b as 
x-*a means that any neighborhood F of h 
contains f[U HD — {a}) for some neighbor- 
hood U of a. If y is a Hausdorff space, b is 
unique (if it exists) for given / and a. This 
point b is called the limit (or limit value) of fix) 
as x-»-a. 

It is easy to see that this definition of 
lim^_a/(x) = h is a generalization of the cases 
where the topological spaces are R or R. Let N 
be the set of all natural numbers, and let N = 
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NU{ + 00} be supplied with the relative 
topology as a subspace of R. A sequence {a„} 
of real numbers or points can be identified 
with a mapping / from N into R (N^R) 
defined by f{n) — a„. It converges to a if and 
only iflim„^^/(n) = a. 

Suppose, in particular, that / is a mapping 
from a metric space {X, p) into a metric space 
(1^, (t). Then, f{x)-^b as x^a means that 
for any e > 0 there exists a (5 >0 such that 
a{f{x),b)<^ for all xeD such that 0<p(x, a) 

< (5. Thus f{x)-^b as x^a if and only if /(x„) 
for any {x„} in D with x„—^a. If we set 
p(zi,Z2) = |zj — Z2I for complex numbers z^, 

Z2, the function p supplies the set of all com- 
plex numbers C with a metric and C becomes 
isometric to the plane R^ 74 Complex 
Numbers). Thus the cases = C or T=C are 
particular cases of the above generalization. 
Furthermore, we introduce the tRiemann 
sphere C = CU{oo}by adding the tpoint at 
infinity 00 to C. We can define a topology 
on C such that any set containing {00} U 
{z I |z| > r} for some positive number r is a 
neighborhood of 00. Thus we can define the 
notions f{z)~*b as z^ao, f(x )-*- 00 as x-*a, 
etc., for a complex-valued function / by con- 
sidering f as a mapping from the topological 
space C into itself Then /(x)-+oo is equiva- 
lent to l//(x)->0. 

G. Orders of Infinities and Infinitesimals 

Let / be a complex-valued function defined 
on a topological space X and a a point of X. 
Then / is called an infinity (at a) or an in- 
finitesimal (at a) if/(x)^oo as x->fl or/(x)-> 

0 as x-»a, respectively. Suppose that / and 
g are infinities and / jg is an infinitesimal. Then 
/ is said to be of lower order than g, and g is 
said to be of higher order than f. If both fjg 
and gjf are bounded, then / is said to be of the 
same order as g. This last relation, written / ~ 
g, is an equivalence relation. An infinity / is 
said to be of the nth order with p if /~p". For 
two infinitesimals / and g, f is called of higher 
order than g and g of lower order than f f/g 
is an infinitesimal. For infinitesimals, the terms 
of the same order and of the nth order are 
defined similarly as above. In particular, when 
X = C and a = 00, we usually omit the phrase 
“at 00.” Also, for such a function /, we cus- 
tomarily say that the order of an infinity (in- 
finitesimal) is n if/~z" (z~"). 

To describe the order of an infinity or an 
infinitesimal simply, the following notions, due 
to E. Landau [10], are in common use. Let 
/ and g be two functions. If \f{x)/g{x)\ is 
bounded as x-*a, then / is called at most of 
the order o( g as x^a, and we write /(x) = 
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0(p(x)) as x^a. Second, if f(x)/g{x) is an in- 
finitesimal at a, then / is said to be of lower 
order than g as x-^a, and we write /(x) — 
o(p(x)) as x->a. The symbols 0, 0 , indicat- 
ing the word “order,” are called Landau’s 
symbols. The notation f{x) — h{x) + 0{g{x)) 
means f{x)~h{x) = 0{g{x)). When we use the 
symbols 0, 0 we should indicate clearly the 
phrase “as x->a,” which is sometimes omitted 
when no confusion is to be feared (e.g., for the 
case of a complex variable with a = 00 ). These 
symbols are employed for sequences as well, to 
describe their behavior as n-^00. 



H. Convergence of Nets (Moore-Smith 
Convergence) [7, 8] 

Let 51 be a (preordered) tdirected set. A family 
of points in a set X with index set 51 (namely, a 
mapping from 51 to X) is called a net in X. A 
net is denoted by ({x*},j, or {x„}). A net 

in X is called a subnet of {Xj,}^! if there 
exists a mapping (^:®->^5l such that (1) = 

x^^p^ and (2) for any age 51 there exists a 
such that implies (p{[i)^aQ.ln particu- 
lar, if 53 is a cofmal directed subset of 51, (x^j^, 
is called a cofmal subnet of {Xj,},^,. A net is 
called a universal net if either {alx^^e T} or 
{a|x^G2f — T} is residual in 51 for any subset 
Y of X. For any net there is a universal subnet. 

For a net {x„},^( in a topological space X, 

{xjJ is said to converge to a point x in if for 
any neighborhood (7 of x there is an olq such 
that {x^ I a > Xq] a U. Then a is called a limit 
of the net {x^}. We then write x„->x (ae5I) 

(or simply x,^x). The convergence of se- 
quences of points is the special case where 
51 = N. The notion of convergence using nets 
was introduced by E. H. Moore and H. E. 
Smith. 

Concerning this convergence, we have the 
following propositions (D): (D) (1) If x„ = x for 
all a, then x^->^x. (2) If x„^x and {y^} is a 
subnet of {x}, then yp-*x. (3) If for a net {x,} 
there is a point x such that any subnet of 
{Xj(} has a suitable subnet converging to x, 
then x„-^x. (4) Suppose that x^^^x (a €51) 
and y*^^x^ (/ieSJ for each a. Then let (£ = 

51 X n be the direct product of directed 
sets with projections p:(£-^5l and 
and for 7 € C define Zy = y^p, where a = p(y) and 
l^ = Pxiy)- Then z^->x (ye(£). Furthermore, the 
space is a tHausdorff space if and only if 
we have the condition (D*): Any net in X has 
at most one limit. 

A limit of {Xj,} is denoted by lim x* or 
lim5,g,j,Xj(. Then x^^x if and only if x is con- 
tained in the closure of any subnet {yp\[}e^] 
of {x^}. (We may consider this to be a defini- 
tion of Xj-)-X.) 
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I. Convergence of Filters [9] 

Let X he a set. A set of subsets of X is called 
a filter if the following conditions are satisfied: 
(i) 0 ^ O (0 is the empty set); (ii) A<^Bc:X 
and i4eO imply BeO; (Hi) /I, Be<l> imply 
v4 n Be<I). Let ® be a set of subsets of X and 
<D be the collection of subsets of X such that 
each element A of <I) contains a subset belong- 
ing to ®. If O is a filter, then ® is called a filter 
base which generates O. ® is a filter base if and 
only if (i) 0^®; (ii) A, BeS implies that there 
is a CeS with AClB^C. A filter O is called 
an ultrafilter (or maximal filter) if there exists 
no filter which contains O properly. For any 
filter there exists an ultrafilter containing it. If 
is a family of filters, then the intersection 

<J)^ is a filter. If 5 is a filter base in the index 
set A, then IJatss is also a filter. 

We denote by U(x) the tneighborhood sys- 
tem of a point x in a topological space X. A 
filter O in X is said to converge to a point a, 
written 0->a, if U(x)e(I). A filter base 93 is 
said to converge to a if the filter generated by 
® converges to a. 

The convergence of filters just defined has 
the following fundamental properties (L): (1) 
for a point a in X the filter <^^ = {A\aeAci 
X} converges to a; (2) for two filters O and 
'F, 0-*a and imply (3) if 
a for all members in a family of filters, 
then (4) suppose that filters 

are assigned for all points y in a subset 
Y of X, and that we are given a filter 'F in X 
which converges to a, generated by a filter 
base ® in Y; then Fur- 

thermore, the space X is Hausdorff if and only 
if we have the condition (L*): Each filter in X 
has at most one limit. 



J. Relations among Various Definitions of 
Convergence 

Convergence of sequences of points is a special 
case of that of nets. Properties (1), (2), and (3) 
of (D) imply (1), (2), and (3) of (S), respectively, 
and (D*) implies (S*). Consider a net {x,},^ in 
X. Then the set { (x^J a e 91, a ^ ao} | ao g 91} of 
subsets of X is a filter base in X which gener- 
ates a filter O, and 3>-^x if and only if x„->x. 

In this situation, (L) implies (D), and (L*) 
implies (D*). Suppose that we are given a 
function /: X -> 7 with the domain D and a 
point aeX. Let U(a) be the neighborhood 
system of a and assume that, for any UeU{a\ 
UC\D — {a}¥^0. Then the set {/{UflD — 

{a})| f/eU(a)} is 4 filter base. Let <F be the 
filter generated by it. Then /(x)-vh as x-^a if 
and only if <!>-»■&. Consequently, the various 
types of convergence described previously can 



be expressed by means of convergence of 
filters. 



K. Convergence and Topology 

In a topological space X, the concept of con- 
vergence of nets and that of filters can be 
defined. Conversely, convergence of nets in X 
defines a topology of X. In fact, let us assume 
that we are given a set X and a definition of 
convergence of filters which satisfies the prop- 
erties (L). Then convergence of nets that satis- 
fies (D) can be introduced as above. If A is 
defined as the set of limits of all nets contained 
in a subset A of X, then A satisfies the axiom 
of closures (-^ 425 Topological Spaces), and a 
topology can be defined on X. Then we have 
the following propositions: (i) ae A if and only 
if there is a net {x„} with x^eA converging to 
a; (ii) C is a neighborhood of a if and only if 
x^^a implies that there exists an Xq such that 
XjG 1/ for all Thus, if X is a topological 
space, it carries a “new” topology defined by 
way of convergence of nets. But this “new” 
topology coincides with the original one. 
Similarly, starting from convergence of filters 
(or nets), we can obtain a “new” definition of 
convergence of filters (or nets), which coincides 
with the initial one. In conclusion, defining a 
topology on a space X is the same thing as 
defining convergence of filters in X or of nets 
in X. 

We shall describe here a few notions on 
topological spaces in terms of convergence. 

The fact that a topological space X is compact 
is equivalent to the fact that every universal 
net in X converges, and to the fact that every 
ultrafilter in X converges. Also equivalent is 
the fact that every net has a convergent subnet. 
A mapping / from a topological space X into 
a topological space 7 is continuous at a if and 
only if one of the following conditions is satis- 
fied: (1) for any net (x^d in X converging to 
aeX, we have /(xj^ f{a) in 7; (2) for any filter 
$ in X converging to ogX, we have /(^) = 

7; (3) /(x)^/(a) in 7as 
x^a in X (in the sense of the limit of a func- 
tion at a). 

M. Frechet [6] gave a definition of a topol- 
ogy on a space using the notion of conver- 
gence as a foundation. A set is called an L- 
space (or Frechet L-space) if convergence of 
sequences of points in it is defined so as to 
satisfy conditions (1) and (2) of (S) and (S*) 
(1906). Such convergence is called star conver- 
gence if it also satisfies (3) of (S), and in that 
case the space is called an L*-space. For any 
subset A of an L-space X, define A as the set 
of all points a such that x„ -»■ a for some se- 
quence {x„} contained in A. Then the axioms 
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A^A, AUB ^AUB, and 0 = 0 are satis- 
fied, so that X is a +generalized topological 
space (the axiom A = A \s not necessarily 
satisfied). For a Hausdorff space X with the 
+first countability axiom and convergence of 
sequences defined by means of its topology, 
the closure operation defined above gives the 
same topology as the initial one. 



L. (o)-Convergence 

A sequence {«„} of elements of an ordered set 
S is said to be (o)-convergent to an element a of 
S if there exist two sequences {m„} and {!;„} 
such that u„^a„^v„, and 

and a = supu„ = infi)„. When we write this 
a„^a, properties (1) and (2) of (S) and (S*) con- 
cerning the convergence of sequences hold. 
Next, a sequence {a„} is said to be (o)-star 
convergent to a if any subsequence of {a„} has 
a suitable subsequence which converges to a. 
Then (o)-star convergence satisfies the prop- 
erties (S) and (S*). 

For any set X the set of all subsets of 
X is an ordered set under the inclusion rela- 
tion. The fact that a sequence {A„} of subsets 
is (o)-convergent to a subset A is equivalent to: 

00 GO 00 00 

n u u n 

m=l n~m m = l n = m 

The set A is also equal to limy4„, which is the 
tlimit of a sequence {A„} of subsets. 
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A. Convex Functions 

A real- valued function f(x) defined on a tcon- 
vex set D in a linear space over R is called a 
convex function if for every x, ye D and 0^ 

A ^ 1 we have 

f(Xx + ( 1 - A)y) ^ A/(x) + ( 1 - my)- ( 1 ) 

The function f{x) is called a strictly convex 
function if the sign ^ in (1) is replaced by 
< for x^y except when x is either 0 or 1. If 
— ij/{x) is convex (strictly convex), the function 
tl/{x) is called a concave function (strictly con- 
cave function). The notion of convex function 
was introduced by J. L. W. V. Jensen [1] for 
the case where the domain D is an interval on 
the real line R. 

Sometimes the condition for a convex func- 
tion is weakened in such a way that (1) is as- 
sumed only for A = 1/2. However, if D is a 
■^topological linear space and / is continu- 
ous, then the weakened condition implies the 
original one. Hereafter, we mainly consider the 
case where D is an interval on the real line. In 
this case, a convex function f{x) (in the weaker 
sense) is continuous in the interior of the inter- 
val if f{x) is tmeasurable or bounded from 
above on a set of positive measure (the latter 
was proved by A. Ostrowski [2]). In particu- 
lar, suppose that /(x) is defined in the interval 
I and is bounded from below. Then either /(x) 
is continuous or its graph is dense in the set 
{(x, y)|xe/,y^0(x)}, where ^(x) is a suitable 
convex continuous function (Hukuhara [3]). 
We note here that the original definition of a 
convex function /(x) implies the continuity of 
/(x) in the interior of the interval. In such a 
case, /(x) always has tright and deft deriva- 
tives and satisfies /-(x)^/+(x)^ fL{y)^ 
f'+ (y) for X < y. Hence it is differentiable ex- 
cept for at most countably many points. 

A function /(x) is a continuous convex 
function in a ^ x ^ h if and only if it is expre- 
ssible in the form 

nx)=m+l 



(p(t)dt, 
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where (p{t) is a monotone increasing function. 
If f{x) is twice differentiable, then /"(x)^0 
(a < X < h) is a necessary and sufficient condi- 
tion for f{x) to be convex in {a, b). 



B. Convex Functions and Inequalities 

If f{x) is convex (in the original sense), we 
have, for a„>0, 

(2) 

Similarly, we have, for (p>0. 



D. Subdifferentials of Convex Functions 

Currently, convex analysis is playing an im- 
portant role in the study of nonlinear evolu- 
tion equations. This analysis treats convex 
functions on infinite-dimensional spaces [9]. 

In convex analysis, it is sometimes conve- 
nient to consider a proper convex function q> 
on a Hilbert space X. A mapping <p from X 
into (— 00 , -l-oo] is called a proper convex 
function if (p is not identically equal to -foo 
and if 

<p(/lx -H (1 - X)y) < /l(p(x) + (1 - X)(p{y) 

for every x, ye X and 0 ^ A < 1 . The convex set 



/ (pif/dxi 



(pdx < (pf{ij/)dx. 



(pdx. (3) 



The functions x" (a > 1 or n < 0), — x" (0 < a < 
1), — logx, xlogx are strictly convex for x>0, 
a nd the fu nctions x^" (n ^ 1), expx, log(l -I- e^), 
yja^ + x^ (n #0) are strictly convex in — cx) < 
x< - 1 - 00 . Applying the inequalities (2) or (3) to 
these functions, we obtain various inequalities, 
including the inequalities on means (^211 
Inequalities). 

A continuous convex function /(x) over a 
topological linear space satisfying the relation 
f{ocx) — af{x) for an arbitrary positive number a 
is called a subadditive functional and is often 
utilized in functional analysis. 



C. M. Riesz’s Convexity Theorem 



Let X ((^ 1 , . . . , <^„) be an n-tuple of com- 
plex numbers, and let v>0. We put N^{x) = 
(I”=i for v>0 and No(x) = sup|(^^t. 

Let be an m X n complex matrix, x = 
(<^i,...,0> 2 = (Ci,...,CmX v^O, andjU^O. 
We put 



M{v,fi)= sup 



m n 



E E “oC,{j . 

f=i ;=i 



Then \ogM{v,pi) is a convex function of iv,fx) 
in the following sense: Let 0 < V; ^ 1, 0 < /r,- ^ 1, 
and + 1 (i= 1,2). Then logM((l — t)vi + 

tV 2 , (1 — O/ii +t/^ 2 ) is a convex function with 
respect to t for 0^ t ^ 1 [4, 5]. These results are 
called M. Riesz’s convexity theorem. Famous 
inequalities such as the tHolder inequality or 
the tMinkowski inequality follow from this 
theorem. For example, let T be an tadditive 
operator from the ^function space Lp(Q.) into 
Lp(Q) for all 1 ^ go. If T is a continuous 

operator for p = 1 and p=oo, and the norm 
of T is < C for p = 1 and p = oo, then T is con- 
tinuous for all p (1 <p < 00 ), and its norm is 
always ^C. 



D{(p) = {xeX\(p{x)<oo}'^X 

is called the effective domain of (p. The subdif- 
ferential d(p of (p is defined to be a multivalued 
function which assigns to each x in D{(p) all 
elements yeX gatisfying (p{^)'^ (p{x) + {y, ^ — 
x> for any ^eX. Then dcp is monotone in 
X in the following sense: If yj e5(p(xi) and 
y 2 ed(p(x 2 ), then 



<yi-y2,Xi-X2>>0. 



Under some conditions on cp, dcp becomes a 
single-valued function, as is seen in the follow- 
ing example: Let A be a nonnegative self- 
adjoint operator in a Hilbert space X. For the 
tfractional power .J~A of A, define <p by 




-I- 00, 



xeD{^), 

otherwise. 



Then D{(p) = D(y/A) and d<p = A. 



E. Convex Functions and Nonlinear 
Semigroups 

Let (p be a lower semicontinuous proper con- 
vex function on a Hilbert space X. Since A — 

— d(p is a. maximal tdissipative operator in X, 
it generates a nonlinear semigroup { 1 1 ^0} 
on the closed convex set D(A): 

Tx^lim (/-T/4)"t‘/'’x 

(^ 286 Nonlinear Functional Analysis, 378 
Semigroups of Operators and Evolution Equa- 
tions). It can be shown that for any aeD{A), 
TtaeD{A) whenever t>0. Thus the ^abstract 
Cauchy problem 

du 

— + d(p[u)30 (t>0), 

dt 

m( + 0) = a, 

is considered to be “parabolic.” F urthermore. 
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we have 




1 

^--\\a—T^a 



it>0) 



for all aeD{A) (H. Brezis [10]). 
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A, General Remarks 

A nonempty subset X of the n-dimensional 
Euclidean space R" is called a convex set if 
for any elements x, y in X and any number a 
such that 0 a ^ 1 , the element ax + (l—a)y 
of R" is also contained in X. The tinterior 
and the tclosure of a convex set are also con- 
vex. A point X of a convex set X is called an 
extreme point of X if x cannot be expressed as 
(Xi -t-X 2)/2 in terms of a pair of distinct points 
Xj, Xj in X. A bounded closed convex set is 
called a convex body if it has tinterior points. 



Given an arbitrary nonempty subset X of R", 
the minimum convex set containing X exists, 
called the convex hull of X and denoted by 
[X]. Each point x of [X] can be expressed as 
X = 2"=i fl.oCi, where x,- belongs to X and the a, 
are nonnegative numbers such that Z”=i fli — L 
When X is a finite set, [X] is called a convex 
polyhedron. If X denotes the set of extreme 
points (also called vertices) of a convex poly- 
hedron X, then X = [X]. 

For elements x, y of R", denote the inner 
product by (x, y). Given a nonzero element v of 
R" and a fixed number a, the thyperplane H = 
{x|(i7,x) = a} divides the space R" into two 
thalf-spaces {xl(r,x)^a} and {xl(t;,x)^a}, 
each of which is a closed convex set. If a con- 
vex set X is contained in one of the half-spaces 
S determined by the hyperplane H and the 
boundaries of X and H intersect, then we say 
that H is a supporting hyperplane of X and S is 
a supporting half-space of X. A closed convex 
set X is the intersection of its supporting half- 
spaces. A boundary point of a convex set X is 
contained in a supporting hyperplane of X. 
Given mutually disjoint convex sets X and 
Y, the separation theorems (1) and (2) hold. 

(1) If X has inner points, then there exist a 
nonzero element v of R" and a number a such 
that X is contained in the set {x | (x, r) ^ a} and 
Y is contained in the set {xl(x,i;)^a}. 

(2) If X and Y are closed and X is bounded, 
then we can replace the signs ^ and ^ in (1) 
by < and > , respectively (when the separa- 
tion of convex sets X and Y is described by 
strict inequalities, we say that X and Y are 
strongly separated). 

As an immediate consequence of the separa- 
tion theorems, we obtain the following pro- 
position: Suppose that A is an w x n matrix 
with real entries. For an element z in a Eu- 
clidean space, we write z ^ 0 ( > 0) if each 
component of z is > 0 ( > 0). Now if I4y > 0 
never holds for an m-dimensional vector y > 0, 
then there exists a nonzero n-dimensional 
vector X ^ 0 such that /4x^0(-^ 173 Game 
Theory). 

The definitions given previously for subsets 
of R" can be naturally extended to the case of 
treal topological linear spaces (— ^ Section G). 
Also, in the theory of analytic functions of 
several complex variables, various notions of 
convexity of the subsets of C" are considered 
(^21 Analytic Functions of Several Complex 
Variables). 



B. Helly’s Theorem 

Suppose that we are given an index set A of 
cardinality greater than 1, and bounded 
closed convex sets (2e A) in R". If any n -I- 1 
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sets of the have nonempty intersection, then 
the intersection of all the Q is nonempty 

(Helly’s theorem). 

This theorem has a wide range of applica- 
tions. For example, we have propositions (1)- 
(4). 

(1) If a convex set X of R" is covered by a 
finite number of half-spaces, then X can be 
covered by no more than n -I- 1 half-spaces 
among them. (2) Let X and Y be finite subsets 
of R". X and Y are strongly separated by a 
hyperplane if for an arbitrarily chosen subset S 
of 2HJ y consisting of at most n + 2 points, the 
sets S n and S fl y are strongly separated by 
a hyperplane. (3) If the tdiameter of a subset X 
of R" is not greater than 2, then X is contained 
in a tball of radius {2n/{n + l)y>^. (4) Let X be 
a convex body in R". There exists a point x 
in X such that ||x-u||/||i;-m|| ^n/(«-|- 1), 
where u, v are points of intersection of an arbi- 
trary straight line passing through x with the 
boundary of X, and ||x|| denotes the length 
(x,x)*''^ of X. 

Helly’s theorem can also be applied to 
problems of approximation of functions. 

C. Ovals 

The boundaries of convex bodies in R^ and R^ 
are called ovals and ovaloids, respectively. An 
oval £ is a tJordan curve C{t) which admits at 
every point = C(to) left and right “tangents” 
Ip^ and Ip^, where is a straight line expressed 
as the set of points P{a), 2eR such that F(2) 
-PQ = Xa-, with a- =limj_+o(C(t)--Po)/|C(t) 

— PqI)- There may exist exceptional points P 
on £ for which left and right tangents do not 
coincide, but the set of such points is at most 
countable. Each tangent shares a point or a 
segment with £. A line satisfying this condition 
is called a supporting line of the oval. It is also 
a supporting hyperplane in R^ of the convex 
body [£] in the sense of Section A. If we fix an 
interior point O of a convex body X and take 
an arbitrary point P different from O, then the 
boundary £ of A admits one and only one 
supporting line 1{P) which is perpendicular to 
the line OP and meets the half-line OP. Take a 
rectangular coordinate system with the origin 
0, and let (x, y) denote the coordinates of P. 
Then the points on /(£) satisfy the equa- 
tion ^x-hrjy = H{x, y), where H{x, y) is a func- 
tion determined for all (x, y) in R^ and satisfy- 
ing the following conditions: (i) //(0,0) = 0; (ii) 
H{tx, ty) = tH{x,y), for t^O; (iii) //(Xj -l-X2,yi 
-l-y2)^//(xi,yi)-|-//(x2,y2). The function 
H{x, y) is called the supporting line function of 
£. The magnitude and shape of £ are deter- 
mined by H, and any function satisfying con- 



ditions (i)-(iii) is a supporting line function of 
an oval. An oval £ has a finite length L — L(£), 
and the convex body [£] has a finite area F = 
£(£). If OP' denotes the half-line with direc- 
tion opposite to that of OP and /' denotes the 
supporting line 1{P’), the distance between the 
parallel lines I and /' is called the breadth of £ 
in the direction PP'. Let D = D{E) and A = A(£) 
be the maximum and minimum of the breadth 
of £, respectively. D is the tdiameter of £ (or 
[£]), and A is called the thickness of £ (or 
[£]). In particular, if D = A, then the oval £ is 
called a curve of constant breadth. In the fol- 
lowing inequalities, equality holds only when 
£ is one of the figures mentioned in paren- 
theses: (1) (circles, J. Steiner (1838)) 

(^ 228 Isoperimetric Problems); (2) nD^'^AF 
(circles, L. Bieberbach (1915)); {3) 

(curves of constant breadth, W. Blaschke 
(1916)); (4) F^A^fy/s (regular triangles, J. Pal 
(1921)). See T. Kubota, Tdhoku Sci. Bull., I, 

12, 13; Tdhoku Math. J., 24, 49. 

D. Linear Combinations of Ovals 

Let Hi and H 2 be supporting line functions of 
ovals El and £2, and let ti and t 2 be positive 
numbers. Since the function tiHi + t 2 H 2 satis- 
fies conditions (i)-(iii), given before, it is a 
supporting line function of an oval E{t 1 , 12 ). In 
this case, we can also write E{ti,t 2 ) = ti £j + 
tjFz call it a linear combination of £j 
and £3. In particular, the oval (£j -t-£2)/2 is 
called the mean oval of £^ and £3. In general, 
there exists a quantity M, called the mixed 
area of £j and £ 3 , such that F(E{t 1 , 12 ))^ 
F{Ei)t] + 2Mtit2 + FiE 2 )tj. M does not 
depend on the choice of and t 2 , and 
£(£i)£(£ 2). Here, the equality holds if and 
only if El and £2 are homothetic and situated 
in a position of homothety. Furthermore, if 
0 ^ t ^ 1 , then the square root of F{E{t, 1 — t)) 
is a tconvex function of ( (H. Minkowski). 

E. Specific Ovals 

Suppose that we are given an equilateral 
triangle ABC. Draw three circles Cj, C2, and 
Cj with centers A, B, and C and radii equal to 
the length of the sides of ABC. The minor arcs 
AB, BC, and CA of the circles form an oval 
which is called a Reuleaux triangle. This oval 
is of constant breadth. Furthermore, given a 
fixed breadth D, the area £(£) of an oval £ of 
constant breadth D attains its minimum when 
£ is a Reuleaux triangle. A Reuleaux triangle 
obtained from a triangle ABC revolves freely 
within the square of side AB and touches each 
side. In general, an oval which revolves touch- 
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ing the sides of a convex polygon from the 
inside is called an inrevolvable oval. Any such 
oval revolves inside some regular polygon (M. 
Fujiwara, S. Kakeya). 

Various properties of an oval already de- 
scribed can be generalized to the case of a 
boundary of a convex body in R". For the 
volumes of subsets A, B in R" and A + B = {x + 
y\xe A,yeB} in R", the Brunn-Minkowski 
inequality 

[vol(/l + B)] ^ [vol(A)] 1/" + [vol(B)] 1/” 

holds. 

F. Convex Cones 

A nonempty subset X of R” is called a convex 
cone if for any elements x, y of A" and a non- 
negative number a, ax and x -h y are contained 
in X. A convex cone is a convex set. Given any 
nonempty subset X of R", the minimum con- 
vex cone K{X) containing X exists. Given two 
convex cones X and Y, a convex cone X +Y, 
called the sum of X and Y, is defined as the set 
of elements x -I- y, where x, y are elements of X, 
Y. The intersection of convex cones X and Y is 
also a convex cone. Given a convex cone X, 
the subset of R" consisting of the elements y 
such that (x, y) ^ 0 for any element x in X is a 
convex cone which is called the dual convex 
cone (or conjugate convex cone) of X, denoted 
by A*. If A is a finite set, K(A) is called a 
convex polyhedral cone. For example, if v 
is a nonzero vector, then the half-line (y) = 

{x\x — av,a'^Q} or the half-space (u)* = 

{x I {v, x) < 0} is a convex polyhedral cone. A 
convex polyhedral cone is closed. A convex 
cone A is a convex polyhedral cone if and only 
if A is the sum of a finite number of half-lines. 
Given convex cones A and Y, we have pro- 
positions (l)-(3): (1) If Ai cz Aj, then X} a 
X* ; (2) (Ai + A 2 )* = A* n A 2 * ; (3) A* + X} c 
(Aj n Aj)*. If Ai and A 2 are convex poly- 
hedral cones, then A* -I- X* — (A^ fl Aj)*. Gen- 
erally, A c(A*)* — A** for a convex cone A. 

If A is a closed convex cone, then A = A**. 
Namely, the duality principle holds for closed 
convex cones. A linear subspace of R" is a 
convex polyhedral cone. Also, if A is an m x n 
real matrix, the subsets {x|Ax = 0, x^O} and 
{x I Ax ^ 0} are convex polyhedral cones. Since 
the duality principle holds for convex poly- 
hedral cones, we obtain the tMinkowski- 
Farkas theorem (i.e., if A is an wi x n real 
matrix and v is an element of R™, then the 
equation Ay = 1 ; has a solution y^O in R" if 
and only if (t>, x) ^ 0 for all x e R"* such that 
'Ax 5=0). (For linear inequalities ^ 255 Linear 
Programming.) 
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G. Convex Sets in Function Spaces 

The definitions of convex sets and convex 
cones in R" can be naturally extended to the 
case of any real linear space. Some of their 
properties can be generalized and applied to 
the case of function spaces. 

(1) Let £ be a tlocally convex real topolog- 
ical linear space satisfying Hausdorff’s tsepara- 
tion axiom. Let A and B be convex sets in E, 
and assume that B has interior points and 

A n B is empty. Then A and B are separated by 
a hyperplane. Namely, there exists a nonzero 
tcontinuous linear functional f on E such that 
sup/(A)^inf/(B). 

(2) Let E be as in (1), and let C be a convex 
set in E. If a boundary point x of C admits a 
nonzero continuous linear functional / such 
that /(x) = sup /(C), we call such a point x a 
supporting point of C, and / a supporting func- 
tional of C. If C has interior points, then any 
boundary point x of C is a supporting point of 
C. 

(3) Let C be a closed convex set of a 
tBanach space E. The set of supporting points 
of C is dense in its boundary. 

A convex set C contained in the dual space 
E* of a real topological linear space E is called 
a regularly convex set if for any /q in E* not 
contained in C, there exists a point Xq in E 
such that sup{/(xo)|/eC}</o(xo). 

Let £ be a real topological linear space 
satisfying Hausdorff’s separation axiom, and 
let C be a closed convex cone having 0 as its 
extreme point. Furthermore, assume that CD 
( — C) — {0}. If we set x^y when y — xeC, 
a partial ordering ^ is defined in £. For 
example, if £ is R”, then the positive orthant 
C= {x = (x,)|x;^0, i= 1, ••• satisfies these 
requirements, and the partial ordering x^y 
defined by means of C is equivalent to the 
relation Xj<yj for all i. 

Some of the properties of matrices of posi- 
tive entries or ^integral operators whose tker- 
nel functions are positive-valued can be gen- 
eralized to properties of mappings / :£^£ 
such that /(C) czC (^ 255 Linear Program- 
ming; for the tKrein-Milman theorem ^ 424 
Topological Linear Spaces). 
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A. General Remarks 

Suppose that we are given a Euclidean plane 
and two lines X’X and Y'Y in per- 
pendicular to each other. Let 0 be the point of 
intersection of XX and Y’Y. We identify each 
of the straight lines X'X and Y'Y with the set 
of real numbers R; the point 0 on each line is 
identified with zero. Let P be an arbitrary 
point in E^. We draw lines PQ, PR parallel to 
Y'Y, X'X, where Q, R are on X'X, Y'Y, respec- 
tively. Let X and y be the real numbers corre- 
sponding to Q and R. Thus we obtain a map- 
ping sending the point P to the ordered pair 
(x,y) of real numbers. This mapping gives a 
one-to-one correspondence between the points 
P of E^ and the ordered pairs (x, y) of real 
numbers in R^. The numbers x and y are 
called the coordinates of P. 

In general, given a set of mathematical 
objects, if we have a mechanism that assigns 
quantities to each element of the set, then such 
a mechanism is called a coordinate system on 
the set, and the quantities corresponding to 
each element are called its coordinates. In the 
previous example, the mechanism is called a 
rectangular coordinate system. Coordinate 
systems are also useful in expressing quantita- 
tive concepts by geometric ones which are 
intuitively easier to grasp, c.g., diagrams of 
train schedules and 'f'nomograms. tMap projec- 
tion, tgraphical calculation, tdescriptive geom- 
etry, etc., may be viewed as applications of the 
concept of coordinate systems. 

In many cases, when we introduce a coor- 
dinate system in a space, it is determined 
uniquely by fixing a basic figure in the space. 



In the case of a rectangular coordinate system 
on a plane E^, the basic figure consists of X'X 
and Y'Y, which are called coordinate axes (the 
point 0 is called the origin). Sometimes it is 
convenient to consider real-valued functions / 
and 0 on R and a coordinate system on the 
plane E^ determined by the function that 
sends an arbitrary point P to {f{x),g{y)), 
where (x, y) are the coordinates of P in the 
rectangular coordinate system. Logarithmic 
papers, tprobability papers, and tstochastic 
papers (binomial probability papers), etc., are 
constructed in this way to fit their respective 
purposes. 

In various branches of mathematics there 
are many varieties of coordinate systems. In 
this article we deal with frames and coordi- 
nates, curvilinear coordinates, and local 
coordinates. 



B. Frames and Coordinates 

Suppose that we are given a space M and a 
ttransformation group G acting on M. It is 
desirable to introduce a coordinate system 
that best represents the geometric structure of 
M. Let be a set of figures in M such that G 
acts tsimply transitively on G,^. Each element 
of G^ is called a frame. Utilizing each frame as 
basic figure, we introduce a coordinate system 
that is “G-invariant” in the following sense: 

Let ReG^, XeM, and Q(X) be the coordi- 
nates of X with R as basic figure. Then the 
coordinate system is G-invariant if C^{X) — 
CgnigX) for any element g in G. If we have 
such a coordinate system for each ReG^, then 
the expressions of geometric properties of M in 
terms of the coordinates are independent of 
the choice of frames. 

(1) Projective Coordinates. Let M be an n- 
dimensional tprojective space P" over a field 
K, and let G be the ^projective transformation 
group of P". As a frame we can take the system 
of n -I- 1 points ( Aq , A , , . . . , A„) in general po- 
sition. The ^homogeneous coordinates of an 
arbitrary point X e P" are given by the (n + 1)- 
tuple (xq , X 1 , . . . , x„) satisfying the equation X 
— 1/j=oXjAj, XjSK. They are called tprojec- 
tive coordinates. In fact, if (xo, x j , . . . , x„) 

(0,0, ...,0), then (xo,x,, ...,x„) and {kxo,Xx^, 
..., kx„) (2^0) represent the same point in P". 

A thyperplane n of P" is expressed as the set 
of points whose coordinates (xq,Xi, ...,x„) 
satisfy a linear homogeneous equation 
'Lj=oXjUj = 0, UjEK. Therefore, the hyperplane 
n is represented by the homogeneous coor- 
dinates (uq, u^, ..., u„), called thyperplane 
coordinates of n. 
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(2) Afflne Coordinates. Let M be an n- 

dimensiona! taffine space and let G be the 
tgroup of affine transformations of £". As a 
frame we can take the system (0;ei,e2,..., e J, 
where 0, called the origin, is a point in £”, and 
the set of vectors {ej is a basis of the tstan- 
dard vector space of £". Then tinhomogeneous 
coordinates of an arbitrary point XeE" are 
given by the n-tuple {x j , ^2 , . . . , x„), where X = 
O + x,e,-. They are also called taffine 

coordinates of X in £”. Sometimes we replace 
G above by the group of tequivalent affinities 
and consider the frames (0; , 62 , • - ■ , e„) such 

that the volume of [e^ , . . . , e„] = 1 . 

Furthermore, if £" has the structure of a 
tEuclidean space, we sometimes replace G 
by the tgroup of motions and consider a sys- 
tem of rectangular coordinates determined 
by an torthogonal frame, that is, a frame 
(O; e 1 , e2 , . . . , e„) such that the inner product 
(e;,ej) = ^ij, where d is the tKronecker delta. By 
contrast, the general affine coordinate system 
of a Euclidean space is called a system of 
oblique coordinates. In this case the inner prod- 
ucts (e,-,e^) = 0y are invariants of Euclidean 
geometry, and the distance p between two 
points (x,) and (yj is given by p = (Llj^i gfi/y,- 
— X()(yj — Xy))^^^. We sometimes consider an 
oblique coordinate system satisfying (e,-, ej = 1 
(j = 1, . . . , n). In such cases the angle between 
two basis vectors C; and is determined by 

= cos 

(3) Barycentric Coordinates. In an n- 

dimensional affine space £", we take « -H 1 
linearly independent points Aq, and 

denote the position vectors from a point O to 
these points by ao, aj , . . . , a„, respectively. 

Then for any point XeE” there exists a unique 
set of numbers ... ,X„) such that X = 0 

+ 1. We call these numbers 

barycentric coordinates of X in £". They are 
independent of the choice of the point O. 

(4) Plucker Coordinates. Let V{n, m) be the 
set of all m-dimensional subspaces in an n- 
dimensional projective space P". Then V{n,m) 
has the structure of a tGrassmann manifold. In 
order to introduce a coordinate system on 
V(n,m), we fix a projective coordinate system 
on P". An m-dimensional subspace ne V{n,m) 
in P" is spanned by m -l- 1 independent points 
Bo. £1, sP”. We denote projective co- 
ordinates of these points by (hoj), (bij), 

and construct the determinants 

^ojo^oji • • • boj^ 

- 

h h h 

Then the subspace n can be represented by 
homogeneous coordinates ( . . . , ...). 



These coordinates are independent of the 
choice of m -h 1 points that span n and are 
called Plucker coordinates (or Grassmann 
coordinates) of n in V{n, m). In these coordi- 
nates, the are alternating and satisfy 

the Plucker relations 

m + l 

k -0 

where means that j\ is removed. In partic- 
ular, when « = 3 and m — 1, we have only one 
Plucker relation 2 :poi P23 - P02P13 + P03P12 = 
0, which is a homogeneous equation of the 
second degree. In other words, the set F(3, 1) 
of all lines in a 3-dimensional projective space 
P^ is realized as a quadric surface Q in a 5- 
dimensional projective space P^ that has 
(P01.P02.P03.P12.P13.P23) as projective co- 
ordinates. Moreover, when P^ is a complex 
projective space, we put 

P 01=^0 + f<^ 3 , P 02 = il+iU, P 03 = <^2 + f^ 5 . 

P 23 = ^ 0 -»^ 3 . Pl 2 ^^ 2 -i <^5 

and obtain a relation 

^O + ^? + ^l + a + ^4+^l=0 

corresponding to the Plucker relation. Thus 
every line in P^ can be represented by homo- 
geneous coordinates (^q . . • • • . <^5). which we 

call Klein’s line coordinates. 

(5) (n + 2)-Hyperspherical Coordinates. Let 
(xo,Xj, ...,x„,x^) be projective coordinates in 
an (n-H l)-dimensional real projective space 
P'’^^ An n-dimensional tconformal space S" 
is realized as a quadric hypersurface S" in 
:2"^=i OijXiXj-lxQX^^O, where is a 
positive definite symmetric matrix. A general 
point in P"^^ represents a thypersphere of S”. 
That is, a hypersphere represented by a point 
^gpn+i realized as the intersection of S” 
with the tpolar hyperplane of X with respect 
to S”; according as X lies outside of S", on S", 
or inside of S", it represents a real hypersphere, 
a point hypersphere, or an imaginary hyper- 
sphere. Therefore any hypersphere of S” in 
P"^^ is expressed by homogeneous coordinates 
(xq, X i , . . . , x„, Xqp), called {n -h 2)-hyperspherical 
coordinates. When n — 2, they are called tetra- 
cyclic coordinates, and when n = 3, pentaspher- 
ical coordinates. Therefore, if we restrict (« -h 2)- 
hyperspherical coordinates for points on S”, 
then they satisfy the quadratic relation stated 
before. In the frame (Aq,Aj, ...,A„,A^) of P"^^ 
which defines the (n + 2)-hyperspherical co- 
ordinates, Aq and A^ are points on S", and the 
other Ai are real hyperspheres passing through 
the points A^ and A^. It is possible to choose 
a frame {Aq,Ai, ...,A„,A^) such that the equa- 
tion for S” becomes S"=i xf — 2 xqX^ =0 (i.e.. 
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9ij = ^ij)- Among hypersurfaces in S", one that 
is expressed by a homogeneous equation of 
the second degree with respect to {n + 2)- 
hyperspherical coordinates is called a cyclide. 

It is an algebraic surface of the fourth order 
and is an enveloping surface of the family of 
hyperspheres that are tangent to n fixed 
hyperspheres. 

(6) Moving Coordinates. When we study the 
differential geometry of an m-dimensional 
surface in a space M on which a transfor- 
mation group G acts, it is often preferable to 
take a frame or frames at each point of W and 
consider a +connection among them. These 
frames are called moving frames, and the set of 
coordinate systems with respect to moving 
frames is called a moving coordinate system 
(-^111 Differential Geometry of Curves and 
Surfaces). 

C. Curvilinear Coordinates 

Let (xj , ^ 2 , . . . , x„) be a rectangular coordinate 
system on an u-dimensional Euclidean space 
E". If Xi — Xi{Ui,U 2 , ... , w„), t — 1, ... ,n, are func- 
tions of n variables {u^^,U 2 , of class 

C'' (r^ 1) and the tfunctional determinant 
D(xj, ...,x„)/D(ui, ...,u„) is not equal to zero 
in some open domain, then (m,,U 27 are 

considered local coordinates in £". We call 
them curvilinear coordinates of E”. A hypersur- 
face M; = constant (obtained by fixing the value 
of one of the variables u,) is called a coordinate 
hypersurface, and a curve constant {j¥=i) is 
called a coordinate curve. The line element ds 
of a Euclidean space E” is given by 

n n 

Z 9ijdUiduj, 
fc=l i,j=l 

k=i dui dUj 

Thus E" is equipped with a tRiemannian 
metric. However, as £" is tflat, its tcurva- 
ture tensor satisfies Rj^i — O. If the metric is 
diagonal, namely, if ds^ = 9 iduf, the co- 
ordinates are called orthogonal curvilinear 
coordinates. Moreover, ii 0 ^ = the 

coordinates are called isothermal coordinates. 
The metric is diagonal if and only if coordi- 
nate curves are mutually perpendicular at the 
points of intersection. Actually, curvilinear 
coordinate systems that are often used practi- 
cally are diagonal. The concept of curvilinear 
coordinates has been generalized to the case of 
tdiflferentiable manifolds and is utilized to 
determine their local coordinates. 

On any 2-dimensional Riemannian manifold 
there always exist isothermal coordinates in a 
neighborhood of any point [7]. 



(1) Curvilinear Coordinates on Planes or Spaces 

(-^ Appendix A, Table 3.V). Let (x,y) be rect- 
angular coordinates of a point in a Euclidean 
plane £^. We have the following coordinate 
systems on £^: 

Polar coordinates (r, 0), where 

x = rcos9, y = rsin0. 

Elliptic coordinates (2, g), where 
x^ = {;. + a^){H + a^)/{a^-b^), 

a>b>0, A> —b^> fi> —a^. 

Parabolic coordinates (a, jS), where 
x=—{a + P), y = y/—4aP, cc>0>(i. 

Equilateral (or rectangular) hyperbolic co- 
ordinates (u, v), where 

x^uv, y — (u^ — v^)/2. 

Bipolar coordinates (<^, rj), where 
X = a sin c^/(cosh ^ -j- cos r/), 
y = a sin >j/(cosh ^ + cos t]), 

—GO <^<00, 

Next we consider the case of a 3- 
dimensional Euclidean space £^ and let (x, y, z) 
be rectangular coordinates on £^. The follow- 
ing systems of coordinates on £^ are some- 
times useful. 

Cylindrical coordinates (r, 9, z), where 
x = rcos0, y — r sin 9, z — z. 

Spherical coordinates (r, 9, <p), where 
X — rsin0cos<p, y = rsin0sin(p, z = rcos0. 

Ellipsoidal coordinates (2, g, v), where 
x" = (2 + a^) ifi + a^){v + a^)/{a^ - b^) (a^ - c"), 
y^ = (2 + b^){g + b^) (v -t- b^)/{b^ — c^){b^ — a^), 
z^ = (2 -i- c^){g + c^)(v -I- c^)/(c^ — a^) (c^ — b^), 
a>b>c>Q, A> —c^> fi> —b^>v> —a^. 

These coordinate systems are all systems of 
orthogonal curvilinear coordinates. Suppose 
that we are given two rectangular coordinate 
systems (c^, t], Q and (x, y, z) sharing the same 
origin. The correlation of the two is given by 
Euler’s angles (0, (p, t/^), where 0, cp, and if/ are 
the angles between the z-axis and C-axis, zx- 
plane and z(-plane, and C<^-plane and C^-plane, 
respectively. The Euler angles 0, q>, and ijj are 
subject to the inequalities 0 ^ 0 ^ tt and O^cp, 
ij/ < 2n. They are often utilized in the dynamics 
of rigid bodies. 

(2) Multipolar Coordinates. Let P^, P 2 , ... ,P„ 
be m points in general position in an n- 
dimensional Euclidean space £", If we 
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denote by pi { ^ 0) the distance between a point 
X of E" and Pf, then {pi,p 2 , ■■■ Pm) can be re- 
garded as coordinates of a point X contained 
in a suitable domain of They are called 
multipolar coordinates. In particular, if m = 2 
they are called bipolar coordinates, and if m = 3, 
tripolar coordinates. When m>n, these coor- 
dinates satisfy m — n relations. Next let otj, 
a2, . . . , a„, be m hyperplanes in general position 
in £", m^n. For an arbitrary point X of we 
denote by the directed distance of X from 
each hyperplane a,-. The m-tuple ((^1,(^3, 
provides coordinates of X that are called 
multiplanar coordinates in £". When m>n, 
these coordinates satisfy m — n relations. In 
particular, when n = 2 and m = they are 
called trilinear coordinates. In this case, if we 
denote by 5 the area of the triangle defined by 
three lines aj, “3> by a^,a 2 , the 
lengths of the three sides of the triangle, then 
the trilinear coordinates (^1,^2’ ^3) satisfy a 
linear relation + aj <^2 + ^3 <^3 ^ 2S. 

(3) Tangential Polar Coordinates. In a Eu- 
clidean plane £^, we take a directed line Iq 
passing through a point O. For an arbitrary 
directed line g, let p be the directed distance 
between 0 and g, and let 0 be the angle be- 
tween Iq and g. Then {p, 0) are called tangential 
polar coordinates (Fig. 1). They are useful for 
representing tangent lines to curves in £^. Let 
C be an toval in £^. A line is called a ^^support- 
ing line of C if its intersection with C consists 
of a point or a line segment. In this case we 
take the origin O inside C and consider the 
coordinates {p, 11) of the supporting lines of C. 
Then the equation of C can be represented as 
p = p(6), where p(0) is a periodic function of 
period 2 tj:. The coordinates (p, 6) are especially 
useful when the function p(0) can be expanded 
in a tFourier series. In the case of Euclidean 
space £^, the notion of tangential polar co- 
ordinate system can also be defined by using 
tangent planes. 




Tangential polar coordinates. 

(4) Normal Coordinates. Let M be an n- 
dimensional tRiemannian manifold, and let 
T^(M) be the tangent space to M at a point A. 
For each tangent vector ve T^(M), we draw a 
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^geodesic through A with the initial direction v 
and take a point P on the geodesic such that 
the distance from A to F is equal to the length 
of V. Then the correspondence that sends v to 
£ is a tdiffeomorphism of a neighborhood of 
the zero vector 0 of T^{M) with a neighbor- 
hood of A in M. Therefore the components 
...,ti") of V with respect to a basis of 
T^(M) give the coordinates of the points P 
contained in a suitable neighborhood of A. We 
call them normal coordinates about the point 
A of M. In these coordinates, each geodesic 
passing through A is given by equations v‘ 

— Q(.‘r {i — 1,2, where the (a‘) are compo- 
nents of the unit vector in the direction of v 
and r is the parameter that represents the arc 
length from A to the point \ . . . , v"). In partic- 
ular, w hen n = 2, we fix a tangent vector Vq at 
A and denote the angle between v and \q by 0. 
Then (r, 0) are coordinates of P called geodesic 
polar coordinates. The notion of normal co- 
ordinates can also be defined for tLie groups 
or differentiable manifolds with taffine 
connections. 

D.' Local Coordinates 

Suppose that we have a space M that has a 
covering by a family of open neighborhoods 
with coordinate systems. If, for each pair of 
neighborhoods with nonempty intersection, 
the coordinate transformation in the intersec- 
tion satisfies certain specified conditions, then 
a mathematical structure on M can be defined. 

Let £ be a ttopological space. Suppose that 
<I> is a family of open sets in £ such that the 
union of any number of open sets in and the 
intersection of any finite number of open sets 
in (D also belong to d). A set F of thomeomor- 
phisms is called a pseudogroup of transforma- 
tions on £ if T satisfies the following three con- 
ditions; (i) Any homeomorphism /e F is de- 
fined on an open set C/e®, and /(C/)e®. (ii) 
When an open set C/eO is expressed as the 
union of a family (C/J of open sets C/, e®, a 
homeomorphism / defined on U belongs to F 
if and only if its restriction to each C/; belongs 
to F. (iii) For any open set C/g®, the identity 
mapping on U belongs to F, and if f,geT, 
then the inverse and the composition gof, 
if it exists, belong to F. 

Let £ and M be topological spaces. A 
homeomorphism (p : C/ K of an open set U in 
£ to an open set F in M is called a local co- 
ordinate system of M with respect to £. For 
two local coordinate systems and 

(p2 : C/2-^ F2, the homeomorphism 

(p2^ o : (pf ‘ n V2)~^(p2HV\ n F 2 ) 

is called a transformation of local coordinates. 

Let r be a pseudogroup of transformations 



90 Ref. 
Coordinates 



338 



on E. A set 27 of local coordinate systems of M 
with respect to E is said to define a F -structure 
on M if 27 satisfies the following two con- 
ditions: (1) the totality of the images of local 
coordinate systems belonging to 27 covers M; 

(2) if two local coordinate systems q>i and ^2 of 
27 have a transformation of local coordinates, 
it belongs to F. Now suppose that two sets 27 
and 27' of local coordinate systems define F - 
structures on M. If the union of 27 and 27' 
defines a F-structure, then we say that the first 
two F-structures are equivalent. Let F be a 
pseudogroup of Miffeomorphisms, each de- 
fined from an open subset of the n-dimensional 
space R" onto another open set. If a F- 
structure is defined on a space M, then M is 
an n-dimensional tdifferentiable manifold. 

On the other hand, let F be a pseudogroup 
of complex analytic homeomorphisms in an n- 
dimensional complex number space C”. If a F- 
structure is defined on a space M, then M is an 
^-dimensional tcomplex analytic manifold. 
Locally homogeneous spaces, tfoliated mani- 
folds, the tfiber bundles are all equipped with 
local coordinate systems with suitable F - 
structures. 
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Covering Spaces 

A. General Remarks 

A continuous mapping p: Y-^Y of an tarcwise 
connected topological space Y onto a con- 



nected topological space Y is called a covering 
mapping (covering map) if the following con- 
dition (C) is satisfied: (C) Each point of Y has 
an open neighborhood V such that every 
^connected component of p"^(F) is mapped 
homeomorphically onto V by p. Here we note 
that Y is in fact arcwise connected. 

If there is a covering mapping p:Y^Y,v/c 
call y a covering space of Y and ( Y, p, 7) a 
covering. In particular, for a tdifferentiable 
manifold Y, if Y is also a differentiable mani- 
fold and p is differentiable, then f is called a 
covering (differentiable) manifold of Y. (In the 
theory of tRiemann surfaces, a tcovering sur- 
face may have some tbranch points violating 
condition (C). Upon removing such points, we 
obtain a covering space as defined above.) 

For each tpath w.I~*Y {1 = 10, 1]) of Y, a 
path vv : /-> y with pow = w is uniquely deter- 
mined by the point w(l)ep“^(w(l)), and a 
bijection :p“’(w(l))-»p^^(w(0)) is deter- 
mined by w^(w(l)) = vv(0). Thus there exists a 
one-to-one correspondence between 
and p Hy) for every pair of points y, y' of Y, 
and (y, p, y,p"^(yo)) is a tlocally trivial fiber 
space with discrete fiber p^Hlo)- When the 
cardinal number of p^^(y) is a finite number n, 
we call ( y, p, y) an n-fold covering. In this case, 
for a fioop w(/,/)^(y,yo) with base point yo? 

is a permutation of the n 
elements in p“'(yo), and we obtain a homo- 
morphism of the fundamental group 7ti(y) = 
7ri(y,yQ) of y into the tsymmetric group S„, 
given by the correspondence w-»w^. The 
permutation group 93i, which is the image of 
this homomorphism, is called the monodromy 
group of the «-fold covering. 

Two coverings (y,P;, T) (i= 1,2) are said to 
be equivalent if there is a homeomorphism 

: Ti -> 72 with P 2 o ip = p, ; such a (p is called 
an equivalence. In particular, a self-equivalence 
<p : 7-^ y of a covering {Y,p,Y) is called a cover- 
ing transformation. The set n of all covering 
transformations forms a group by the compo- 
sition of mappings, which is called the covering 
transformation group of 7. We call (7, p, 7) 
a regular covering if for each ye 7 and y^, 
y 2 ^P~\y\ there exists a unique covering 
transformation that maps yj to y 2 - Iri this 
case, the torbit space Y/n is homeomorphic 
to y, (7, p, 7, n) is a tprincipal bundle, and 
the monodromy group 931 is isomorphic 
to n. 

For a covering (7, p, 7), we call 7 a covering 
group of 7 if 7 and 7 are topological groups 
and p is a homomorphism. Then (7, p, 7) is a 
regular covering, and its covering transfor- 
mation group is isomorphic to p~^(e) {e is the 
identity element of 7), which is a discrete 
subgroup lying in the center of 7 (^ 423 Topo- 
logical Groups O). 
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B. Universal Covering Spaces 

For a covering {Y,p, Y), we have the following 
relations of the thomotopy groups: p^:ni(Y)-* 
ni(Y) is isomorphic (monomorphic) for i^2 
(i = 1), and ^i(Y)/p^(ni(Y)) is in one-to-one 
correspondence with p~'(yo) (yo^ 5^)- ff ^ 
is tsimply connected, Y is called a universal 
covering space of 7; if in addition 7 is a cover- 
ing group, 7 is called a universal covering 
group of 7. 

For a tlocally arcwise connected space 7, 
a covering ( 7, p, 7) is regular if and only if 
P*(^i(^)) is a normal subgroup of n^(Y), and a 
universal covering space of 7 is a covering 
space of any covering space of 7. Moreover, if 
7 is a topological group, any covering space 7 
of 7 can be given a unique topological group 
structure with which 7 is a covering group of 
7. 

Let 7 be an arcwise connected, locally arc- 
wise connected, and tlocally simply connected 
space. Then the following classification theo- 
rem of coverings holds: The set of equivalence 
classes of coverings of 7 is in one-to-one cor- 
respondence with the set of conjugate classes 
of subgroups of the fundamental group tii( 7); 
in particular, the equivalence class of a cover- 
ing (7,p, 7) corresponds to the conjugate 
class of the subgroup p,^(7ii(7)). Also, there is a 
unique universal covering space 7 of 7 up to 
homeomorphism. If in addition 7 is a topolog- 
ical group, then 7 is a unique universal cover- 
ing group of 7 up to isomorphism of topolog- 
ical groups. Such a space 7 is obtained as 
follows: Consider the tpath space Q(7;_Vo> 
of all paths in 7 starting from a fixed point 
y^e 7, and define two paths Wq, :(/,0)-> 
(7,yo) such that Wo{l) = w,(l) to be equiva- 
lent if and only if there is a thomotopy w,: 

(/, 0) —( 7, yo) with wd 1) - Wo( 1 ) (0 < t < 1 ). 

Then we obtain the tidentification space 7 of 
0(7; yo! L) by this equivalence relation and 
the mapping p: 7->7by p{w} = w(l); this 7 is 
the universal eovering spaee of 7. 

Let ( 7, p, 7) be a regular covering with the 
covering transformation group n. Then there is 
a tlocally trivial fiber space {Y',q, B, 7) such 
that the total space 7' has the same t(co) ho- 
mology groups as 7, and the base space B is 
an tEilenberg-MacLane space K{n, 1). The (co) 
homology tspectral sequence of this fiber space 
is called that of the given regular covering 
(7,p, 7), is a bigraded module tassociated 
with a certain filtration of the tsingular (co) 
homology module H{Y), and £3 is the (co) 
homology module H{n; H{Y}) of the group n, 
where n operates on the coefficient module 
H{Y) via the induced homomorphisms of 
covering transformations (^ 148 Fiber 
Spaces). 
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For any group n, there is a regular covering 
(£„, p, BJ with the covering transformation 
group n such that E„ is tcontractible; B^ is an 
Eilenberg-MacLane space K{n, 1). We can take 
(1 -sphere) as B^, of the infinite cyclic group 
Z, and the following infinite lens space as B^^ 
of the finite cyclic group (-^ 70 Complexes). 

C. Lens Spaces 

Let /c be a positive integer and /i ,...,/„ be 
integers prime to k. Let 

l^oP + ••• + k«l^ = 1} be the unit sphere 
in the (u-t- l)-dimensional complex linear space 
and define the rotation y by y(zo,Zi, 

. . . , z„) = (zq exp 2ni/k, Zj exp 2nlii/k , . . . , 
z„exp2nlj/k). Then the torbit space 
= L{k; /i , . . . , /„), where Z^ = Z/kZ is interpreted 
as the cyclic group generated by y, is called 
a lens space. It is art orientable {2n -h 1)- 
dimensional tdifferentiable manifold. Also, the 
infinite lens space L®(k) = L(/c; 1, ..., 1,...) 
is defined by taking n=co; the infinite sphere 
S“ is a /c-fold covering space of L^{k), and 
L"^{k) = B 2 ^^ — K{Z^, 1). Its tcohomology ring 
is given as follows. 

(1) For integral coefficients, ff^‘‘^^(L‘®(/c)) = 

0, H^‘{L'^{k)) — Z^^ (i>0), and the tcup prod- 
uct of generators of degree 2i and 2j is a gen- 
erator of degree 2{i+j). 

(2) Let k = pk' {p is a prime). If p 7^ 2, or p ^ 2 
and k' is even, //*(L®(fc); Z^ — /\{eYl®Zp[e{\. 
If p — 2 and k' is odd, H*(L®(/c);Z2) — Z2[Ci] 

(e, is an element of degree /). Here /\ indicates 
the ^exterior algebra over Zp, and Zp[ ] the 
tpolynomial ring over Zp. 

Two lens spaces L{k; /j , . . . , /„) and L(k'; /', , 
are of the same homotopy type if and 
only if k = k' and there is an integer m prime 
to k with 

/j .../„= ...l'„{modk) 

[8]. Furthermore, the condition k — k',l = 

± /'-^(modk) holds if and only if L{k; 1) and 
L(k'; I') are homeomorphic. (Sufficiency is 
shown in [1]; necessity follows from the fact 
that the tHauptvermutung is valid for com- 
binatorial 3-manifolds and that the condition 
holds if the polyhedra L(k; /) and L(k; 1') have 
isomorphic subdivisions [6].) Also [7, 1 1] 
and 65 Combinatorial Manifolds. 
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A. Space Groups and Point Groups 

Let G be a discrete subgroup of the group of 
^motions © in the real n-dimensional Eu- 
clidean space V. If G contains n linearly inde- 
pendent translations, then G is called an n- 
dimensional crystallographic space group, a 
space group, or a crystallographic group. The 
following assertions are known to be equiva- 
lent to each other: (i) A discrete subgroup G of 
6 is a space group; (ii) ©/G is compact; (iii) 
V/G is compact; (iv) There is a compact subset 
F of K such that F=GP-Ug^G0^[l]-The 
subgroup T consisting of all ^translations in G 
is called the lattice group of G. T is a normal 
subgroup of G and is generated by n linearly 
independent translations, say, G,t2, 

Take a point x of V. Then the T-orbit of the 
point X is called a lattice of G (or T). t j , . . . , 
can be identified with a basis of V. Let K be 
the quotient group G/T and k — gT an element 
of K. Then k gives rise to a linear transforma- 
tion k on T via the formula k{t) = gtg~^ (the 
product in G), te T, and K can be regarded 
as a subgroup of the torthogonal group of 
V via the above identification of T with a 
lattice in V. That is, the set of orthogonal 
transformations stabilizing x, {0|(0(y) = x-l- 
9(y) — 9ix)>y^ k'), 9 ^G} forms a group isomor- 
phic to K. The quotient group K is a finite 
group and is called the point group of G. A 
group isomorphic to a point group of an n- 
dimensional space group is sometimes called 



an n-dimensional crystallographic group. An 
algebraic characterization of space groups 
among abstract groups is given by “A group G 
is isomorphic to an n-dimensional space group 
o G has a normal Tree Abelian subgroup of 
trank n, which is maximal Abelian and has 
finite index” [1]. 

By representing the action of X on T in 
terms of a basis of T, K can be regarded as a 
subgroup of GL(n, Z). Hence we obtain the 
following crystallographic restriction; If K 
contains an element of order m, then n ^ (p{m). 
Here cp is the tEuler function. For example, 
if n = 2, 3, then m = 1, 2, 3, 4 or 6; and if n = 

4, 5, then m — 1, 2, 3, 4, 5, 6, 8, 10, or 12. The 
number of nonconjugate finite subgroups of 
GL(n, Z) is finite (Jordan-Zassenhaus theorem; 
- [2,3,21]). 

Two space groups Gj and G2 are called 
equivalent if they are conjugate via an taffine 
transformation / of V, i.e., G2 = fG^f^^. Let 7^ 
and K, be the lattice group and the point 
group of Gj (i= 1, 2), respectively. Then T 2 = 
hTj^ and K 2 = hK^h~^ for the linear trans- 
formation h induced by / Moreover, there is 
an orthogonal transformation k such that 
X2 = this k can naturally be consid- 

ered as an element of GL{n, Z). Gj and G2 are 
equivalent <=> Gj and G2 are isomorphic (as 
abstract groups) [1]. In applications it is often 
required that in the definition of equivalence / 
be an torientation-preserving affine transforma- 
tion. Under this definition, there can be a pair 
Gj , G2 such that G^ and G2 are mutually iso- 
morphic but not equivalent, since they are 
conjugate only by means of an torientation- 
reversing affine transformation. In this case, 
they are called enantiomorphous to each other, 
or the pair is called an enantiomorphic pair 
(Table 1 below). For a given dimension n, 
there is only a finite number of equivalence 
classes of space groups [1]. We also conclude 
from this that point groups have finitely many 
equivalence classes up to conjugacy in the 
orthogonal group 0(V) or in GL{n, Z). 

B. Crystal Classes, Bravais Types 

Let T be an n-dimensional lattice in n- 
dimensional Euclidean space V, and K a finite 
subgroup of the orthogonal group 0{V). De- 
note by a pair (T, X) a tfaithful linear Tepre- 
sentation of X on T, i.e., a ^^monomorphism 
of groups X^Aut(T). A space group G deter- 
mines a pair (T, G/T). Conversely, for any pair 
(T, X) the following holds: “Any group texten- 
sion of X over the tkernel T is isomorphic to a 
space group” (^ the algebraic characteriza- 
tions in Sections A and C). Thus to each pair 
(T, X) there corresponds a set of space groups. 
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Two pairs (Tj, Kj) and (Tj, K2) are called 
arithmetically equivalent (this is denoted by 
(Ti,Xi)%(T2,K2))if there exists an invertible 
linear transformation geGL{V) such that T2 
= gTi, K2 = gKyg~^. These pairs are called 
geometrically equivalent or simply equivalent 
(this is denoted by (T(, X J~(7’2,K2)) if there 
exists a geGL{V) such that K2 — gKig~^. The 
relation s^(~) is an equivalence relation, and 
equivalence classes are called arithmetic (geo- 
metric) crystal classes. The set of geometric 
crystal classes is in one-to-one correspondence 
with the set of equivalence classes of point 
groups up to conjugacy, and is sometimes 
called the set of crystal classes. The geometric 
crystal class of (T,X) is usually denoted by K. 

Now let T be a lattice. Then the group of all 
orthogonal transformations that leave T in- 
variant is called the Bravais group of T and 
is denoted by B{T), i.e., B{T) = {geO{V)\gT= 
T}. The group B{T) is finite and determines 
a pair (T, B(T)). Two lattices Tj and T2 are 
called arithmetically equivalent (this is denoted 
by Ti % T2) if (Tj, B(Ti)) and (T2,B(T2)) are 
arithmetically equivalent. An equivalence class 
is called a Bravais type, and an arithmetic 
crystal class determined by (T, B{T)) is called a 
Bravais class. For each Bravais type, a repre- 
sentative is said to be its Bravais lattice (Fig. 3 
below). 

Define an torder relation on the set of lat- 
tices belonging to an arithmetic crystal class as 
follows. When (Ti,Ki)«i ( 72,^2), define T2^Ti 
if there exists a geGL{V) such that T2^gT^, 
K2=gKig \ and B{T2)^gB{T^}g~^. Note that 
T^^T2<>Ti^ T2 and T2 ^ T, . In each arith- 
metic crystal class, there is a pair (T, K) with a 
tminimal T with respect to this order relation. 
In this case the Bravais type of T is referred to 
as the Bravais type of the class of (T, K). This 
lattice T is, intuitively, of the most general type 
appearing in the class. Now define a relation 
Tj T2 if there are pairs (7^, K J, i = \,2, such 

that (T, ,Xi)~(T 2,X2) and 7^ is minimal in the 
class of (7],X,). The equivalence class of the 
equivalence relation generated by the relation 
~y is called a crystal family. 

Summarizing, we have defined the following. 
Let y', be the set of equivalence 

classes of space groups, arithmetic crystal 
classes, geometric crystal classes, Bravais 
types, and crystal families, respectively. Then 
we have the relations shown in Fig. 1. (All 
arrows in Fig. 1 are surjective.) is nothing 
but the tcoproduct of ^ and ^ over j?/, 






I i 






Jfc: 
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Fig. 1 



Fig. 2 



Moreover, there are injective mappings 
and such that the composites 

^ and are identities. The first is 

the mapping sending a Bravais type T to the 
arithmetie erystal class defined by (T,B(T)). 
The second one sends a class of (T, K) to 
a tsemidirect product of T and X (^ 190 
Groups N). A space group belonging to a class 
of the image of .s/ c+y is said to be symmorphic 
or symmorphous. A class which belongs to the 
image Jf” of the composite is called 

a holohedral or holosymmetric class, or a holo- 
hedry. Define a mapping M of ^ to the power 
set ^(^) by M = (J^<-ja/)-(=G/^^)"f For two 
elements and C2 of define Cj if 
M(Ci) = M(C2). Each equivalence class is 
called a crystal system, and the set of crystal 
systems is denoted by *^y. Two elements C 
and C' of ^ belong to the same crystal family 
if and only if there exists a sequence of ele- 
ments of Cl , C2 , . . . , Q such that Ci = C 
and Ck = C and M(C;)nM(C,+i)y 0 for 
1, . .. , /c — 1. Therefore there is a surjective map- 
ping (Fig. 2). When n^4, the com- 

posite Jf c^^-><g'y is bijective. It should be 
remarked that when n ^ 3, there is no map- 
ping ^-»^y such that and 

^^^y coincide. 

The numbers of elements of these sets are 
shown in Table 1. 



Table 1 





n = 1 


2 


3 


4 


crystal 

families 


1 


4 


6 


23 


crystal 

systems 


1 


4 


7 


33 


M, Bravais types 
point groups 


1 


5 


14 


64 


(isomorphic 

classes)'’ 


2(2) 


10(9) 


32(18) 


227(118) 


arithmetic 

crystal 

classes 


2 


13 


73 


710 


<9, space groups 


2 


17 


219" 


4783“ 



“ When n < 4, the number of elements of coincides 

with that of '^.9’. 

‘’The number in parentheses denotes the number of equiva- 
lence classes under algebraic isomorphism. 

‘’•‘‘The number of equivalence classes under orientation- 
preserving transformations is 230 and 4895, respectively. 



C. Construction of Space Groups 

Take an element (T, X) of an arithmetic crystal 
class. Since X is finite and T is tfmitely gen- 
erated, the ■^cohomology groups H'{K, T) are 
finite groups (^ 200 Homological Algebra G). 
Let a be a "fsecond cocycle representing an 
element [a] of H^{K, T). Let (T,K)^ be the 
textension of X with kernel T correspond- 
ing to [a] (^190 Groups N). Then the set 
of elements of (T, X)^ is given by {<t, /c> | 
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teT,keK}, and the product is defined by 

Since H‘{K, K) = 0, 1, there is a tfirst co- 
chain V) such that a{fei,/c 2 ) = /8(/ci)-|- 

and the tfirst cocycle ^ on 
V/T defined by j5 induces an isomorphism 
HHK, V/T)^H^{K, T), [^]H->[a], Define an 
action of <t, /c> on K by 

{t,k}{x) = kx + t + (i{k), xeV. 

Then {T,K)^ acts on K as a space group. Con- 
cerning their equivalence, this result holds: 
“The equivalence classes of {T, K)„, [a] e 
H^(K, T) are in one-to-one correspondence 
with the orbits of N{K), the normalizer of K 
in Aut(T) acting on H^(K, T)” [4], The action 
of heN{K) is given by ''oL{k) = ha{h~^ kh), ae 
C^(X, T). 

For a pair {T, K) the weight lattice or the 
weight group T* of (T, K) is defined by {ve V\ 
v — kv£T,ykeK). Let R be an irreducible re- 
duced +root system, Q{R) its lattice, and 1T(/?) 
its tWeyl group. Then for a pair (6(/?), fF(f?)), 
Q(R)* is nothing but the usual weight lattice 
P(R). 248 Lie Algebras). Suppose that K of 

{T, K) does not contain the central inversion 
— I„:vh^—v,veV. Denote the group K U 
{ — I„)K by +K. Then there is an isomorphism 

H^i±K, V/T)^IHHK, F/T)]2 

x(T*IT)/2(T*/T). 

Here [ ]2 denotes the subgroup of elements 
of order not greater than 2 [5]. This isomor- 
phism provides powerful means for the con- 
struction of space groups. 

In constructing a pair (T, K), it is useful to 
take a basis of T in a special form. Let 
. . . , t„} be a basis of a lattice T, and set aij = 

(t;, tj), where (,) is the inner product in V. The 
nxn matrix A =(ay) tsymmetric and tposi- 
tive definite, and the numbers are called 
the lattice constants of T. A basis ... , t„} is 
called a reduced basis if the tquadratic form 
{Ax,x) = 'LaijXiXj is reduced 348 Qua- 

dratic Forms). A lattice admits at least one re- 
duced basis, and in this case ± 2a;^ ^ ^ ajp 

i<j [6, Table 5.1; 7]. The totality of an n- 
dimensional reduced basis forms in a nat- 
ural manner a semialgebraic set of dimension 
(n + 2)(n — l)/2 in its closure, say Q„, 

is compact. On the other hand, the set L„ = 

(R+ X 0(R”))\GL(n,R)/GL(n,Z) is identi- 
fied with the set of equivalence classes of n- 
dimensional lattices under the relation of 
“same shape.” There is a natural inclusion 
L j U . . . U L„ c+g„, and L„ is dense in Q„. More- 
over, the inclusion gives a bijective homeo- 
morphism when n<4 [8]. 

A finite subgroup K of the orthogonal 
group 0(F) is said to be fully transitive if there 



is a set S = ...,gj} that spans F on which K 

acts ttransitively and if K has no tinvariant 
subspace in F. In this case one can choose S as 
either of the following: (i) the primitive hyper- 
cubic type, 5 = {ci , . . . , e„}, (c;, e^) = (tKro- 
necker delta); (ii) the primitive hyperbolic type, 

s={/, 

(ii. /j) = - 1 /«. i. y = 1 . . . . , n + 1 , i (especially, 
£?;ii; = 0)[IO]. 



D. Color Symmetry Groups, Twinning 

A color symmetry group, or a colored symme- 
try group, associated with a space group G is a 
pair (G, G') such that G' is a subgroup of G 
with finite index. The index r^[G: G'] is called 
the number of colors. We call (G, G') a white 
group when r= 1, a black and white group or a 
magnetic group when r~2, and a polychroma- 
tic group when r ^ 3. G' is also a space group 
with the lattice group T'~GT\ T, where T is 
the lattice group of G. Two color symmetry 
groups (Gi, G'l) and (G 2 , G 2 ) are called equiva- 
lent if there exists an +affine transformation / 
such that G2 -/Gi/" 1 and G^=/G'i/"L Then 
(G, , G[) and (G 2 , G 2 ) are equivalent if and only 
if there is an isomorphism a:Gi->G2 such that 
<t(G)) — G 2 . A color symmetry group (G, G') is 
called lattice equivalent if T' =T, K' a K, and is 
called class equivalent if T cT, K' — K. Take a 
pair (G, G'), and let G" be the ^inverse image of 
K' under the canonical tepimorphism G^K. 
Then (G, G") is lattice equivalent and (G", G') is 
class equivalent. Fix the color number r. Then 
(i) the equivalence classes of lattice equivalent 
color symmetry groups are in one-to-one 
correspondence with the conjugacy classes of 
subgroups of index r, and (ii) the number of 
equivalence classes of class equivalent color 
symmetry groups is finite. When r is prime to 
the order of K, the classes are in one-to-one 
correspondence with K-invariant sublattices 
Tj of index r in T. Therefore, when the number 
of colors is given, the set of equivalence classes 
of n-dimensional color symmetry groups is 
finite. 

In case (i), a pair (K, Xj) is called a color 
point group, and in case (ii), (T, T,) is called a 
color lattice. In particular, when r = 2, they are 
called a black and white point group and a 
black and white lattice, respectively. Their 
equivalence is defined in a similar way. 

Take a color lattice (T, T^) of color number 
r. Let m be the least natural number such that 
mTczT^, then m is a divisor of r. Let m = 

Pi‘ • • • PkL Pi ^ Pj j) be the decomposition 

of m into its prime factors. Then there is a 
sequence of K-invariant sublattices of T, 7), 
i = 2, . . . , k — 1, such that (i) T^ciTj... <= c 
Tk+i = T; (ii) pf' T;+i c= 7), 1 - 1, . . . , k; (iii) the 
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submodules of i = l, Z"® 

ZIppZ are uniquely determined independently 
of the choice of such a sequence {7^}. Thus 
the existence of color lattices is related to the 
tmodular representation of K (-^ 362 Repre- 
sentations G). For example, if r is a prime 
number, then the representation of K on T/rT 
~ Z" ® Z/rZ is the reduction modulo r of 
that of K on T. 

A mathematical treatment of twinning is 
given as follows. Let V be an n-dimensional 
Euelidean space and T an n-dimensional lat- 
tice in V. The volume of T, vol(T), is defined 
by vol(T) = (|det/l|)’/^, where /l=(ay) is the 
matrix of lattice constants. This definition is 
independent of the choice of a basis of T. If 
T' is a sublattiee of index r in T, vol(T') — r • 
vol(T) follows. Let X be a finite subgroup 
of 0(K), Tj and T 2 K -invariant lattices, and 
U a hyperplane in V. Then the quadruple 
{Ti,T 2 ,K,U) is called a twinning structure if 
it satisfies the following: (i) vol(Ti) = vol(T2); 

(ii) the lattice 7^ is minimal in the arithmetic 
crystal class of (7], K), / = 1, 2; and (iii) T^r\U = 
T2 D 1/ = Tq, and Tq is an (n — l)-dimensional 
lattice in U. Two arithmetic crystal classes are 
said to be twinable if there exist representa- 
tives (Ti,7C), {T 2 ,K), and a hyperplane U such 
that (Ti, Tj, K, U) is a twinning strueture. Let 
(Tj , K) and (T2, X) be two arithmetic crystal 
classes. If there exists a X-invariant hyper- 
surface U such that Tq = Tj fU/ ^ T2 fl L is 
an {n— l)-dimensional lattice in U, then two 
classes (Tj , X) and (T2, X) are twinable. 



E. Three- and Two-Dimensional 
Crystallography 

Three-dimensional crystal classes are listed in 
Appendix B, Table 5. IV. To name these crystal 
classes, both Schoenflies’ notation and the 
international notation are used. 

Fix an torthogonal tright-handed system 
{x,y,z) in V. For each crystal system, define 
three axes as follows. The first axis is the z- 
axis. The second is the line x — y = z for the 
cubic system and the x-axis for the others. The 
third is the y-axis for orthorhombic, trigonal, 
and hexagonal systems, and the line x = y, z = 0 
for the others. Then a crystal class is expressed 
by its generators with respect to these axes. 

The symbol k{k^\, 2,3,4, 6) denotes the rota- 
tion by the angle 2n/k around the axis; /c is a 
composition of the rotation k and the central 
inversion —v,ve V), and k/m (k — 2,4, 6) 
means the composition of k and the reflection 
about the hyperplane perpendicular to the 
above axis. The symbol m denotes 2. Usually, 

1 is omitted. This is the full international nota- 
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tion and a set of generators can be read from 
it. The short international notation is also used. 

Schoenflies’ notation consists of the letters 
C„, D„, S„, T, O, and the subscripts h, v, d. The 
letters denote the group and order: C„ is the 
^cyclic group of order n; D„ is the dihedral 
group of order 2n; T is the ^tetrahedral group 
isomorphic to the fourth ^alternating group; O 
is the ■''octahedral group isomorphic to the 
fourth tsymmetric group; and 82 = !, 84 = 4, 8^ 
= 3. The subscripts h, v, d mean that the group 
is generated by reflections in the hyperplane 
which is horizontal, vertical, or perpendicular 
to a diagonal, respectively. The orientation is 
appropriately given. For example, in C„h and 
there is an nth order rotation around the 
z-axis and the horizontal plane is the xy-plane. 

The above notations are also applied to 
noncrystallographic finite subgroups of the 3- 
dimensional orthogonal group [13]. 

The symbol for Bravais type is specified by 
the name of its crystal system and one of the 
letters P, A, B, C, F, I. 8ix kinds of simple 
Bravais lattices are given as follows. A simple 
lattice is generated by (a, h,c} and satisfies the 
following condition. Denote the lengths of 
vectors by a= ||a||, b— ||h||, c= ||c||, and let a = 
L (b, c), fi— /_ (c, a), 7 = (a, b) be the angles 

between the indicated pairs of vectors. For 
triclinic P,a^b^c^a, for mono- 

clinic F, a^b^c^a, x = y = 90° jS; for or- 
thorhombic P, a^b^c^a, P — y — 90°; 
for tetragonal P,a = b^c, a = (^ = y = 90°; for 
hexagonal P or trigonal P,a — b^c,x = [i — 90°, 
y — 1 20°; for cubic P, n — i> — c, a — = y = 90°. 
Let T be one of the above lattices generated by 
{a,b,c}. Then the lattices generated by {a, b, 

(b + c)/2}, {(c-l-a)/2,b,c}, {a,(a + b)/2,c} {(a-i- 
b + c)/2, b, c}, {(a -h b)/2, (b + c)/2, (c + a)/2} are 
called A, B, C, I, F lattices determined by T. 
The original lattice T is called the primitive 
lattice. The trigonal R lattice is generated by 
{a', b', c'} satisfying a' ~b' = d , x' = (i' = y' < 

120°, ^90°. 8et a = a' — b', b = b' — c', c==a' + 
b'-l-c'. Then the hexagonal P lattice gener- 
ated by {a, b,c} is defined to be the primitive 
lattice of the trigonal R lattice. The fourteen 
Bravais lattices are illustrated in Fig. 3. 

An arithmetic crystal class (T, X) is denoted 
by the symbol of the Bravais lattice T and the 
symbol of X with respect to the action of X on 
T. Let the primitive lattice of T be generated 
by (a, b,c}. Put the vector a on the x-axis, and 
b on the xy-plane. The action of X is then 
determined by the international notation for 
X, except when K = 42m, 32, 3m, 3m, 6m2, 
and T is the primitive or I lattice. In these 
cases two kinds of actions interchanging the 
second and the third symbols are not equiva- 
lent. If such is the case, 1 is inserted into the 
third position for trigonal systems. Let (X) be 
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the symbol determined as above and (T) be 
the symbol of T. Then the action (T, K) is 
expressed by ((T), {K)). For example, when K 
— 32, there are three arithmetic crystal classes 
(F,321), (F,312) (R,32). 





Tetragonal Trigonal Hexagonal, 
Trigonal 




P I F 



Cubic 



Fig. 3 

The conventional unit cell for each of the 14 Bravais 
lattices. 



The notation of space groups determined by 
(T, K) is given by writing both the symbol (T), 
and the symbols of actions of elements on V 
which appear in the symbol (X) defined above. 
The symbol (T)(X) denotes the +semidirect 
product of T and K by the action (T, K), and 
is the symmorphic space group determined 
by (T, K). Other nonsymmorphic groups are 
designated by replacing symbols in (X). If k is 
replaced by ( j = 1, . . . , ^ — 1), then kj is a k- 
fold screw glide with pitch j/k, that is, the com- 
posite of the rotation k and the translation 
jf/k, where f is the vector of minimum length 
of T along the axis of k. If m is replaced by 
a, b, c, n, d, then it is a glide reflection, that 
is, the composite of the reflection m and the 
translation in the reflecting hyperplane with 
the direction of a, b, c, a face diagonal and 
a diamond, respectively. See [6] for the pre- 
cise meaning of the notation discussed in this 
paragraph. 

The notations for 2-dimensional point 
groups and space groups can be adopted from 
the ones for 3-dimensional ones by setting the 
reference plane as the xy-plane and the z-axis 
perpendicular to that plane. In this setting, the 
notation of a plane point group is given by the 



corresponding point group in 3-space, and 
similarly for plane space groups. There is only 
one glide operation, denoted by g. The motion 
g is the composite of the line reflection about 
an axis and half the translation along the axis 
with the vector of minimum length of T. In 
[13], the precise group-theoretic description of 
space groups can be found. It should be noted 
that the notations for the two groups p3m\ 
and p3\m are frequently interchanged in the 
literature (including earlier editions of [13]). 



F. History 

The first mathematical study of the structure 
of crystals was done by the mineralogist J. F. 

C. Hessel (1830). He enumerated the finite 
subgroups of the 3-dimensional orthogonal 
group. Afterward, his result was rediscovered 
many times. A. Bravais (1850) showed that in 
3-dimensional Euclidean space there are only 
14 different lattices. All space groups consist- 
ing of orientation-preserving transformations 
were determined by C. Jordan (1868, 1869) and 
L. Sohnke (1879). Finally, almost all the space 
groups were determined independently by E. 

S. von Fedorov (1885, 1889) and A. Schoenflies 
(1887, 1889). Probably as a result of com- 
parison of each other’s lists, they established in 
1891 the existence of the 230 space groups. 
Then A. Barlow (1894, 1896) derived these 
space groups by adding reflection operations 
to Sohnke’s 65 groups. 

In 1900 in his eighteenth problem D. Hilbert 
raised the question of whether the number 
of equivalence classes of space groups of a 
given dimension is finite. This was answered 
affirmatively by L. Bieberbach (1910 [1]). An 
algorithm for determining space groups was 
given by H. Zassenhaus (1948 [4]). Their 
results, including [2,3], and work by C. Her- 
mann (1949 [9], 1952 [10]) gave a solid 
foundation for n-dimensional crystallography. 

Then followed a concrete treatment of 4- 
dimensional crystallography. A. C. Hurley 
(1951), using an earlier work of M. E. Goursat 
(1889) on finite subgroups of the 4-dimensional 
special orthogonal group, determined 221 of 
the 4-dimensional point groups. In the 1960s 
A. L. Mackay and G. S. Pawley (1963) and 
others gave 56 of the 4-dimensional Bravais 
types. After E. C. Dade (1965) determined 
9 maximal finite subgroups of GL(4, Z), R. 
Billow (1967) and H. Brown, J. Neubiiser, and 
H. Zassenhaus (1968) independently deter- 
mined 710 arithmetic crystal classes. They 
reconfirmed the result with H. Wondratschek, 
and as a result, 64 Bravais types and 227 
geometric crystal classes were established 
(1971 [15]). They ran Brown’s computer 











345 



program for the determination of space groups 
based on [4] and obtained 4,783 space groups. 
Their work culminated in a book (1978 [16]). 
There have been several attempts to establish 
a unified treatment of n-dimensional crystal- 
lography [5, 8, 14, 17] and crystallography 
in complex Euclidean spaces. 

Two-dimensional space groups were em- 
pirically known in ancient times, as demon- 
strated by artistic decorations. All the 17 
2-dimensional space groups appear in the 
tile patterns of the Alhambra in Granada. A 
discrete subgroup of the plane motion group 
whose translation subgroup is only rank one 
is called a frieze group. 

The notion of black and white groups was 
introduced by H. Heesch (1930), H. J. Woods 
(1935), and A. V. Shubnikov (1951). Only with 
the introduction of the use of neutron diffrac- 
tion techniques did it become apparent that 
these groups could be used in the description 
of magnetically ordered structures [12]. Color 
symmetry groups were defined by B. L. van 
der Waerden and J. J. Bruckhardt (1961 [1 1]) 
for an arbitrary number of colors. A mathe- 
matical treatment of the twinning structure 
is found in papers by T. Ito (1938) and R. 
Sadanaga (1959). 
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Curves 

A. Introduction 

In the beginning of his Elements, Euclid gave 
definitions such as: A line is a length having no 
width; an end of a line is a point. However, he 
left notions such as width and length unde- 
fined. Thus his definitions were far from satis- 
factory. Actually, it was only during the latter 
half of the 19th century that efforts were made 
to obtain exact definitions of lines and curves. 
Euclid, among others, distinguished two kinds 
of curves: straight lines and curves. Nowadays, 
however, lines in the sense of Euclid are called 
curves, and a straight line is considered a 
curve. A first effort to give an exact definition 
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of a curve using analytic methods was made 
by C. Jordan in his Cours d'analyse I ( 1893 ). 

B. Jordan Arcs and Jordan Curves 

Following Jordan, we define a continuous plane 
curve C to be the image of a tcontinuous 
mapping sending the interval [0, 1] into the 
Euclidean plane E^. Namely, C is the set of 
points (x, y) in such that 

x=f(t), y = g{t), O^f^l, 

with continuous functions /, g defined on 
[ 0 , 1 ]. A continuous curve is also called a 
continuous arc. We call (/( 0 ),gf( 0 )) and (/(I), 
0(1)) the ends of the arc. Given continuous 
functions /, g defined on (0, 1), the set {(x, y)| 

X = f{t), y = g{t), 0 < f < 1 } is called an open 
arc. More generally, the image of a continuous 
mapping of [0, 1],(0, 1), [0, 1), or (0, 1] is called 
an arc (or curve). Suppose that C is an arc that 
is the image of an interval / and P — {x,y) is- a 
point on C to which there correspond two 
elements tj (ti<tz) of / such that P is the 
image of both t,, (3. In this case, the point P is 
called a multiple point on C. An arc having no 
multiple point is called a simple arc or Jordan 
arc. 

An arc with one and only one multiple point 
F = (/(O),0(O)) = (/(1),0(1)) is called a Jordan 
curve or simple closed curve (-^ Section K). A 
Jordan curve can be regarded as a topological 
image in a plane of a circle. Let C be a curve 
that is the image (p(l) of an interval. Then C is 
said to be of class C‘‘ (analytic) if the mapping 
(p is of tclass C*" (tanalytic). In general, if S is a 
topological space, then the image (p{l) in S of 
an interval is called a curve in S. In particular, 
if S has the structure of a differentiable (ana- 
lytic) manifold, we can define the notion of 
curve of class C'‘ (analytic curve) in S. 

C. Ordinary Curves 

A tconnected subset of E^ that is the union of 
a finite number of simple arcs meeting at a 
finite number of points is called an ordinary 
curve. An ordinary curve is called a tree if it 
does not contain a subset that is homeomor- 
phic to a Jordan curve. Let p be a point on an 
ordinary curve C. The ^boundary of a suffi- 
ciently small ^neighborhood of p meets C 
at a finite number of points, and this number 
is independent of the choice of the small 
neighborhood. We call it the order of p in C. 

A point of order 1 is called an endpoint of C, 
a point of order 2 an ordinary point, and a 
point of order > 3 a branch point. If we can 
represent an ordinary curve C as a continuous 
curve tracing each simple arc of C just once, 



we say that C is unicursal (Fig. 1 ). A necessary 
and sufficient condition for C to be unicursal is 
that the number of points of odd orders in C 
be less than or equal to 2 (L. Euler). 




Fig. I 



D. Further Consideration of Definitions 

Although the set of ordinary curves as defined 
above contains most familiar curves, it does 
not contain a point set defined by y = sin 1/x in 
0<x^ 1 and — 1 ^x<0, and by —1 ^y^ 1 at 
X = 0 (Fig. 2 ). This point set is called a sinusoid, 
and it is desirable to obtain a definition of 
curves wide enough to contain a sinusoid. On 
the other hand, the notion of continuous 
curves is, in a sense, too wide, because a curve 
such as a Peano curve (-^ Section J), which 
covers a whose square, is among such curves. 
The notion of simple arcs is too narrow, 
because even a circle is not a simple arc. As a 
point set in E^, a continuous arc is character- 
ized as a tlocally connected tcontinuum and 
is sometimes called a Peano continuum (H. 
Hahn, S. Mazurkiewicz). On the other hand, 

A. Schoenflies, inspired by the statement of the 
Jordan curve theorem (-^ Section K), consid- 
ered a closed set that divides the plane into 
two parts, forming the common boundary of 
both domains, and called it a closed curve. 
According to this definition, however, a simple 
curve is not a closed curve. Thus as a general 
definition of curves it is not appropriate. 




Fig. 2 



To give a general notion of curves on a 
plane (containing sinusoids), we may define a 
curve as a continuum that is tnowhere dense 
in E^ (i.e., a continuum that is a boundary of 
open sets on the plane) (G. Cantor). Further- 
more, to deal with the curves on a topological 
space, P. S. Uryson and K. Menger defined a 
general curve to be a 1 -dimensional continuum 
(Menger, 1921-1922 [ 1 ]). In E^, the latter 
notion coincides with the notion of curves 
defined by Cantor, while in E^ a general curve 
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is a continuum that does not divide any 
tdomain 79 Connectedness). 

E. Universal Curve 

Consider a 3-dimensional cube /^ (/= [0, 1]). 
Draw two planes parallel to each face so that 
the two planes parallel to a face trisect the 
edges of the cube meeting the planes. Thus 
is divided into 3^ cubes. Let be the closure 
of the subset of P that is obtained from P by 
deleting the cube If — [1/3, 2/3]) and the 6 
cubes having common faces with If. Then Mj 
consists of 20 cubes (Fig. 3). We apply to each 
of the 20 cubes forming M^ the same operation 
that we applied to P and denote by M2 the 
union of point sets thus obtained (consisting of 
20^ cubes, the length of whose edges equals 
1/3^). Repeating this process, we obtain a 
point set M„ consisting of 20" cubes, the length 
of whose edges equals 1/3". Thus we obtain the 
sequence Mj =5 Mj => M3 => . . . . The set U 
= ri”=i ^ general curve in the sense of 
Uryson and Menger. Moreover, we can prove 
that an arbitrary general curve is homeomor- 
phic to a subset of U. Hence we call U the 
universal curve. 




F. Length of a Curve 

In this section, by a curve we mean a continu- 
ous curve in a Euclidean space £". Let C be 
a curve in E" defined by Xi-f^it) 
a^t^b; the /• are real-valued continuous 
functions defined in [a, h]). (We sometimes 
write this simply as X — f(t), where X — 

(xj, ... ,xj.) We divide [a,h] arbitrarily and 
denote the dividing points by a tp < ti < f2 < 

. . . < L = h. Let A'j == f (t /c = 0, . . . , r, and let 
be the length of the straight line seg- 
mentjoining and X^. If the length / 

= Zfc=i of the broken line {XqX^^ . . . A',) 

(Fig. 4) is bounded for any subdivision of 
la,bl C is called a rectifiable curve, and the 
upper limit of I with respect to the subdivisions 
is called the length of C. For C to be rectifi- 
able, it is necessary and sufficient that the 
/( (i = 1, 2 , . . . , n) be of tbounded variation 
(Jordan). Thus if C is a rectifiable curve, then 
each /j(t) (j = 1, 2 , . . . , n) is almost everywhere 
differentiable (H. Lebesgue). In particular, if C 
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is of class C\ then C is rectifiable, and its 
length can be represented by 




(-^ 246 Length and Area). 




G. Shapes of Curves 

In this section by a curve we mean the image 
(p{I) in a Euclidean space £" of an interval / 
(bounded or unbounded), where 9 is a continu- 
ous mapping. When a curve C of class C* is 
given in a Euclidean space £", it often becomes 
necessary to examine the shape of C globally. 
The determination of the global shape of C 
from the equation of the curve is called the 
curve tracing of C. The problem has been 
thoroughly studied in the particular case in 
which n = 2 and the equation of C is given by 
/(^>y) — 0 in a rectangular coordinate system 
(by F{r,9)^0 in a tpolar coordinate system), 
where / (or F) is an analytic function. The 
problems in the case of a rectangular coordi- 
nate system are as follows; 

Let (phe a single-valued analytic function 
and / a (bounded or unbounded) interval on 
the x-axis. If a curve Cq represented by y = 

(p(x) (xe/) is a subset of C, then Cq is called 
a branch of C. According as / is bounded or 
unbounded, Q is said to be a finite branch or 
infinite branch of C. When a curve represented 
by X — i//(y) (yeJ) (1/^ is a single-valued analytic 
function and J is an interval on the y-axis) is a 
subset of C, it is also called a branch of C. If it 
is necessary to distinguish these two branches, 
we call the former the x-branch and the latter 
the y-branch. C consists of an at most de- 
numerable set of branches. If P(xQ,yo)eC and 
fy — df/dy¥=0 at P, then there exists an x- 
branch containing P; f^ = df/dx^0, there 
exists a y-branch of C. If df/dx = Q, df/dy = 0 at 
P, then P is called a singular point of C. Points 
on C that are not singular points are called 
ordinary points of C. 

When P is an ordinary point of C, a branch 
Co of C containing P is determined, and the 
tangent line and normal line to C at P are 
the same as those to Cq and are uniquely 
determined. The equations of these are (x — 
Xo)fx(xo, yo) + (j* - yo)/,.(^o . yo) = 0 and (x - 
Xo)fyi^o, yo) - (y - yo)/x(^o^ yo) = O, respec- 
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lively. If we ehoose a coordinate system with P 
as origin and the tangent line and normal line 
as (^-axis and f/-axis, respectively, then the 
equation of C with respect to this coordinate 
system is of the form ri = C 2 ^^ + € 3 ^^ + ... in the 
neighborhood of P. If we denote by p the 
tcurvature of C at P, then p = 2c 2 = —{fxxfy ~ 
+.W)/(// When C 2 = P^ 

0, P is a ^stationary point. A stationary point 
of a curve of class on C is also called a 
point of inflection. When P is not a point of 
inflection and rj) are points on C in a neigh- 
borhood V of P, the sign of rj is definite if V is 
small enough (Fig. 5). However, if P is a point 
of inflection and C3 #0, then C is of the shape 
shown in Fig. 6. At a point of inflection, if C3 = 

. . . = = 0, 7^: 0, and v is even, C is of the 

shape shown in Fig. 5, and if v is odd, C is of 
the shape shown in Fig. 6. 




Fig. 5 Fig. 6 

In a neighborhood of a singular point, C 
takes various shapes. For example, consider a 
curve represented by = x^(x -f a), and let P 
be the origin (0, 0). If n > 0, then there are two 
branches of C passing through P, and they 
have different tangents at P (Fig. 7). As in this 
case, if there are a finite number of different 
branches passing through P with different 
tangents, P is called a node of C. If a < 0, then 
PeC, but there is no other point of C in the 
neighborhood of P (Fig. 8). Such a point is 
called an isolated point of C. If a = 0, then there 
are two branches of C starting from P, and the 
tangents to these at P are the same (Fig. 9). 
Such a point is called a cusp of C. When C is 




Fig. 7 Fig. 8 




Fig. 9 



an talgebraic curve, we can examine the shape 
of a curve in a neighborhood of a singular 
point using the tPuiseux series. 

When C has an infinite branch Cq (for 
example, when Cq : y = ^ (x) (x e /) is an x- 
branch of C and I = \_a, 00)), if the tangent to 
Co at P(xq, yo) (xo^/) has a limiting line for 
Xq-j-oo, then the limiting line I is called an 
asymptote of Cq. In this case, the distance from 
a point P(xo,yo) of Cq to I converges to zero 
when Xq-^oo. An asymptote of an infinite 
branch of C is also called an asymptote of C. 



H. Special Plane Curves 

The following are the well-known curves (for 
ellipses, parabolas, and hyperbolas — ^ 78 
Conic Sections). 

Among curves of the third order, those 
having an equation of the form 

y^^f{x)/{x-a) ( 1 ) 

(/(x) is a rational expression of at most the 
third order in x) are symmetric with respect to 
the x-axis and have x = a as an asymptote. In 
particular, if a>0, /(x)= — x^ in (1), then the 
curve is as shown in Fig. 10 and the origin is a 
cusp. Let a half-line starting from the origin 
meet the curve, the circle with diameter [0, a], 
and the straight line x = a at points X, Y, and 
A, respectively. Then we have OX = YA. This 
curve is called a cissoid of Diodes. 

If a — 0,f{x) — c^{c — x) (oO)in (1), then the 
curve takes the shape shown in Fig. 11. Let A, 
C be the points whose coordinates are {a, 0), 

(c, 0) (0 < a < c), respectively, and let X, Y be 
the points in the first quadrant at which the 
straight line parallel to the y-axis and passing 
through A meets the curve and the circle with 
diameter OC, respectively (Fig. 11). Then we 
have AX:AY= OC : OA. This curve is called a 
witch of Agnesi. 





Fig. 10 Fig. 11 

If a <0, /(x)= — x^(x/3 + a) in (1), then the 
curve takes the shape shown in Fig. 12a. If we 
rotate it by n/4 and put it in the position 
shown in Fig. 12b, then the equation of the 
curve takes the form x^ -H y^ = 3cxy (c = — ^/2 
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a). If we take as parameter t^y/x, then we get 
the parametric representation x = 3ct/(l + 
y = 3ct^/(l + 1^). This curve is called a folium 
cartesii (or folium of Descartes). A curve that 
has a parametric representation of the form 
x = (p{t), y — ij/{t), where cp, are rational func- 
tions, is called an (algebraic) unicursal curve (or 
rational curve). Such a curve is an algebraic 
curve of tgenus 0. 



Fig. 12 

Let r=/i(0), r=/2(0), ...,r=/^(0) be equa- 
tions of curves C, , C 2 , . .. , with respect to a 
polar coordinate system with origin 0. A curve 
C having equation r = A^fi{9)+ ...+ 4/^(0) 

(the Aj are constants, usually -1-1 or —1) in the 
same coordinate system is called a cissoidal 
curve with respect to O. (Fig. 13: r= —fi{9) + 
f2(0)). In Fig. 10, let be the circumference 
of the circle with the diameter [0, a], let Cj be 
the straight line x = n, and put Aj = —1, ^ 2 ^ 1. 
Then we have a cissoid of Diodes. We can 
regard the folium cartesii as a cissoidal curve 
obtained from a straight line and an ellipse. 
When C, is a circle with center at 0, we call C 
a conchoidal curve of C 2 with respect to 0. In 
particular, when Cj is a straight line and O is 
not on C 2 , the conchoidal curve is called a 
conchoid of Nicomedes. 




Fig. 13 



As shown in Fig. 14, when C 2 is perpendic- 
ular to the initial line of the polar coordinate 
system, the equation of the conchoid is r = 
ascc9±b{b is the radius of Cj), and the Car- 
tesian equation of the curve is (x — «)^(x^-l- 
y^) = b^x^. According as a>b, a = b, or a<b, 
the curve has a node, cusp, or isolated point, 
respectively. When C 2 is a circle and 0 is on 
C 2 , the conchoidal curve of Cj with respect to 
0 is called a lima 9 on (or lima 9 on of Pascal) 
(Fig. 15). The equation of a lima^on C with 
respect to a polar coordinate system having 



the diameter of a circle passing through 0 as 
its initial line is r = acos9±b, while the equa- 
tion of C with respect to a Cartesian coordi- 
nate system is {x^ + y^ — ax)^ = b^{x^ + y^). In 
this case, if a > h, 0 is a node of the curve; if 
a — b,Oisa cusp. When a = b, the curve is 
called a cardioid (the curve shown by a dashed 
line in Fig. 15; see also Fig. 26 below). 




Fig. 15 



The locus of a point X having a constant 
product of its distances from two fixed points 
A, B is called Cassini’s oval (Fig. 16). The 
equation of this curve with respect to the 
Cartesian coordinate system whose origin O is 
the midpoint of the segment AB and whose 
x-axis is the straight line AB is (x^ -t-y^)^ — 
2a^{x^ — — (where AB = 2a, — 

AX ■ BX). In particular, if a^ = k^, then 0 is a 
nodal point of the curve. In this case, the curve 
(shown by the dashed line in Fig. 16) is called a 
lemniscate (or Bernoulli’s lemniscate) (Jakob 
Bernoulli). 




Fig. 16 



The locus of the foot of the perpendicular 
drawn from a fixed point O to the tangent of a 
fixed curve C at each point of the curve is 
called the pedal curve of C with respect to O. 
The pedal curve of a rectangular hyperbola 
with respect to its center is a lemniscate (Fig. 
17), and the pedal curve of a circle with respect 
to a point is a limagon. 
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Fig. 17 



also a brachistochrone, i.e., the line of swiftest 
descent (Johann Bernoulli and others). 




Fig. 21 



When a curve C' rolls on a fixed curve C 
without slipping and is always tangent to C, 
the locus r of a point X kept fixed with re- 
spect to the curve C is called a roulette whose 
base is C, rolling curve is C, and pole is X. In 
particular, when C is a straight line, C is a 
circle, and X is on C, T is called a cycloid (Fig. 
1 8). When X is not on C, F is called a trochoid 
(Fig. 19). A trochoid is represented parametri- 
cally by the equations x — aO ~ b sind, y = a — 
bcosO, where the parameter 0 is the angle of 
rotation of C. When a = b, the equation repre- 
sents a cycloid. The tevolute and tinvolute of a 
cycloid are also cycloids (Fig. 20). 





\y 










.4 
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Fig. 18 




Fig. 19 




Fig. 20 

Suppose that we are in a gravitational field 
with a given path represented as a cycloid, as 
is shown in Fig. 21. Assuming that there is no 
friction, the time necessary for a particle to 
slide down the path from a point X on the 
curve to the lowest point C of the curve is 
independent of the initial position X (C. Huy- 
gens). Because of this property, the cycloid is 
also called a tautochrone. Suppose that a par- 
ticle starts from a point A in the space and 
slides down to a lower point B along a curve F 
(without friction) under the effect of a gravita- 
tional force. To minimize the elapsed time, 
we simply take F as a cycloid that lies in the 
vertical plane containing AB and has a hori- 
zontal line through A as the base (Fig. 22). 
Because of this property, the cycloid is called 




Fig. 22 



When the base curve C and the rolling curve 
C are both circles and X is on C', we call F an 
epicycloid if C, C are externally tangent (Fig. 
23), and a hypocycloid if C, C are internally 
tangent (Fig. 24). When X is not on C', cor- 
responding to these two cases, we have an 
epitrochoid and a hypotrochoid, respectively. 

Let a, b be radii of C, C, respectively, c the 
distance from the center of C to X, and 6 
the angle of rotation of C. Then the para- 
metric equations of these curves are x = {a± 
b) cos 9 + c cos((a + b)/b) 9, y — {a±b) sin 9 — 
csm{{a±b)/b)9. (Take the upper signs when 
the curve is an epicycloid and the lower signs 
when the curve is a hypotrochoid. When b — c 
the equations are equations of epicycloids and 
hypocycloids.) When the ratio a:b is a rational 
number p/q (p, q are mutually prime), then C 
returns to its initial position after rotating q 
times around C; in this case each F becomes 




Fig. 23 




Fig. 24 
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an algebraic curve. In particular, when a = 

4b = 4c, the hypocycloid is called an astroid 
(Fig. 25). Its equation (with respect to a Car- 
tesian coordinate system) is 
The envelope of line segments of length a 
whose endpoints are on the x-axis and y-axis, 
respectively, is an astroid (Fig. 25). When a = 
b = c an epicycloid is a cardioid (Fig. 26). 




Fig. 25 Fig. 26 

When the base C is a straight line, the roll- 
ing curve C is an ellipse or a hyperbola, and 
the pole is a focus of C, then the roulette is 
called a Delaunay curve (Fig. 27). 




Fig. 27 



When C is a straight line, C is a parabola, 
and X is the focus of C', the roulette is called a 
catenary (Fig. 28). When we hold two ends of a 
string of homogeneous density in the gravita- 
tional Field, the string takes the form of this 
curve. The equation of the catenary with re- 
spect to a Cartesian coordinate system is y = 
acoshx/a = a(e^^“-l-e~*^‘')/2. The involute 
starting at the point A{0,a) of this curve is 
called a tractrix (Fig. 29). Let Q be the point of 
intersection of the tangent at P to the tractrix 
and the x-axis; then the length of PQ is con- 
stant and is equal to a. Consequently, when we 
drag a weight at /I by a string of length OA 
along the x-axis, the curve described by the 
weight is the tractrix. The parametric equa- 
tions of the tractrix are x = a(logtant/2-|- 
cost), y = asint. 



Fig. 28 Fig. 29 

Suppose that a point Q moves with constant 
velocity on the x-axis and another point P also 
moves with constant velocity always toward Q. 
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The locus of the point P is called a curve of 
pursuit (Fig. 30). When the velocity of Q is a 
times that of P, the equation of the curve of 
pursuit is 2(x — n) = y‘”“/c(l — a) — + s() 
if ct^ \ and 2(x — a) = (l/c)logy — cy^/2 if a = l. 
We can consider similar problems when Q 
moves on a general curve instead of on the x- 
axis. 




Many plane curves that are called spirals 
can be expressed by r= f{6) (/ monotonic) in 
polar coordinates (r, 0). An Archimedes spiral 
is a curve having the equation r = a0 (Fig. 31). 
Archimedes found that the area bounded by 
two straight lines 0 = , 6 = 02 (6i < Oa) the 

curve is a^{6l — 6f)/6. A logarithmic spiral 
(equiangular spiral or Bernoulli spiral) is a 
curve having the equation r~ke“^ (Fig. 32). 

The angle between the straight line 0 = con- 
stant and the tangent to the curve is constant. 
Johann Bernoulli found that the involute and 
evolute of this curve are congruent to the orig- 
inal curve. A curve having the equation r = 
a/9 is called a hyperbolic spiral (or reciprocal 
spiral), and the one having the equation r^6 = a 
is called a lituus. These two spirals are shown 
in Figs. 33 and 34, respectively. Let p = (p{s) be 
the tnatural equation of a curve. Properties of 
the curves for which the functions (p{s) are 
simple have been investigated. Specifically, a 
curve having the natural equation p = ks(k is a 
constant) is a logarithmic spiral. A curve hav- 
ing the equation p = a^/s'is called a Cornu 
spiral (or clothoid; ^167 Functions of Con- 
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fluent Type). Its parametric representation is 




(tFresnel integral). M. A. Cornu used this 
curve in the representation of diffraction in 
physical optics. 

There are also curves that appear as graphs 
of '•'elementary functions. For example, a curve 
having the equation y = sinx is called the sine 
curve, and the graphs of equations y = and y 

= logx are called the exponential curve and the 
logarithmic curve, respectively, although they 
are congruent. In contrast to algebraic curves, 
these analytic curves that are not algebraic are 
called transcendental curves. (Regarding the 
differential geometric properties of plane and 
spaces curves ^111 Differential Geometry 
of Curves and Surfaces; for plane algebraic 
curves ^ 9 Algebraic Curves.) 

I. Envelopes 

Let f(s, r) be a function of class C^ of real vari- 
ables s, t. If we fix t = tQ, then r = f (s, Iq) is the 
equation of a curve C,^ with a parameter s. If 
s(t) is a function of t, then f (s(to), G) represents 
a point on C,^. Let E be the locus of 
when to moves. If s(t) is a function of class C\ 
then £ is a curve of class CL If £ and C,^ are 
always tangent at each point P,^, we call £ the 
envelope of the family of curves Cj. When f(s, f) 
is given, to find £ we need only determine the 
function s(t). We note that d\/ds — Xdydt is a 
condition that must be satisfied by the func- 
tion s(t). When n — 2 and the equation of is 
given in the form f{x, y, to) = 0, the point of 
intersection of C,^ and f{x,y, to) = 0 (/( — 5//5t) 
is P^^. The equation R(x,y) = 0 obtained by 
eliminating t from /(x, y, t) = 0 and /({x, y, t) = 0 
is called the discriminant of /(x, y, t) = 0. The 
set of points (x, y) satisfying the discriminant 
K = 0 is the union of £ and the locus of the 
singular points of 

J. Peano Curve 

A tcontinuous curve in the Euclidean plane £^ 
(i.e., the image f(l) of a segment I = [0, 1 ] 
under a continuous mapping / may 
cover a square. We call such a curve a Peano 
curve after G. Peano, who gave the first exam- 
ple. D. Hilbert simplified the example and 
constructed a Peano curve as follows {Math. 
Ann., 36 (1890), 38 (1891)). 

Divide a square and a segment into four 



equal parts (Fig. 35) and let each square 
correspond to the segment 7^ (i = 0, 1, 2, 3). 
Then divide each D,- into four equal parts (Fig. 
36) and let correspond to 7)^ (j = 0, 1,2, 3), 
and continue this process (Fig. 37). A sequence 
of squares Dj id D^j => ^ has the unique 

common point and we let this point 
correspond to the unique common point 
of the sequence of segments 7] 

The correspondence is a continu- 

ous mapping of the segment [0, 1] onto the 
square D, and this continuous curve has 
double points, triple points, and quadruple 
points. The set of multiple points has the car- 
dinal number of the continuum and is a 
tdense set in the square. We can improve this 
method so that there are no multiple points 
other than double and triple points, but it is 
impossible to eliminate triple points 
altogether. 
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Fig. 37 



We can also construct Peano curves as 
shown in Figs. 38, 39, and 40. That is, we 
“bisect” a triangle and a segment [0, 1] suc- 
cessively and build a correspondence as fol- 
lows: Let a point represented by a binary 
number t — O.ijk . . . {i,j, k, ...—0,1) correspond 
to the unique common point of the sequence 
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A; 3 Aij =3 Aiji^ n . . . and obtain a continuous 
mapping from the segment [0, 1] onto the 
triangle. 




0.0 0.1 1.0 
^ ^ — I 

•I 0 j 1 

Fig. 38 




7’on 



Fig. 40 

K. The Jordan Curve Theorem 

The Jordan curve theorem states; A Jordan 
curve J in the plane separates into inner 
and outer tregions (C. Jordan, Cours d’analyse, 
2nd ed., 1893). More precisely, R^ — J is the 
disjoint union U Gj of two regions Gj and 
Gi whose common tboundary is J. Let p be a 
point of J. Then there is a Jordan arc with p as 
an endpoint such that all points of the Jordan 
arc are contained in G; (i = 1 or 2) except for p 
(A. Schonflies), that is, J is accessible from Gj. 
Conversely, let J be a compact subset of R^ 
such that R^ — J = G; U G 2 and Gj H G 2 == 0, 
where the G,- are regions such that J is acces- 
sible from both G^ and Gj. Then J is a Jordan 
curve (Schonflies, 1908). A homeomorphism 
between a Jordan curve J and the circle C 
extends to a homeomorphism (more precisely 
to a tconformal mapping) between a plane 
containing J and a plane containing the circle 
C (-^ 65 Combinatorial Manifolds G; 77 Con- 
formal Mappings). 
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Curvilinear Integrals and 
Surface Integrals 

A. General Remarks 

The integral of a function (or more precisely, a 
differential form) along a tcurve (tsurface) is 
called a curvilinear integral (surface integral). 
Because a curvilinear integral is a special case 
of the Stieltjes integral, we shall first explain 
this notion, formulated by T. J. Stieltjes (1894) 
as a generalization of the tRiemann integral. 
The notion was introduced in connection with 
Stieltjes’s study of Continued fractions, and 
led to the idea of integrals with respect to 
general measures. 

B. The Riemann-Stieltjes Integral 

Suppose that f{x), a(x) are real-valued 
bounded functions defined on [a, b]. Take 
a partition of the interval a = XQ<x^<X 2 
< . . , < x„_ ] < x„ = b (— ^ 216 Integral Calculus) 
and consider the Riemann sum with respect 
to a(x); 

Z /(<^.)(a(^i+i)-a(Xi)), (^iG[Xj,X;+i]. 

1 = 0 

Suppose that the Riemann sum tends to a 
fixed number as max(Xj+i — Xj) tends to zero. 
Then the limit is called the Riemann-Stieltjes 
integral (or simply Stieltjes integral) of /(x) 
with respect to a(x) and is denoted by 
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Ja/(x)da(x). The Riemann integral of f{x) is a 
special case, where a(x) = x. 

The Riemann-Stieltjes integral has the ele- 
mentary properties, such as linearity, of the 
usual Riemann integral. We also have the 
following theorem: The integral jf{x)da{x) 
exists for every continuous function /(x) if and 
only if a(x) is of tbounded variation. Hence 
when we consider the Stieltjes integral of /(x) 
with respect to a(x), we usually assume that 
/(x) is continuous and «(x) is of bounded 
variation. However, the Stieltjes integral can 
be defined if /(x) is of bounded variation (not 
necessarily continuous) and a(x) is continu- 
ous (not necessarily of bounded variation). If 
a sequence f„{n) (n = 1, 2, . . . ) of uniformly 
bounded continuous functions defined on the 
interval [a, b'\ converges to a continuous func- 
tion f(x) on the interval [a,h], we have 



lim 

«-»co 



/„(x)da(x) = 



/(x)da(x). 



Ja 



Ja 



where a(x) is a function of bounded variation. 
Furthermore, if a(x) and a„(x) (n = 1, 2, . . . ) 
are functions of bounded variation whose 
ttotal variations are uniformly bounded and 
lim„^^ a„(x) = a(x) at every point of continuity 
of a(x), then we have 



lim 



/(x)da„(x) = 



f /(x)doc(x) 



Ja 



for every continuous function /(x) on [a, b] 
(Helly’s theorem). 

Let a(x) be a tstrictly monotone increasing 
continuous function, and let j8(y) be its inverse 
function. Then we have 



«(6) 



/(x)da(x)- /(jS(y))dy, 



«(a) 



( 1 ) 



where the right-hand side is the usual Rie- 
mann integral. A function a(x) of bounded 
variation is represented as the difference of two 
strictly monotone increasing functions ai(x) 
and a 2 (x). If we denote by jS,(y) the inverse 
function of a;(x) (i = 1, 2), we have 



'b 

/(x)d«(x) = 
J ^ 






mAy))dy 



X2(b) 



fiAiy))dy. 



Xila) 



(2) 



It x'(x) exists and is continuous, we have 



'b 

f{x) da{x) — 

J a 



/(x)a'(x) dx. 



Ja 



( 3 ) 



C. The Lehesgue-Stieltjes Integral 

Suppose that a(x) is a monotone increasing 
and right continuous function and f = (xi,X 2 ]. 



We define an interval function t/(/) = a(x 2 ) — 
a(xi). It is nonnegative and countably addi- 
tive. Hence by utilizing U{I) we can con- 
struct the outer measure and also a completely 
additive measure (-^ 270 Measure Theory; 380 
Set Functions). The Lebesgue integral with 
respect to this measure is called the Lehesgue- 
Stieltjes integral (or Lehesgue-Radon integral) 
and is denoted by Jo/(x)da(x). If a(x) is a 
strictly monotone increasing continuous func- 
tion and P{y) its inverse function, then formula 
(1) is true if the left-hand side is a Lehesgue- 
Stieltjes integral and the right-hand side is a 
Lebesgue integral. If a(x) is a function of 
bounded variation, decomposing a(x) into the 
difference of two strictly monotone increas- 
ing functions, we also have formula (2). If 
oc(x) is tabsolutely continuous, formula (3) is 
valid, where the right-hand side is a Lebesgue 
integral. 

The Stieltjes integral has the following two 
properties. 

Integration by parts: In the interval [a, b], 
we have 

I UdV+\ VdU = U{b)Vib)~Uia)V{a) 

J a J a 

if one of U (x), V{x) is continuous and the other 
is of bounded variation. 

Second mean value theorem: If U (x) is mono- 
tone increasing and F(x) is continuous, then 
there exists a in [a, b] such that 

= U{a)m)-Via))+U{bmb)- V(0). 



D. The Curvilinear Integral 

A continuous mapping from an interval a < 
r ^ b in R ‘ into R” : (p(t) = {(p^ (t), . . . , (p„{t)) is 
an oriented curve. Suppose that a function 
/(xi, ...,x„) is defined in a neighborhood U of 
the image C of the mapping (p{t) or merely on 
C. The Stieltjes integral 

'b 

J <3 

is called the curvilinear integral of the function 
/(xi , . . . , xj along the curve C with respect to 
Xj and is denoted by The curve C is 

called the contour (or path) of the integration, 
(p{a) is called the initial point (or lower end), 
and cp{b) is called the terminal point (or upper 
end) of the integration. Let C be a rectifiable 
curve, and denote by s(t) the arc length of C 
from the initial point to the point (p(t). Then 
the Stieltjes integral 

fi(p{t))ds{t} 

J O 
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or simply is called the curvilinear inte- 
gral with respect to the line element. Here, 
th e line element, denote d by ds, means 

(0)^ + . . . + dt when (p{t) is of class 

If the integrand in (4) is of bounded vari- 
ation as a function of t, the curvilinear integral 
is well defined. If C is a trectifiable curve, the 
curvilinear integral is defined for an arbitrary 
continuous function. In the usual case, we are 
concerned mainly with this sort of situation. 
For a differential form co—fidxi + ...+ f„dx„ 
defined on U, the curvilinear integral is 
defined by Z"=i 

The curvilinear integral is linear with re- 
spect to its integrand. If the terminal point of 
Cj is the initial point of Cj, we can construct 
the joint curve C = + C 2 , and we have 

additivity for the contours 




(a similar formula holds if we replace dx; by 
ds). Monotonicity, which asserts that jcfdx^^ 
jcSdXi whenever f ^g, holds if (p,(t) is mono- 
tone increasing, and monotonicity also holds 
for the curvilinear integral with respect to the 
line element. 

If n = 2, can be identified with the tcom- 
plex plane C = {z = x -I- iy}, and we define 
]cf(z)dz by 



u(x)dx- 




+ i 



v(z)dx + 



u{z)dy 



where f(z)^u{z) + iv{z). The integral is then 
said to be an integral in the complex domain. 
(For the application of integrals in the com- 
plex domain to complex analysis ^ 198 
Holomorphic Functions.) 



E. The Surface Integral 



By an m-dimensional smooth surface S we 
mean the image S of a ^regular mapping of 
class from a domain G in R"* into R" (m< 
n), ^(u) = (4(ui , . . . , u J, . . . , ^„(ui , . . . , u J). 
Given a continuous function /(xi, ...,x„) 
defined in a neighborhood (7 of S in R", the 
multiple integral 









DiXi Xi) 



( 5 ) 



is called the surface integral of / along S 
with respect to x,^ , . . . , Xj^ and is denoted by 



jsfdXi^ . . . dxi^ or f . . . ^sfdXi ^ . . . dx,-^. This 
definition depends on the choice of the para- 
meters («!,..., M„) in the following way. Let 
(m'i , . . . , u'J be another parametrization of S. 
Since 



and 



du\ . . . du'„ - 



D{u\,. 


• 5 ^m) 


D(Uy,. 


•,Mm) 



du ^ . . . dUffj^ 



thus according as 



D(ui,...,wJ 

the foregoing expression remains the same or 
changes its sign. Usually we assign to the 
parameters (w^ , . . . , positive or negative 
orientation, in which case S is called oriented. 
If we replace the Jacobian D(x,^, . . . , XjJ/ 

D (u 1 , . . . , u J in (5) by the quantity 



z 



DjXi^,...,XiJ 

D(ui,...,mJ 



2N1/2 



which corresponds to the surface element of S, 
the integral is called a surface integral with 
respect to the surface element and is denoted 
by jsfdS or The surface integral of a 

differential form of degree m in R" is similarly 
defined. In the case m — 1, the surface integral 
reduces to a curvilinear integral. Just as the 
Stieltjes integral is a generalization of the 
curvilinear integral, there are several ways 
to generalize the notion of surface integral 
without assuming that the mapping ^{u) is of 
class 



F. The Stokes Formula 



Let S be an m-dimensional smooth surface in 
R" {m^n} and dS be the (m — l)-dimensional 
surface corresponding to the boundary of S. 

Let CO be a differential form of class of 
degree (m — 1) and dm be its texterior deriva- 
tive. Then we have J 5 scu = J^c/co, which is called 
the Stokes formula (or the Green-Stokes for- 
mula). (For the Stokes formula on a general 
differentiable manifold 105 Differentiable 
Manifolds.) As special cases of the Stokes 
formula, we have the following three classical 
theorems: 

(1) The case of a plane domain: Let D be a 
bounded domain on the xy-plane bounded by 
a finite number of smooth curves C with posi- 
tive directions. If m = P dx + Qdy is a. differen- 
tial form of class on £), we have 



Pdx + Qdy = 





dxdy. 



( 6 ) 
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since da> = (dQ/dx — dPIdy) dx a dy. This is 
called Green’s formula (or Green’s formula on 
the plane). Equality (6) is true if P, Q are totally 
differentiable and the integrand on the right- 
hand side is continuous (even if the functions 
dQldx, dP/dy themselves are not continuous) 
(E. Goursat). This formula remains true under 
the following weaker assumptions: (i) C is 
rectifiable; (ii) P and Q are continuous in D 
and dP/dy and dQjdx are continuous and 
summable (in the sense of Lebesgue) in D. 

(2) The case of a domain in a 3-dimensional 
space: Let D be a bounded domain in xyz- 
space surrounded by a finite number of 
smooth surfaces S. For a tvector field V = 
{P,Q,R} of class C* on D, we put co^PdyA 
dz + Qdz Adx + Rdx A dy. Then since dco = 
{dP/dx + dQ/dy + dRjdzjdx Ady a dz, we have 



Pdydz + Qdzdx + Rdx dy 

J Js 



divVdx dydz 




(dP dQ dR\ 
dy dz / 



dxdydz. 



( 7 ) 



Equality (7) is called the Gauss formula (Ost- 
rogradskii’s formula or the divergence theo- 
rem). The left-hand side of (7) is equal to the 
surface integral j‘Js(V,n)d(T, which is the tvec- 
tor flux through S (where n means the outer 
unit normal vector of the surface S). This 
formula remains true under the following 
weaker assumptions: (i) S is piecewise of class 

; (ii) P, Q, R are continuous in D and dP/dx, 
dQ/dy, and dR/dz are continuous and sum- 
mable in D. 

(3) The case of a bordered surface in a 3- 
dimensional space: Let S:x = x(u,v), y = y(u,v), 
z = x(u, v) ((u, v) G G) be a smooth surface in 
xyz-space, and suppose that the boundary T of 
the domain of the parameters G consists of a 
finite number of smooth curves with positive 
direction. The boundary C of the surface S is 
the image curve of T. Now let V = (P, Q, R} be 
a vector field of class C‘ on S, n be the unit 
normal vector of S (its direction being canoni- 
cally assigned by the parameter (m,u)), and t be 
the unit tangent vector, and set co — Pdx + Qdy 
+ R dz. Then since dco = {dR/dy — dQ/dz) dy a dz 
+ {dPjdz — d/?/dx) dz A dx-\-{dQldx — dP jdy)- 
dx A dy, we have 



(rot V, a)da- 





dydz 








dzdx-y 



fdQ 

\dx 




= (Pdx + Qdy-yRdz)^ (V,t)ds. (8) 



Equation (8) is called the Stokes formula (^ 
Appendix A, Table 3. III). 
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Cybernetics 

The term cybernetics was invented in 1947 by 
Norbert Wiener [1] to denote a field of science 
that treats the system of control and communi- 
cation in animals and machine. The term was 
derived from a Greek word Koji^pv^x^c;, also 
the source of the word “governor.” The stimu- 
lus for establishing such a new area of science 
came from studies of automatic computation, 
automatic control, and information process- 
ing. These fields had given rise to technological 
innovations such as the high-speed electronic 
computer and automatic control instruments. 
Such machines have had a profound influence 
on the information sciences, as well as on 
theoretical investigations in biology. However, 
even now, we can hardly say that cybernetics 
has been formly established as a systematic 
branch of science or of applied mathematics. 
Nevertheless, it has had far-reaching influence 
on both biology and machine engineering as 
a methodology and as a philosophy. In the 
Soviet Union and in European countries, the 
word corresponding to cybernetics is still 
widely used for the “Grenzgebiete” between 
biology and machine engineering in the wider 
sense. Nonetheless, it is difficult to say that 
cybernetics as a whole has undergone system- 
atic development. Systematic theories have 
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been established and developed for separate 
parts of the field, and these have become in- 
dependent disciplines; they are, for example, 
^control theory, tinformation theory, the 
theory of artificial intelligence, the theory of 
tautomata, mathematical biology, genetics, 
and ecology. 

The contributions of Wiener to cybernetics 
include not only the invention of the concept 
itself, but also prediction theory or the theory 
of the Wiener filter [2]. The latter was sub- 
sequently formalized by R. E. Kalman from 
a different point of view as an estimation 
problem within a linear system, working from 
finite observation data, and including the non- 
stationary case (-^ 405 Stochastic Control and 
Stochastic Filtering G). Another contribution 
by Wiener is the input-output identification of 
a nonlinear system using statistical time series 
analysis of the outputs when the inputs are 
white noises [4]. This is achieved by expan- 
sion of the output function in terms of the 
convolution of input functions, which corre- 
sponds to the expansion of functions in terms 
of tHermite interpolation polynomials. The 
kernel of the expansion is called a Wiener 
kernel. Recently, this method has been applied 
to input-output identification for nervous 
systems; it also finds application in nonlinear 
system theory [5]. Wiener also studied brain 
waves [6,9]. 

Cybernetics in the wider sense, although not 
necessarily called by this name, may include 
investigations of the following “Grenzgebiete”: 
information processing of the nervous sys- 
tem [7]; self-organizing systems in the non- 
equilibrium thermodynamics of Prigogine and 
Haken; the self-reproducing machines of von 
Neumann; and the theory of pattern formation 
in biological or chemical systems [8]. 

For related topics ^176 Gaussian Process 
I, 395 Stationary Processes D. 
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Data Processing 

A. General Remarks 

With the development of electronic computers, 
effective systems for data transmission and 
processing have been created on a large scale, 
and there is now a large literature concerning 
data-processing methods and techniques for 
such systems. 

Research on data processing encompasses 
both specific techniques and entire systems of 
processing, for example, the system of pro- 
gramming techniques, as well as mathematical 
problems originating from them, such as the 
complexity of computations 71 Complexity 
of Computations). As domains of application, 
we have, for example, information retrieval, 
stock management, program evaluation, and 
review techniques. We denote the set of data 
by D and assume that D is finite. Depending 
on the properties of D, we have various suit- 
able representations and processing problems. 



B. The Notion of Data 

In recent applications of computers, the main 
task has been arranging and searching for 
items or attributes in storage, rather than 
numerical computation. Individual informa- 
tion is called a record. Records arranged and 
stored in the memories of computers are usu- 
ally called data. A collection of such data is 
called a file. A complex, large-scale data collec- 
tion is often called a data base. 

In an abstract sense, the record consists of a 
string of letters, but it is usually convenient to 
view it as consisting of its identifying mark 
followed by a finite sequence of items. The 
contents can be classified according to their 
properties, such as topological relations, order 
relations, or items representing numerical 
values. For each case, there may be different 
suitable representations and operations. 

We often store information after a suitable 
process of information compression. In some 
cases the operations are reversible, and com- 
plete recovery is possible. In other cases the 
operations are not completely reversible, and 
we must throw away part of the information 
in order to compress the rest. An example of 
recoverable information is the replacement 
of a run of I’s (or O’s) in a binary code by its 
length. An example of lost information often 
occurs in the graphical manipulation of num- 
bers by means of hashing. In this process we 



select a suitable function s=/(ai, where 

the vector (a, , . . . , a„) corresponds to the repre- 
sentation of the record in such a way that the 
value s can be monomorphic for the universe 
of the records as often as possible, and the 
data (uj , . . . , is kept in the storage corre- 
sponding to the value s. Generally, the vari- 
able representing the location of data is called 
a pointer. 

In the abstract sense, data bases are simply 
sets of items, but in most cases a base will also 
exhibit some mathematical structure, such as 
order relations. In order to handle a data base 
efficiently, we must take its particular structure 
into account, and suitably represent that struc- 
ture. Such structures and their representations 
are called data structures. 

C. Linear Structures 

The component of a vector or the stations 
along a single railway line form linearly 
ordered sets. Their most essential feature is the 
notion of immediate predecessor or immediate 
successor. Such a data structure is called a 
linear structure, and the aligned sequence is 
called an array. Multidimensional arrays, such 
as the elements of a matrix, are stored in the 
form of 1 -dimensional arrays in the memory of 
a computer. 

When the set of data D is a linearly ordered 
set, it is usually represented as a suitable array. 
Here, the problems of ordering and table look- 
up are fundamental. Ordering in this sense 
means putting the given elements of D into the 
order defined for D. Since, historically, sort- 
ing machines were used to put punched cards 
in order, this process is also called sorting. 

The process of arranging several individually 
sorted data packs into one sorted pack is 
called merging. 

A fundamental data-sorting operation is 
“comparison” with respect to the order for 

D. There have been many investigations of 
the estimation of the lower or upper bounds 
for the number of comparisons and of devel- 
oping efficient algorithms. Asymptotically, 

0(n log n) is the theoretical lower bound for 
n elements, and some algorithms are known 
to achieve this bound (-- 71 Complexity of 
Computations). 

If there is given a univalent correspon- 
dence f :D-^D' and the correspondence table 
is stored in the memory in a suitable form, the 
problem of table look-up arises, which requires 
finding f{d) for a given deD. In this process, a 
fundamental operation is a comparison of 
deD with some xeD in the table. There have 
been many investigations of efficient arrange- 
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merits and of algorithms with minimal num- 
bers of comparisons. 

D. List Representation 

If the given finite set D has an order satisfying 
the reflexive and transitive laws, addressing is 
often used as a medium for representing the 
order in the memory. Let d^, ... ,d„be all the 
immediate successors to an element deD. We 
can represent them by the following sequence 
of triplets; 

If « = 0, we take (d, 0, 0); 

Ifn=l, we take (d,d^,0); 

Ifn = 2, we take {d,dX,d*); 

Ifn = 3, we take (d,d’f,e‘f),{ei,d 2 ,d*); 

If«^4, we take (d,d^,e'f),{ei,d^,e^), 

where df means the pointer to di, e,- is an ele- 
ment introduced for convenience, and ef is the 
pointer to e,-. This is called a list representa- 
tion. If we denote by d->di, the fact that d^ is 
an immediate successor of d, then the total set 
can be regarded as a tdirect graph. Usually, 
this graph is a ttree, and then the graph is 
called a tree structure and its representation a 
tree representation. The advantage of this 
representation is that addition or deletion is 
quite easy. 

The set of logical formulas is partially 
ordered, with the order given by the rules of 
inference. The set consisting of a series of 
inferences forms an ordered subset. Thus, if 
a tree representation can be automatically 
treated, so can the process of inference. 

In dealing with linguistic data (words or 
sentences, for example), it is often natural to 
consider a noncommutative Tree semigroup D 
generated by a finite number of generators (the 
alphabet or vocabulary). In this case, if there 
exists a natural order for the generators, it 
determines in D a lexicographic partial order- 
ing. Then the tree representation can be used 
for representing a dictionary whose entires are 
elements of D. This method is not efficient with 
respect to speed of table look-up and economy 
of memory, but it sometimes has the advan- 
tage of simplifying the treatment of compli- 
cated data. 



E. Memory Devices for Processing 

In dealing with algebraic formulas or lan- 
guages with parentheses, data maintenance 
methods such as tree representation or push- 
down storage are often convenient as auxiliary 
memory-controlling methods. The character- 



istic of the push-down storage method is that it 
returns the data in reverse order with respect 
to the time of acceptance and remittance. 

It is sometimes called a stack or a first-in-last- 
out memory. Contrary to a stack, there is a 
storage that returns the data in the same order 
as the acceptance. It is called a queue or a 
first-in-first-out memory. 

F. Information Retrieval 

A request to obtain all the records qualified 
by some property in a data base Q is called a 
query. The part of the collection of records O 
qualified for a query, therefore, can be identi- 
fied by its characteristic subset of the space of 
all possible records. A procedure that deter- 
mines the subset of Q composed of all the 

records relevant to a query q is called the 
information retrieval of the query. A system 
which provides such a procedure for every 
query ^eQ is called an information retrieval 
system organized for Q with respect to Q. 

It is essential to design the system in such a 
way that records can be retrieved quickly for 
queries in a certain class. Such a selected set of 
queries can be composed of all the first-order 
queries specifying an item and asking for all 
the records containing the attribute value or 
the key characterizing the item. It can, in some 
cases, include second- or higher-order queries 
specifying a number of items and asking for all 
the records in which attribute values or keys 
characterizing the item occur simultaneously. 

In an information retrieval system, the mas- 
ter file of a data base Q is usually organized by 
way of auxiliary memories on data structures 
such as sequential files, indexed sequential 
files, virtual storage files, or direct access files, 
using the magnitude of the accession number 
or primary key of each record. Various man- 
agement systems, called SAM (Sequential 
Access Method), ISAM (Index Sequential 
Access Method), VSAM (Virtual Storage 
Access Method), or DAM (Direct Access 
Method), exist. 

In addition to the organization of such a 
master file, various directory files or indexes 
are organized in order to retrieve each query 
qeQ quickly, because there are many relevant 
keys or combination of keys other than the 
primary key. An inverted filing scheme (IFS) is 
a typical scheme for organizing such indexes 
or directory files. A bucket S, or an address- 
able set of secondary memories is provided 
for each canonical query in a one-to-one 
way. An index or a list of accession numbers of 
pertinent records is organized in each of the 
buckets contiguously so as to make it possible 
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to retrieve all accession numbers of relevant 
records quickly. The essentials of an IPS is 
to define a one-to-one QAT (Query to Ad- 
dress Transformation) from the inverted set of 
queries Q = {q,} to the set of buckets 

An IPS has efficient retrieval performance 
with respect to the inverted canonical queries. 
The scheme, however, may in some cases 
require a large number of buckets. Moreover, 
the scheme requires quite redundant storage 
of accession numbers by a number of buc- 
kets because a record may be pertinent to a 
number of canonical queries simultaneously. 
Although higher retrieval performance can be 
expected by including higher-order queries in 
the inverted set, the space and machine time 
needed for such organization is prohibitive. 
This is one of the reasons why an IPS for first- 
order queries is preferable in almost all prac- 
tical cases. Another reason is that if the first- 
order queries are inverted, every retrieval can 
logically be performed by certain Boolean 
operations executed among a certain number 
of retrieved sets of accession numbers. A trade- 
off of space and time needed for the organiza- 
tion of a scheme and its retrieval performance 
might be the determining factor for the selec- 
tion of the set of canonical queries to be 
inverted. 

An attempt to overcome the limitations 
inherent in the inverted scheme can be found 
in the work on the balanced file organization 
scheme due to Abraham et al. [7]. By extract- 
ing the essentials, Yamamoto et al. [9, 10] 
defined a BPS (Balanced File-organization 
Scheme) in a wider sense as follows: 

(i) Buckets are organized in such a manner 
that every bucket is associated with more than 
one query. 

(ii) Every canonical query is associated with a 
unique bucket. 

(iii) The accession number of a record with 
some additional information is stored in a 
bucket once if and only if it is pertinent to at 
least one of the associated queries. 

The essentials of a BPS is to define a many- 
to-one transformation from the set of canon- 
ical queries Q to the set of bucket addresses B, 
or an MQAT (Multiple Queries to Address 
Transformation). An MQAT defines a parti- 
tion of Q into mutually disjoint subsets. It is 
a generalization of a QAT which defines an 
IPS. If c is the number of queries to be asso- 
ciated simultaneously with a unique bucket 
in a BPS, then the number of buckets to be 
prepared is 1/c, which is a drastic reduction 
from that for an IPS. This reduction makes 
it possible to extend the feasible range of 
canonical queries. Actually, in the system 
HUNDRED (Hiroshima University New 
Documents REtrieval and Dissemination), a 



51-to-l MQAT is used for the organization of 
indexes. 

Although some space overload may occur 
by storing some additional information in 
order to tell which record is pertinent to which 
query in a bucket, reduction of redundancy 
can be expected, because a record may be 
pertinent to more than one query simulta- 
neously. The reduction in the number of buc- 
kets and redundancy would contribute much 
in saving time and space needed for the or- 
ganization of indexes. 

Among those BFS’s defined by MQAT’s 
with a given data base Q, Q, and the number c 
of queries associated with the same bucket, 
one BPS can be called the best if its redun- 
dancy is the least under some reasonable as- 
sumptions imposed on the distribution of keys. 
Several combinatorial problems have been 
raised and solved in this connection [10]. 
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Decision Problem 

Suppose that we are given a set S and a propo- 
sition P(xi ,X 2 ,...,x„) for elements X; of S. 

Then we have the problem of universal validity 
of P, which is the problem of finding a general 
algorithm (i.e., a finitary procedure) by which 
we can discern whether P{x i , . . . , x„) is true for 
all n-tuples (x , , . . . , xj. The problem of finding 
an algorithm by which we can discern the 
validity of P(xi , . . . , x„) for some specifically 
chosen n-tuples (xj , . . . , x„) is called the prob- 
lem of satisfiability of P. These two problems 
are customarily called decision problems. The 
problems are such that affirmative solution of 
one of them implies negative solution of the 
other. 

To give a precise definition of decision prob- 
lems, let us note that a tfree semigroup with 
countable generators can be identified with a 
subset of the set N of natural numbers (by 
virtue of +Godel numbering; ^185 Godel 
numbers). On the other hand, if S is a given 
tformal system with countably many symbols, 
the set of all formulas in 6 is a subset of the 
free semigroup generated by the symbols in S. 
Thus the set of all formulas in S is identified 
with a subset of N. A subset M of N (or N x 
N X ... X N) is (general) recursive if its tre- 
presenting function is general recursive (-^ 356 
Recursive Functions). By using the concept of 
recursive function, a precise definition of the 
decision problem can be given as follows: The 
decision problem of M is solved affirmatively if 
and only if we can obtain effectively a proce- 
dure defining the representing function of M, 
and the function is general recursive. The 
decision problem of M is solved negatively if 
and only if we can obtain a proof that M is 
not recursive. 

For a set A of formulas in we let g{A) be 
the set of all Godel numbers corresponding to 
the elements of A. Let A' be the set of formulas 
in A that are deducible in S, and let x{A) = 
g{A'). The decision problem of the set A of 
formulas is said to be solved affirmatively 
(negatively) if the decision problem of x{A) is 
solved affirmatively (negatively). By refining 
this concept we arrive at the notion of the 
degree of (recursive) unsolvability. Let A and B 
be subsets of N. The relation “A is recursive in 
B and B is recursive in A" (-^ 356 Recursive 
Functions) is reflexive, symmetric, and transi- 
tive. Hence this relation decomposes the class 
of all subsets in N into disjoint nonempty 
equivalence classes. A and B are defined to 
have the same degree of unsolvability if they 
belong to the same equivalence class. Thus the 
degrees of unsolvability can be identified with 
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the equivalence classes. The degree of recursive 
sets is 0. 

The relation a <b is defined between the 
degrees a of A and b of B to mean “A is recur- 
sive in B.” Clearly, for any degree a, we have 
0 ^ a. The partially ordered system of degrees 
constitutes an tupper semilattice. 

Research on the decision problem has been 
done mostly in areas related to the tfirst-order 
predicate calculus and the formal systems 
on it. We now list some important results. 

(I) Results concerning The decision 
problem has been solved negatively for the sets 
of formulas of the following forms. (Here it is 
assumed that no funetion symbols appear and 
that represents a formula involving no 
occurrence of V, 3, or free variables.) 

(1) All formulas in (A. Church, A. M. 
Turning), 

(2) 3xj 3x2 • • • 3x„,Vyi Vy 2 ■ ■ • Vy„ '41 (T. Skolem), 

(3) 3xi3x23x3VyiVy2 ... Vy„3z 9f (K. Godel), 

(4) 3xi3x2VyiVy2 ... Vy„3z (L. Kalmar), 

(5) 3xi 3x2 Vy 3z2 . . . 3z„ 91 (J. Pepis), 

(6) Vx3yVz3«i3u2 ... 3u„ 91 (W. Ackermann), 

(7) 3xi 3x23x3 Vy 91 or 

3xj 3x2Vy 3z 91 (J. Suranyi), 

(8) 3xVyiVy2 3zi3z2 9lor 

Vx 3y Vz3u, 3u2 91 (Suranyi). 

The decision problem has been solved 
affirmatively for the sets of formulas of the 
following forms, where it is assumed again 
that no function symbols appear and that 91 
is as above. 

(1) All formulas involving variables only on 
predicates with one argument. (L. Lowen- 
heim, Skolem, H. Behmann), 

(2) VxjVx 2 ... Vx„, 91 (P. Bernays, M. Schonfin- 
kel, Ackermann), 

(3) VxiVx 2 ... Vx„,3yi3y2 ... 3y„ 91 (Bernays, 
Schonfinkel, Ackermann), 

(4) VxiVx 2 ... Vx„,3yi3y2VziVz2 ... Vz„ 91 
(Godel, Kalmar, K. Schutte). 

(II) Results concerning formal systems on 
Throughout the rest of this article we 

assume that no ^function variables appear. 
Predicate constants, function constants, and 
object constants may appear. By the decision 
problem for a formal system 2 we mean the 
decision problem for all tclosed formulas in 2. 
Most of the results obtained so far concerning 
the decision problem for formal systems have 
been negative. Such results include those for 
formal systems formalizing natural number 
theory, the theory of rational integers, the 
elementary theory of tgroups, trings, tfields, 
tiattices, and the like, and axiomatic set theory 
(A. Tarski et al.). 

The word problem for groups was solved 
negatively by P. S. Novikov (-^ 161 Free 
Groups B). In connection with this deci- 
sion problem, there are some investigations 
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by W. W. Boone, G. Higman, and others 
[14-17], 

The decision problem for a formal system 
formalizing the elementary theory of tAbelian 
groups has been solved affirmatively (W. 
Szmielew). Little is known about the decision 
problem concerning partial systems of for- 
mulas of given formal systems except the fol- 
lowing: (1) the decision problem for the set of 
formulas of the form ... Vx^ 91 in a 

formal system 161 Free Groups); (2) the 
Hilbert-type problem, which is the decision 
problem for the set of formulas of the form 
3xi 3x2 • ■ ■ (f = s) in a formal system. In 
particular, the Hilbert-type problem in a 
formal system formalizing natural number 
theory is called Hilbert’s tenth problem 
196 Hilbert). The latter is the problem of find- 
ing an algorithm for deciding whether a tDio- 
phantine equation has an integral solution. 

This decision problem was studied by M. 
Davis, H. Putnam, J. Robinson, and others, 
and finally Yu. V. Matiyasevich solved it neg- 
atively by showing that every recursively 
enumerable relation is Diophantine [9]. (A 
relation R(mj , . .. ,mj) is called Diophantine if 
there is a polynomial P(Xi , . . . , Xj, yi , . . . , y^) 
with integer coefficients such that Rim^, 
holds if and only if P(mi, ...,m^,yi, ...,yfc) = 0 
has a solution for y ^ , . . . , in natural num- 
bers.) In addition, some investigations have 
been made about the ''’second-order predicate 
calculus L^, tintuitionistic logic, etc. [1,2]. 
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Julius Wilhelm Richard Dedekind (October 6, 
1831-February 12, 1916) was born in the city 
of Braunschweig in central Germany and 
studied at the University of Gottingen under 
C. F. tGauss, who was then in his later years. 
He received his doctorate at Gottingen with a 
thesis on the tEuler integral. He was professor 
of mathematics from 1858 to 1862 at Zurich 
and from 1863 to 1894 at the Technische 
Hochschule in Braunschweig. During his early 
twenties, he wrote papers concerning analysis 
and the theory of probability, but in 1857 he 
began publishing papers on the theory of 
numbers. He edited tDirichlet’s lectures on 
number theory {Vorlesungen uber Zahlen- 
theorie, first edition 1863, fourth edition 1899) 
and concentrated on research in arithmetic 
and algebra. The theory of '^ideals, which he 
founded, was originally set out in a supple- 
ment (1863) to Dirichlet’s Vorlesungen. 

Dedekind treated subjects ranging from the 
axiomatic foundations of the theory of ideals 
to tlattices and 'f'groups as algebraic systems. 
He was a pioneer of the abstract algebra of the 
20th century. Among his notable achievements 
are the tDedekind zeta functions of talgebraic 
number fields, '•'Dedekind cuts in the theory of 
real numbers, the algebraic theory of +alge- 
braic functions (of which he was a coauthor 
with H. Weber), and the theory of natural 
numbers. He was one of the first to support 
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tCantor’s set theory. His theory of natural 
numbers was founded on the concept of sets 
and included the idea of ^recursive functions. 
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A. Degree of Mapping 



B. Local Degree of Mapping 

Suppose that M" and N" are n-dimensional 
oriented C' (or combinatorial) manifolds and 
/: is a continuous mapping. Suppose 

further that a point p of M" has a neighbor- 
hood U such that f{p)^f{q) for any point q 
contained in U—{p}. Then f induces a homo- 
morphism U — {p])->'H„{N,N — 

{/(p)}) of n-dimensional tlocal homology 
groups with integral coefficients that are both 
isomorphic to Z. If u and v are generators of 
the groups H„{U,U — {p}) and //„(iV,yV — 
{/(p)}) corresponding to orientations, re- 
spectively, then there exists an integer k such 
that fj^u) = kv. We call this integer k the local 
degree of mapping / at p. If M" and N" are 
closed oriented C'' manifolds (r ^ 1) and / ; M" 
is a C mapping, then there exists a point 
r of N" such that the set f~^(r) is a discrete 
subset {pi, ...,pt] of M", and each p,- has a 
neighborhood U, satisfying the foregoing con- 
dition (tSard’s theorem). If k; is the local de- 
gree of / at Pi, then = 



Let M" and JV" be n-dimensional ^closed 
^oriented tC" manifolds (or tcombinatorial 
manifolds). For example, M" = N" = S" (the 'En- 
sphere). Their nth ^homology groups with 
integral coefficients H„(M"; Z) and H„(N"; Z) 
are infinite cyclic groups generated by the 
fundamental homology classes [M"] and [N"], 
respectively (^ 201 Homology Theory). A 
continuous mapping / induces a 
homomorphism : W„(M"; Z)-*’//„(N"; Z), and 
there exists an integer df such that /„,([M"]) = 
dflN"']. This integer df is called the degree of 
mapping (or the mapping degree) of /. When 
M" = iV", df does not depend on the orienta- 
tion of M". 

If a continuous mapping is 

■’’homotopic to / (/ :^g), then we have df = dg. 

If / is homotopic to a ^constant mapping 
(/ ~0), then df = 0, while if / is a homeomor- 
phism, then df-±l. When M" = N", a homeo- 
morphism / : M"-+M" is called an orientation- 
preserving mapping if df—1 and an orientation- 
reversing mapping if — 1. 

Suppose that M" and N" are closed oriented 
n-dimensional combinatorial manifolds and 
that /; M”-> Af" is a tsimplicial mapping. Let 
and SjS” are n-simplexes of M", 

N", respectively) represent [M"], [iV"], and let 
Pj (resp. qj) be the number of n-simplexes ct" 
such that /(<t") is equal to s" (resp. — s"). Then 
Pj — qj is independent of the index j and equal 
to df. 

Suppose that / g :S"-^S" are continuous 
mappings (n^ 1). Then /~gi if and only if — 
dg (Brouwer mapping theorem). This implies 
that 7t„(S")^Z (-- 202 Homotopy Theory). 



C. Linking Numbers 



Given two mutually disjoint smooth closed 
curves Cj and Ci in Euclidean 3-space, a quan- 
tity Lk(Ci, C 2 ) indicating how closely they 
are interlinked with each other was given by 
Gauss as follows: Let C; be expressed by the 
parameters Xi — Xi{ti) (i= 1, 2), where Xi(t,) are 
+continuously differentiable. Then the quantity 



Lk(Q,C2) = 




1 



|X2-Xi|^ 



/ dxi dx2\ 

X deti X2 — Xj,— — ,— — \dt2dt1 
dt ^ dt 2 j 



is an integer called the linking number of C, 
and Cj. 

More generally, let M" be an «-dimensional 
oriented tcombinatorial manifold (or C mani- 
fold (r^ 1)) and K and K* its tcellular decom- 
positions such that K* is dual to K. Let z') and 
z* (r -I- s = u — 1) be tboundaries belonging to 
the complex K and K*. Suppose that is 
any tchain of K whose boundary is z\. Then 
the intersection number • [z|] does not 

depend on the choice of such a chain We 
set Lk( 25 ,Z 2 ) = [C''+‘] • [Z 2 ] and call it the link- 
ing number of zl and z^- The linking number 
Lk(z J , Z 2 ) of tsingular boundaries z\, Z 2 (r -t- s = 
n — 1) of M” is similarly defined by consider- 
ing the approximations z \ , z| of z \ , z| belong- 
ing to a suitable cellular decomposition K and 
its dual K*. The number Lk(zJ , Z 2 ) is bilinear 
with respect to z\, z\, and we have Lk(zJ,Z 2 ) = 
(_l)rs+iLk(z|^z^). In the example in 3-dimen- 
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sional Euclidean space shown in the left 
half of Fig. 1, we have Lk(z},Z 2 ) = 1, while 
Lk(z},z 2 ) = 2 for the example shown in the 
right half of the same figure. In particular, if z^j 
is homologous to 0 in M" — Izjj, then we have 
Lk(zi,Z 2 ) = 0 (Fig. 2). Generally, if Zj and z7 
are homologous in M" — \zl\, then Lk(zj,Z 2 ) 

= Lk(z7,z|). 




Fig. 1 

D. Order of a Point with Respect to a Cycle 

Let M" be an n-dimensional oriented com- 
binatorial manifold (or C'' manifold (r^ 1)) 
with the nth Betti number b„ = 0, z”"* an 
(n — 1 (-dimensional singular boundary of M", 
and 0 a point of M” that is not contained in 
|z”“'|. We set ord(z""\o) = Lk(z"^\o) and call 
it the order of the point o with respect to z""E 
For example, when M" = R ^and z‘ = {/(f)|0 
l,/(0)= /(!)}, where/ is a continuous 
function, the order ord(z/ o) is equal to the 
rotation number around o of a moving vector 
of(t) as t varies from 0 to 1. This ord(z/o) 
stays invariant as the point o moves in a ^^con- 
nected component of the complement R^ — 1 

(Fig. 3). On the other hand, if z/ = {/(t)|0^ 
l,/(0)=/(l)} (i = 0, 1) are closed curves in 
and the distance p(/Q(t), //r)) is smaller 
than p( /o(t),o) for all t in the interval [0, 1], 
then we have ord(zo, o) = ord(z} , o) (Rouche’s 
theorem). 




Fig. 2 Fig. 3 
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A. History 

For a real-valued function f{x) of a real vari- 
able to be tLebesgue integrable, it is necessary 
and sufficient that there exist an '•'absolutely 
continuous function F{x) such that F'{x)— f{x) 
at '•almost all points x (-- 221 Integration 
Theory D). In general, the derivative of a func- 
tion is not necessarily Lebesgue integrable. A 
function f{x) is Lebesgue integrable if and only 
if |/(x)| is integrable. Hence a function which 
has an improper Riemann integral is not neces- 
sarily Lebesgue integrable (^221 Integration 
Theory A). For this reason, it is desirable to 
extend the concept of Lebesgue integrals. In 
1912, A. Denjoy constructively defined a new 
eoncept of integrals (Denjoy integral in the 
restricted sense; ^ Section D), which is an 
extension of both Lebesgue and Riemann 
integrals. Later, N. N. Luzin provided the 
descriptive theory of this integral. Indepen- 
dently, and nearly simultaneously, A. J. Khin- 
chin and Denjoy defined a more general in- 
tegration (Denjoy integral in the wide sense 
(1916); - Section D). 

In 1914, O. Perron, independently of Den- 
joy, defined a concept of integrals (Perron 
integrals) that is equivalent to that of Denjoy 
integrals in the restricted sense. To establish 
this concept. Perron considered the differen- 
tial equation y' —f{x) and utilized a method 
similar to the one used in the proof of the 
existence theorem for the solution of the dif- 
ferential equation / = /(x, y). However, the 
concept of Denjoy integrals is inadequate to 
treat unbounded functions. Thus to extend 
the concepts of Riemann and Lebesgue inte- 
grals, various ideas have been introduced; for 
example, Denjoy (1921), J, C. Burkill (1951), 
and R. D. James (1950) introduced new con- 
cepts as byproducts of investigations concern- 
ing the coelficients of trigonometric series 
[2, 3]. The A-integral concept devised by A. N. 
Kolmogorov was meant to deal with the pro- 
blem of the conjugate function of Fourier 
series [4]. As a certain completion of the space 
of functionals of step functions, K. Kunugui 
defined the notion of E. R. integrals (1956), 
which coincides with that of A-integrals in a 
special case [5,6]. 

What has been stated so far deals only with 
functions of a real variable. Concerning the 
extension of Denjoy integrals to the case of 
several variables, researeh has been done by 
M. Loomis, S. Kempisty, S. Nakanishi (nee 
Enomoto), and others [7, 8]. 
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B. Approximate Derivative 

If we have j^^Qm{E r\{^ — h, ^ + k)} /{h + 

/c)= 1 at a point ^ of a tmeasurable subset E 
of the real line, where m is the 1 -dimensional 
Lebesgue measure, the point ^ is called a point 
of density for E. Almost all the points of E are 
points of density for E f Lebesgue’s density 
theorem). Let £ be a measurable set having Xq 
as a point of density, and let F{x) be a measur- 
able function on E. If there exists a number / 
such that for each e>0, Xq is a point of density 
for the set {x I / — e < (F(x) — F(xo))/(x — Xq) 
l + e,xeE}, then I is called the approximate 
derivative of £(x) at Xq and is denoted by 
ADF(xo). If ADF(xq) exists, F(x) is said to 
be approximately derivable at Xq. If F{x) is ap- 
proximately derivable at each point of £, then 
F(x) is said to be approximately derivable in 
£. If F'(x) exists at a point x, then AD F(x) 
exists at x, and we have ADF(x) = F'(x). How- 
ever, there exists a continuous function F(x) 
that is approximately derivable at almost all 
points of an interval and yet not differentiable 
at any point of a set of positive measure. 

C. Generalized Absolute Continuity 

Let £ be a set in R, and let F(x) be a real- 
valued function whose domain contains £. If 
for each e > 0, there is a ^ > 0 such that for 
every sequence {[a„,b„]} of nonoverlapping 
intervals whose endpoints belong to £ the 
inequality Yib„-a„)<S implies L|F(h„)- 
F(a„)| <e, then the function F(x) is said to be 
absolutely continuous on £. We denote by AC 
the set of all functions that are absolutely 
continuous on £. If F(x) is continuous on £ 
and E is the union of a countable sequence of 
sets £„ on each of which FeAC, then F(x) is 
ealled a generalized absolutely continuous 
function, and we write FeGAC. If FeGAC, 
ADF(x) exists almost everywhere. 

If, for each e >0, there is a 5 >0 such that for 
every sequence { of nonoverlapping 
intervals whose endpoints belong to £ the in- 
equality J^(b„ — a„)<s implies E„0{F; [a„,b„] } 
<e(0{F; [a„, h„] } denotes the oscillation of 
the function F(x) in \_a„, h„], i.e., the difference 
between the least upper bound and the great- 
est lower bound of the values assumed by F(x) 
on \_a„, h„]), then F(x) is said to be absolutely 
continuous in the restricted sense (or abso- 
lutely continuous (*)) on £; and we write Fe 
AC{*). Just as we defined the notions of gen- 
eralized absolute continuity and absolute con- 
tinuity, so we define the notions of generalized 
absolute continuity in the restricted sense and 
generalized absolute continuity (*). Thus Fe 
GAC{ * ) means that F(x) is a generalized ab- 
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solute continuous (*) function on £. If Fe 
GAC{*), then F'(x) exists almost everywhere. 

D. Definitions of Denjoy Integrals 

Let /(x) be a real-valued function defined on 
/ = [a, h]. If for /(x) there exists a function 
F(x) that belongs to GAC on I and for which 
ADF(x)= /(x) holds almost everywhere, then 
fix) is said to be Denjoy integrable in the wide 
sense (or D-integrable) on /. We call F{b) — 
F{a) the definite D-integral of /(x) over /, 
and denote the value by (D)J*/(x)dx. The 
function F(x) is called an indefinite D-integral 
of fix) on /. Similarly, we obtain the definition 
of Denjoy integral in the restricted sense (or 
D(*)-integral) by replacing GAC by GACi*) 
and ADF(x) by F'(x) in the definition of the 
D-integral. If a continuous function F(x) satis- 
fies the equality ADF(x)= /(x)^ +cx) (F'(x) = 
fix)i^ +oo) for all except eountably many 
points in /, then F(x) is an indefinite D- 
integral (D( * )-integral) of fix). A Lebesgue- 
integrable function is D(*)-integrable, a D(*)- 
integrable function is D-integrable, and a D- 
integrable function that is almost everywhere 
nonnegative is Lebesgue integrable. 

E. Constructive Definition of Integrals 

Let S be a functional whose domain (JjK(S;7) 
consists of the union of sets X(S; I) of real- 
valued functions defined on closed intervals 
/ = [a, h]. If / belongs to K(S; I), we denote 
the value Sif) by S(/; I). Such a functional S 
is called an integral operator if the following 
three conditions are satisfied: (1) If f eKiS; f) 
and / is an arbitrary interval contained in /, 
then the trestriction / to / of / also belongs 
to K(S; /). Also, Sif ; /) is a tcontinuous addi- 
tive function of the interval /c/. (2) Let / = 
[u, h], / = [b, c], and / = [a, c] (a < /)< c). If for 
a function / defined on /, / eK(S; 7i) and 
/2 6 KiS; f), where / and /2 then 
f eKiS; I). (3) If / is identically 0 on 7, then 
f e KiS; I) and Sif ;I) — 0. For two integral 
operators Si and S 2 , we say that S 2 includes Si 
(or Sj is weaker than S 2 ) if K(Sj ; 7) <= K{S 2 ;I) 
for every 7 and Sj (/; 7) = S 2 (/; 7) for every 
/g7C(Si ; I). The D-integral (D( * )-integral) is 
the weakest integral operator containing the 
Lebesgue integral and satisfying the following 
two conditions, (C) and (H) (resp. H( * )): (C) 
Cauchy’s condition. If, for every function / 
defined on 7 q, we have fjeKiS; I) for any 7 = 
la + S,b — s]'^I(,~ [a, 6], and also if the finite 
limit ^^0 Sif ; I) exists, then/e7C(S;7o) 

and Sif ; 7q) coincides with the foregoing limit 
value. (H) Harnack’s condition. Let £ be a 
closed subset of Iq, {f} be a sequence of inter- 
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vals contiguous to the set consisting of the 
points of E and the endpoints of /q, and / be a 
function on /q satisfying the following three 
conditions: (i) fj:eK{S;lo), where /e(x)=/(x) 
whenever xeE and /e — O otherwise; (ii) /* = 
f,^eK(S;Ik) for each k; and (hi) ZJS(/t;4)|< 
+00 and lim,(_,^ 0(S; /j^; /J = 0 when the se- 
quence {4} is infinite. Then it follows that 
/e X (S; lo) and S(f; lo) = S(/e; /o) + Z* 5(/, ; 4). 
(Here 0(S;4;4) denotes the variation of S(/J 
on 4, that is, the least upper bound of the 
numbers \S{fj;J)\, where J denotes any sub- 
interval of 4 ) We obtain condition (H(*)) by 
replacing condition (iii) in (H) with a more 
restrictive condition: ZkO(S;4!;4)< T^^e 
constructive definition of the Denjoy integral 
in the wide sense (the Denjoy integral in the 
restricted sense) is obtained by ttransfinite 
induction starting with the Lebesgue integral 
and using two methods, (C) and (H) (resp. 
(H(*)), of extensions. 



F. Perron Integrals 

Given a function f{x) defined on an interval 
{a, h], suppose that F{x) is a function defined 
on the same interval such that (1) F{x)^f{x); 

(2) F{x)^-oo (resp. (!') f (xK/(x); (2') F{x)^ 
+co) at every point x, where F(x) (resp. F(x)) 
denotes the tlower (upper) derivative of f (x). 
In this case, F{x) is called a major (minor) 
function of /(x). If for any e > 0 there is a major 
function \jj{x) and a minor function <p(x) of 
f{x) such that ij/{b) — (p{b)<£, then f(x) is 
said to be Perron integrable. We denote by 
the value inf^{il/{b)~il/{a)} ^ 
sup^{(p(b)-(p{a)}. 



G. Properties of Integrals 



If {/„} is a nondecreasing sequence of func- 
tions that are D-integrable on an interval 
[a, b] and whose D-integrals over [a, b1 consti- 
tute a sequence bounded from above, then the 
function /(x)==lim„_,j,(^/„(x) is D-integrable on 
\_a,b'], and we have 



(D) 



/(x)dx= lim (D) 



[ /«(x)dx. 



Ja 



If f (x) is a function of ^bounded variation 
and g{x) is a D-integrable function on an 
interval [a, h], then F(x)gf(x) is D-integrable on 
[a, by, moreover, denoting by G(x) the inde- 
finite D-integral of g{x), the following formula 
is valid: 



(D) 



F{x)g{x)dx 



= G{b)F(b)-G{a)F{a)- 



f G(x)dF(x), 



Ja 



where the last term is the tStieltjes integral 

(integration by parts). 

If F(x) is a nondecreasing function and g[x) 
is D-integrable on [u,h], there is a point ^ in 
[fl, £»] for which the following formula is valid: 

(£))| g(x)F{x)dx 



F(a) • (D) g{x) dx + F{b) ■ (D) 



g(x) dx 



(the second mean value theorem). 

The foregoing theorems remain valid if D 
is replaced by D( *) in the hypotheses and 
conclusions. 
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Descartes, Rene 

Rene Descartes, (March 31, 1596-February 
11, 1650), philosopher, mathematician, and 
natural scientist, was born in the province of 
Touraine in France. He became dissatisfied 
with his studies of scholastic philosophy in the 
Jesuit Academy in La Fleche, and later, in 
1619, while stationed in Ulm during a tour of 
duty in the army, he underwent a philosoph- 
ical conversion. He had an idea of methodo- 
logically unifying the various fields of interest 
to him using mathematics as a model. He 
returned to Paris in 1621, but moved to Hol- 
land in 1628 to concentrate on his work. Swe- 
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den’s Queen Christina invited him in 1649 
to that country, where he died the next year, 
evidently from a combination of cold and 
overwork. 

Descartes, often considered the founder of 
modern philosophy, discarded early the tra- 
ditional theological view of the world and 
stated that all knowledge should be recognized 
as logical only after it has been submitted to 
rational criticism. This ushered in the modern 
view of the world based on mathematics and 
physics. In 1637, he published Geometrie as an 
appendix to his Discours de la methode, which 
also contained his works on optics and meteo- 
rology. In it, he promoted F. tViete’s symbolic 
algebra, which he applied to geometric prob- 
lems. His idea that algebra could be used as a 
general method for geometry established him 
as the founder of tanalytic geometry. 
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Design of Experiments 

A. General Remarks 

The design of experiments is a part of the 
statistical planning required to collect the data 
appropriate to the purpose of statistical in- 
ference (—^401 Statistical Inference) in various 
fields of scientific research and application. 

The main purposes of the design of experi- 
ments are (1) to analyze a given statistical 
linear model 403 Statistical Models) and 
(2) to devise a “good” statistical linear model. 
Sometimes this term also refers to a statistical 
method including the tanalysis of variances. 
Thus the purpose of designing an experiment 
is to provide the most efficient and economical 
methods of reaching valid and relevant conclu- 
sions from that experiment 403 Statistical 
Models). 

R. A. Fisher, whose contributions to statis- 
tical theory were remarkable and far-ranging, 
propounded three required principles to con- 
trol the experimental field in order to guaran- 
tee the validity of statistical methods and to 
increase the sensitivity of experiments: (i) re- 
plication, or the repetition of the set of all the 
phenomena to be compared in the experiment, 
for the evaluation of experimental error var- 
iance; (ii) randomization, or the procedure 
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allotting various experimental treatments at 
random, in order to change systematic errors 
into random errors; and (iii) local control, 
or the procedure which makes the variation 
within each experimental block as small as 
possible, in order to minimize or remove sys- 
tematic errors. These are called Fisher’s three 
principles. A design satisfying principles (i) and 

(ii) is called a completely randomized design; it 
enables us to attach a probability statement to 
estimated treatment differences by obtaining 
valid estimate of experimental error variance. 

Let an n-dimensional trandom variable Y = 

( Yj , . . . , T„)' be represented by a linear model 

Y^X^ + W, (1) 

where X is a given nxs real matrix, ^ — 

...,^)' is an s-vector, and W = (IFi, ..., WJ is a 
random vector with the texpectation £(W) = 

0. Then Y is called the observation vector, W 
the error vector, 4 the effect vector of Y, and 
X the design matrix. 

According to the properties of the effect 
vector the linear model (1) is separated 
into three classes: (i) The class of fixed-effects 
models for which ^ is a fixed unknown para- 
meter. In this case, the component of ^ is 
called a fixed effect, and a linear function 7t = 
F'<^ of <^, with a given coefficient vector F is 
called a linear parameter or parametric func- 
tion. (ii) The class of random-effects models for 
which the components S) of ^ are random 
variables. In this case, each component is 
called a random effect, and ^ is denoted by S. 

(iii) The class of mixed models for which there 
are both fixed effects and random effects Ej 
in In this case, the model (1) becomes 

( 2 ) 

where ^ = (t^i , . . . , is a fixed-effect vector 

and = is a random-effect vec- 

tor. The conditions frequently assumed for the 
tdistribution law of Y are: (a) The errors 
(i= 1, ...,n) are uncorrelated and E{W^)= ... 

= E{W„) = Q. (b) The errors W, {i= 1, ...,n) have 
a common unknown tvariance (c) The 
errors Wiii — l, ... ,n) have the tnormal distri- 
bution. (d) The random effects are uncor- 
related and independent of the error vector W. 
(e) The random effects Ej have a common 
unknown variance The random effects 
Ej have the normal distribution. 

Let L{X) be a tlinear subspace of R" span- 
ned by the column vectors of X. The linear 
model 

Y = X^ + W (3) 

is called a hypothesis on the linear model (1) if 
L{X)ciL{X). 

The main issues of the theory of design of 
experiments are concerned with (I) statis- 
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tical inferences, such as estimation or testing 
hypotheses, under models (1), (2), (3), (i), (ii), 
(iii); (II) determination of the design matrix X 
satisfying certain optimal conditions; (III) 
construction of a theoretical foundation that 
can explain the validity of the statistical treat- 
ment of the observed data by means of the 
above models. 



B. Block Design 



The design of experiments is described here in 
terms of the so-called block design. There are n 
experimental units a = 1 , . . . , n called plots, and 
an observation is assigned to each plot a. A 
block is constructed with several plots under 
Fisher’s principle (iii), and the number of plots 
in a block is called the block size, the jth one 
being denoted hy kj,j=\, b, with kj = 
n. One of v operations, called treatments or 
varieties, is applied to each plot. It is assumed 
that the observation at the plot a in the 
jth block under the ith treatment has the 
structure 

The i = I , . . . , r, are called treatment effects, 
and the rjjJ, block effects. It is also as- 
sumed that — 0. In this case, Y is repre- 
sented in matrix notation as 



Y = 0(^ + 'F;/ + W, (4) 

where <F = ((p5,i), a= 1, ... ,n, i= 1, ...,u, with 

r 1 when the i th treatment is applied 
= < to the plot a, 

Lo otherwise. 



and T — (i/^^^), (X— 1, ...,n,;= 1, with 







when the plot a belongs to 
the jth block, 
otherwise. 



Here it is assumed that S; (Pou = 

r, > 1, Zi fi = n, Zj 1, and = 1. 

We call r, the number of replications of the ith 
treatment. We set N = — . Then is 

the number of observations in the yth block to 
which the ith treatment is applied. The matrix 
N is called the incidence matrix of the block 
design. 

In any experiment, each plot has its own • 
effect. The blocks are constructed so that this 
plot effect in each block becomes as homoge- 
neous as possible, although it is impossible 
to eliminate the effect completely. For this 
purpose, randomization is adopted. Suppose 
that we are given k plots in a block and v 
treatments {k^v). Then randomization is 



utilized to select a treatment out of v treat- 
ments to be allocated to each plot so that the 
selection is “at random.” Then the plot effects 
are random, and the error term in model (4) 
can be considered to be the sum of a plot effect 
and an original error. 

A block design satisfying Fisher’s three 
principles is called a randomized block design. 
Blocks that can accommodate all the treat- 
ments to be studied are called complete blocks. 
Those that do not contain all the treatments 
are called incomplete blocks. Blocking can be 
considered to be an extension of pairing. In 
the terminology of block design, we can say 
that many experiments are block experiments, 
or even (tautologically, if we allow those with 
only one block) that all experiments are block 
experiments. 

In a block design N = («ij), treatments Iq 
and ii are said to be connected if there exists a 
chain 



hJihJi ■•■h-iJih 



of integers such that I ^ip^v {p — 0,1,2, ... ,1), 
\^j,^b{q^\,2,...,l), and > 0, ritj^ > 0, 
ri: i >0, ... ,ri: / > 0, ri: i > 0. If all pairs of 
treatments are mutually connected, then the 
design is said to be connected. In this case, the 
rank of the matrix C defined in Section C is 
t> — 1. If the design is disconnected, then the 
incidence matrix N can be partitioned into 
two or more connected portions, e.g.. 





Thus without loss of generality we can restrict 
ourselves to the connected case. 



C. Estimation under the Fixed-Effects Model 

Conditions (a) and (b) of Section A are as- 
sumed here. The ’’'normal equation which gives 
the ’^least square estimates ^ and 7 of ^ and rj, 
respectively, is 




Set d)'<I) = diag(ri, .. . , = = 

diag(ki , . . . , kfc) = D;t, C = N’, Q = 

(O' — ff^') Y, where diag( . . . ) means a 

diagonal matrix with the diagonal elements 
Then (5) reduces to 

C|=Q, (6) 

Let L be an orthogonal matrix that transforms 
the matrix C to a diagonal form; that is, L'CL 
= diag(pj, ... ,py_j,0) = A, p,>0 for all i. Set 
A* = diag(pj“\ ...,p“4j,0), C* — LA*L' . Then ^ 
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= C*Q is a particular solution of (6) and S; 

A parametric function tt — F'^ with coeffi- 
cient vector F — (Fi , f 2 5 • • • > K)' is called a treat- 
ment contrast if the sum S, F, of coefficients 
vanishes. A treatment contrast n = F'l^ is called 
a normalized contrast if F'F = 1 . It is called the 
elementary contrast if F has only two nonzero 
elements 1 and —1. Elementary contrasts of 
treatment effects show the comparison of 
treatments involved in them. When a design is 
connected, any contrast n is testimable, and 
the tbest linear unbiased estimate of ti is jr = 
F'|. Furthermore, if f, is the eigenvector of 
the matrix C with unit length corresponding to 
an eigenvalue p; and F = then the var- 

iance of the estimate tz is given by o'^Siflf/Pi- 
The following properties are equivalent: (i) A 
design is connected, (ii) Any treatment con- 
trast is estimable, (iii) The rank of the matrix 
C is y — 1. (iv) The minimum eigenvalue 0 of 

is simple and other eigenvalues 9^ 
satisfy 0< ^ 1. (v) The maximum eigenvalue 

1 of is simple and other 

eigenvalues 02 satisfy 0 < 02 < ^ • (vi) There 
exists a positive integer p such that each ele- 
ment of N N ' is positive. It 
holds that 02 = 1 — 0i . 



D. Test of a Hypothesis H : (^j = . . . — in the 
Fixed-Effects Model 

Conditions (a), (b), and (c) of Section A are 
assumed here. The hypothesis H:^^ = = 

represented by 

Y-'Tj/-I-W. (7) 

Consider a direct sum decomposition R” = 

L(F) Ljt('F) + L^(<F, T') -t- of R", where 
LgiA) and stand for the torthocomple- 

ments of L{B) with respect to L{A) and R", 
respectively, and F^(l, 1, ..., l)'eR". The 
t projection operator matrices for the decom- 
posed subspaces L(F), Lp('F), L,j,(®,'F), and 
4, are denoted by P^, P 2 , P^, and P^, respec- 
tively. Then we have 

= P2 = T>D,-"F'-u-1£„„, 

P3 ^ (/„ - 'PDfc" ‘ ‘P'), 

P4 = /„-Pj-P2-P3, 

where is an a x h matrix whose entries are 
all unity and 1„ is the nx n identity matrix. 

The analysis of variance for the hypothesis 
(7) in model (4) is given by 

Y' Y = Y'Pi Y + Y'p2 Y -h Y'Pj Y 4- Y'P^ Y. 

This is called an intrablock analysis. A usual 
test for the hypothesis H is given by a critical 
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region with 

n-r-ft + l Y'PjY 

F = > constant 

r-1 Y'P^Y 

(-" 403 Statistical Models). 

E. Optimal Block Design 

A block design is said to be optimal when it 
minimizes the variance of the estimate of a 
normalized contrast n. Suppose that the num- 
ber V of treatments, the number b of blocks, 
and each block size kj,j=\, ... ,h, are given. 
Under each of the following criteria the corre- 
sponding block design is optimal in the sense 
indicated: For positive eigenvalues, p,, p 2 , 

. . . , , of the matrix C in Section C, (I) 

Pi is maximal (D-optimality); (II) min Pj is 
maximal (£-optimality); (III) pf^ is mini- 
mal (or the average variance of the estimates 
of all normalized contrasts of the parameters 

is minimal) (A -optimality). 

If Pi ^ ^ p^_ , ^ (n - b)/{v -l)i = p, say) 

and Uij is either 1 or 0, then the design is opti- 
mal for each of the optimality criteria (1), (II), 
and (III). In this case, we have 

C = pih-v^'EJ. 

Such a design is called a variance-balanced 
block design in which every normalized con- 
trast is estimable with the same variance. If 
Dpi N' - n"i £^,1 r' = p(/„ - £„i r') (or 
C = (l — p)(D,. — n'^rr')), every normalized con- 
trast is estimable with the same efficiency 1 — p, 
where r — D^E^^ and 0 <p < 1. Such a design 
is called an efficiency-balanced block design. 
For a block design, any two of the following 
properties imply the third: (i) The design is 
variance-balanced; (ii) the design is efficiency- 
balanced; (iii) the design is equireplicated. 

When all block sizes kj equal some number 
k independent of j, all numbers r,- of replica- 
tions equal some number r independent of i, 
and A;;. = (number of times that both 

treatments i and i' are applied to the same 
block) equals some number a independent of 
i and f, then the design is combinatorially 
balanced. The design is usually called the 
balanced incomplete block design (BIBD) if 
these three conditions are fulfilled and k < 

V. A BIBD is both variance-balanced and 
efficiency-balanced. If all the treatments are 
replicated the same number of times and the 
blocks are of the same size in a block design, 
then the only variance-balanced design is a 
BIBD, provided such a design exists. A BIBD 
is often denoted by BIBD(t;, b, r, k, X) and we 
have the relations vr = bk, X{v — 1) = r{k — 1) 
and r^£>, among the parameters; the last 
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relation is called Fisher’s inequality. The design 
is said to be symmetric when v — b. Further- 
more, if r = 6 is even, then r — X must be a 
perfect square. U v — b is odd, then the equa- 
tion = (r — -l-( — must have a 

solution in integers x, y, z not all zero. This is 
called the Bruck-Ryser-Chowla theorem. Neces- 
sary conditions for the existence of a BIBD 
have been obtained. One of these conditions 
is stated in terms of the Hasse-Minkowski p- 
invariant Cp{^) = (-1, -l)pn^i(A, -A-i)p. 
where is the tHilbert norm-residue 

symbol. Do = 1, and D, is the principal minor 
of the nxn matrix A. Another is described in 
terms of the embedding of a quasiresidual 
design into the corresponding symmetric 
BIBD. No effective necessary and sufficient 
condition for the existence of a BIBD has been 
obtained. In general, it is conjectured that for 
a positive integer k, with finitely many excep- 
tions, BIBD(r, b, r, k, 2)’s exist for all pairs v, / 
of positive integers satisfying r > k, /l(tJ — 1) = 0 
(mod k— 1) and Xv{v— 1) = 0 (mod k{k— 1)). 

The known methods of constructing block 
designs are of two main types, direct and re- 
cursive. Recursive methods are a way of con- 
structing designs from smaller ones. Direct 
methods yield easier constructions, but are 
applicable only for special values of the para- 
meters. A direct method of constructing a 
BIBD designates the treatments and blocks, 
respectively, as the points and subspaces of the 
■•■projective space and the ^affine space over a 
ffinite field. To explain another method of 
constructing a BIBD, we let G be an tadditive 
group of order n and x‘/’, he m treat- 

ments corresponding to each element x‘‘’ of the 
group (i = 1 , 2, . . . , n). The treatment x^‘* is said 
to belong to the ath class (a = 1 , 2, . . . , m), and a 
pair (x^'\ x^*) of treatments is called a dif- 
ference of type (a, fi, x‘^’) if x*‘^ — x*^’ = x*'’* ( 0). 
We can form t blocks of size k 



such that each block B,- contains exactly r 
treatments belonging to the octh class (a = 

1, ...,m) and among all pairs of treatments 
in the same block there are X differences of 
each type (oc, jS, x*'’’). Such a set of t blocks is 
called a difference set. The t blocks in a dif- 
ference set are called initial blocks. Given 
such a difference set, we can obtain nt blocks 
by joining elements of G to the elements of 
each B, (s = 1, 2, . . . , t). These nt blocks form 
a B\BD(v — mn,b — nt,r,k = rm/t,X)(^ 66 
Combinatorics). 

As a generalization of Fisher’s inequality, 
for an unequal-replicated block design with 
unequal block sizes, b'^v — 5 holds, where ^ is 



the multiplicity of the maximum eigenvalue 1 
of the matrix The equality sign 

holds if and only if the projection operator cor- 
responding to zero eigenvalue of D^'^^CD”'^^ 
is a zero matrix. 

F. Estimation in a Mixed Model 

Consider a block design (4), where every block 
has the same size k and every treatment has 
the same number r of replications. Let ^ be a 
fixed effect and tj a random effect denoted by 
H, and assume that W satisfies conditions (a) 
and (b) of Section A and that H satisfies con- 
ditions (d), (e), and (f) (where the Ej are re- 
placed by the coordinates Hj of H). If E{Hj) = y, 
j—l,...,b, then, changing the notation H — 
to H, we can rewrite (4) as 

Y = r7 + d)<^ + 'FH + W, (8) 

with £(H) = 0. The normal equation that gives 
the least square estimate of ^ is 

{C + + k(rf)~'^ Ci)| 

= Q + aHa^ + kaf)-^Q,, (9) 

where C and Q are the same as in (6) and 
C, = iVD,-i A' - rr - ^ , Q , = {ND, ^ 4^' - 

F')Y. Equation (9) cannot be solved 
unless the ratio : (if is given. When : af is 
not known, substituting in Aka} its ■fun- 
biased estimate given by analysis of variance, 
one obtains a solution of (9) which tends to a 
tconsistent estimate of ^ as the number of 
blocks tends to infinity. 

G. Estimation in a Random-Effects Model 

Let ^ (denoted by S) and H in the model (8) be 
random effects. Suppose that S, H, and W are 
mutually independent, and that the distribu- 
tions of S, H, and W are A(0, A(0, 4), 

and A(0, cr^/J, respectively. The distribution of 

Y in (8) contains four parameters y, of, oj. 
When kj<v, the tminimal sufficient statistic is 
generally incomplete, and therefore the op- 
timal estimate of y, of, and of cannot be 
determined. As an example, the minimal suffi- 
cient statistic for the random-effects model of a 
BIBD(r, b, r,k, X) is 

(I; Y'B^i Y, YT22 Y, YT3 Y, Y'B^ Y, YP,\), 

where 

Hr-x)-‘BTB-kr(r-/l)-'n-*£„„, 

B22 = /c((k-l)r-t-2)-V-' 

x{T-k-^BT){T-k ^TB), 

P^ = k-^B-k-^{r-X)-^BTB 

+ r/(r-A)“’n~‘£„„, 

P^ = I„-k-^B-P,„ P,^r-^T-n~^E„„, 
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with T=<P<1>' and B = In this case, 

E{Y'P^Y) = {n-v-h+l)a^, E{Y'P^ Y) = (h - 
v){a^ + k(T^). From these equations unbiased 
estimates of and cf can be derived, but their 
optimality is not guaranteed 396 Statistic). 

H. Factorial Experiments 

Suppose that there are h factors f i , . . . , 
which affect Y, and each factor has S; levels 
{i=\,...,h). It is assumed that r = x $2 x ... x 
Sft treatment combinations are derived by all 
the combinations of the levels of h factors, and 
that V treatment effects are represented by the 
sum of subeffects called main effects and inter- 
actions, or factorial effects covering both. Such 
an experiment is called a factorial experi- 
ment or factorial design; it allows us to ex- 
amine the effects of two or more factors, each 
factor being applied at two or more levels, 
by testing all possible treatment combinations 
formed from the factors under study. Specifi- 
cally, suppose that we have the case h — 2 , 
called a two-way layout. Let main effects be 
denoted by ^ , ... , 4\)', 

and interaction by = 
where = 0, E , = 0, E; = E, = 0. 
When there is no restriction on the number of 
observations, each of t) = Si x Sj treatment com- 
binations is replicated t times. Components n-k 
of the observation vector Y are represented 
by a linear model 

Yijk = y + ii + + ^1/ + ^ijk^ 

i=l,...,Si; j— 1,...,S2; 

or, in vector notation, 

Y = r7 + X,^‘+X2^'+Yi2^'" + W. 

The analysis of variance in this case is given by 
Y'Y^^Y'P^Y, 

i 

where P, is the projection operator matrix 
on the subspace derived by a direct sum 
decomposition 

R" = L(F) + L^{X,) + 4(^2) + 2 ) 

T Lif , 

-'12’ 

where n = s,S2t- Denoting by Yij. the arith- 
metic mean of over the subscript k and 
using similar notation yj.., Yj., and T..., we 
have Y'P^Y = nf^.,Y’P2Y ^S2t'L{%-Y...)\ 
YT3 Yj= s, r_E( Yj. - Y.f, YT4 Y = tE ( 4 + 
Y.-Y,-Yj.)\ Y'P,Y^nYijk-Y,j.f. The 
^analysis of variance table is given in Table 1. 
For h ^ 3 similar models can also be con- 
sidered for dealing with interactions up to h- 
factor. The factorial experiment is said to be 
symmetric if Sj = S2 = . . . = s^, = s and is called 
the s'" factorial experiment. Otherwise it is 



Table 1 Analysis of Variance 



Source 


Sum of 
Squares 


Degrees of 
Freedom 


Fj main effects 


Y’P^Y 


Si-1 


F2 main effects 


YF3Y 


S2-I 


Fj F2 interactions 


YF4Y 


(Si- 1)(52~1) 


Error 


Y'FjY 


SiS2(t-l) 



said to be asymmetric and is called the 
Si X S 2 X ... X 5ft factorial experiment. 

I. Fractional Factorial Designs 

Factorial designs require at least an experi- 
ment for all the combinations of levels of 
factors under consideration. When there are a 
large number of treatment combinations re- 
sulting from a large number of factors to be 
tested, it will often be beyond the resources of 
the investigator to experiment with all of them. 
For such cases Finney {Ann. Eugen., 12 (1945)) 
proposed a method in which only a fraction of 
the treatment combinations are experimented 
with. A design of this type is called a fractional 
factorial design. One reason for the usefulness 
of fractional factorial designs in preference to 
factorial designs is that they involve a smaller 
number of treatment combinations, since, in 
most scientific experiments, it is usually found 
that a large number of the higher-order inter- 
actions are negligible. The crucial part of the 
specification of fractional factorial designs is 
the suitable choice of the defining or identity 
relationship. Equating the nonestimable fac- 
torial effects for the selected fraction of treat- 
ment combinations with / gives the identity 
relation. Each factorial effect is not estimable 
after selecting a fraction of treatment com- 
binations, and any contrast of the selected 
treatment combinations represents more than 
one factorial effect. All factorial effects repre- 
sented by the same treatment combinations 
are called aliases. In aliases, by assuming that 
other interactions are negligible when com- 
pared to the one of interest, estimation can 
be made by means of the corresponding con- 
trast of the selected treatment combinations. 
When all factorial effects of order higher than 
/ are assumed to be zero, a fractional factorial 
design is said to be of resolution 2/ -I- 1 if it 
satisfies the condition that under the usual 
model all factorial effects up to order I are 
estimable, whereas a fractional factorial design 
is said to be of resolution 21 if it satisfies the 
condition that under the usual model all fac- 
torial effects up to order /—I are estimable. 

In the beginning, the theory was developed 
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for an orthogonal fractional factorial design in 

which the estimates of various effects of inter- 
est are all uncorrelated. However, these are 
available only for special values of the number 
n of treatment combinations and are in general 
uneconomical in the sense that they require a 
large value of n in comparison with the num- 
ber of unknown effects. As a generalization of 
orthogonal fractional factorial designs, I. M. 
Chakravarti (Sankhya, 17 (1956)) introduced 
the concept of balanced fractional factorial 
designs. In these designs, the estimates of 
effects are not always uncorrelated. However, 
the tcovariance matrix of the estimates is 
invariant under any permutation of factors. 
Balanced fractional factorial designs are flex- 
ible in the number of treatment combinations, 
with the result that more experimental situ- 
ations can be handled. These two kinds of 
fractional factorial designs can be constructed 
by using orthogonal arrays and balanced 
arrays, defined in Section L. All alias relations 
can be derived from the identity relation in 
orthogonal designs. 

In the model (1) for a design, the variance of 
the estimates of estimable linear functions of ^ 
depends on the matrix X’X which is called the 
information matrix. For eigenvalues of XX, 
the D-, £-, and A -optimalities of fractional 
factorial designs can be defined similarly to (I), 
(II), and (III) given in Section E. An ortho- 
gonal fractional 2* factorial design is D-, E- 
and A -optimal. A fractional factorial design is 
said to be saturated if the number of treatment 
combinations is equal to that of parameters in 
the model to be estimated. However, since 
saturated designs do not provide an estimate 
of experimental error, their use should, in 
general, be confined to those situations in 
which a prior estimate of experimental error 
is available. The existence of a symmetric 
BIBD(4f— l,4t — l,2r— l,2t— l,r— 1) implies 
that of an orthogonal saturated fractional 
2"’'' ^ factorial design of resolution III. 

J. Application of Algebras 

In the theory of design of experiments, the 
ideas of association algebra and relationship 
algebra play an important role. Let A; be a 
vxv symmetric matrix with entries 0 or 1 (i — 

0. 1. ... , m). If a set {A,- 1 / = 0, 1, . . . , m} satisfies 
the conditions 

'^o = ^v, = 

i=0 

and there is a nonnegative integer pj^ for every 

1, j, k such that 

m 

!=0 



then the A; are called association matrices. In 
this case, there exists a natural number n,- for 
every i such that = holds. If an (a,^) 
component of A^ is unity, then the treat- 
ments a and are said to be the ith associates. 
The talgebra M over the real number field 
generated by the matrices Aq, Aj, ... , A„, is 
called the association algebra, is commuta- 
tive and tcompletely reducible, 
is the tregular representation of si. There is a 
nonsingular matrix U = (Uy) that transforms 
all ^ into diagonal matrices simultaneously: 

diag(zoi, . . . , z„,;), ( = 0, 1 , . . . , m. 

( m \ — 1 m 

Z Z i-0, l,...,m, 

/=o / j=0 

are mutually orthogonal idempotent elements 
of.fi/. For example, consider the case where 

h, ® ^3 = -h)® (E.,s, - h,) 

with m = 3, where ® means the tKronecker 
product. The algebra si generated by Aq, Aj, 
A2, A3 is called an £3 type association algebra. 
The association matrices Aq, Aj, A 2, and A3 
correspond to the relationships between the 
treatments in a two-way layout. The mutu- 
ally orthogonal idempotent elements in this 
case are A% (g) 53 ‘ Af =(/,,- 

Sr^-Es,.,)® ^2" =Sr‘E.SiS, ®(^5^- 

»2 and Af = [I, ^ E,^s,)®ih,~ 

For the factorial experiment with h 
factors (h > 3), association matrices can be 
constructed in a similar way. If h = 3, the num- 
ber m of association matrices is 7. Many types 
of association schemes are known, and some 
of them for m = 2 are classified as group divis- 
ible, triangular, Latin square, cyclic type, and 
so on. For a group divisible type of i; = s,S2, 

^0 ~ —h, ® ~ A2 = — Aq — 

Ai; Af^v~^E,„ Af = {(s, - 1) (Aq + AJ- 

A2},A2#=S2-‘{(S2-1 )Ao-Ai}. 

An experimental design consists of a set of 
n experimental units called plots. Define a 
relationship R between the plots as a set of 
ordered pairs (i,j) of plots. A relationship R 
among a set of n plots can be expressed as a 
symmetric nxn matrix (ry) of O’s and I’s: 

r 1 if the ith plot is related to the jth 
3 P^ot by the relationship R, 

Lo otherwise, 

and this matrix is also denoted by R. If there 
are k types of relationships Ri, ...,R^ among n 
plots, the algebra over the real number field 
generated by the matrices K is called 

the relationship algebra of the design. ^ is a 
tsemisimple algebra. 

Example (1). The relationship algebra ^ of 
the factorial experiment with h = 2 and t repli- 
cations is generated by the following matrices 
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over the real number field: Aq® It, 82 = 

Aq ® (£„ - Q, 82 = A^® E,„ 84, = A2® Ett, 

^5 = ^43® £„, where At,i — 0, 1, 2, 3, are the 
association matrices of £3 type- The mutually 
orthogonal idempotents that correspond to 
the ttwo-sided ideal decomposition of ^ are 
Bf ^A*®t ‘£„, B^=Af ® r‘£„, Bf = 

A* ® t~^E,t, B* =Af 0 £■’£„, and Bf = 

^0 ® (It ~ t ~* £«)• These are the same as the 
projection Pi in the two-way layout design 
given in Section H. 

Example (2). Consider a block design with v 
treatments, each having the same number r of 
replications, and with b blocks, each having 
the same size k{<v). Suppose that association 
matrices .4,-, j = 0, 1, . . . , m, are given, by which 
the associations among the treatments are 
defined. Let be the number of blocks to 
which the ith associate treatments a and are 
applied. The design is called the partially bal- 
anced incomplete block design (PBIBD) if 2^^ 

— 2; ^ 0 independently of a and p. When m = 1, 
the design is a BIBD. Let the observation 
vector be represented by (4). The relationship 
algebra 01 of a PBIBD is generated by n x n 
matrices /„, £„„, Ti = <t>Ai<b', i = 1, 

2, where N N' — — PiAlf , 

Pi = Sj™ 0 ^j^ij (0 ^ p, < r/c; ( = 0, 1 , . . . , m), and p„ 
^rk^'ZT=o*^i^i- Write T* =0/4i^<h'. Accord- 
ing as Pi — rk,0< pi < rk, or p; = 0, L(T*) is said 
to be confounded with the blocks, partially 
confounded with the blocks, or orthogonal to 
the blocks. ^ is noncommutative, completely 
reducible, and isomorphic to the algebra of 
matrices of the type shown in Fig. 1. 




Fig. 1 

The analysis of variance of a PBIBD is 
given by a decomposition of /„ into mutually 
orthogonal idempotent elements of For a 
PBIBD, the matrix C in (6) is of the form C = 

, where i; = r — /c“Vi- For a connected 
PBIBD, all Af 4, i— 1, 2, ...,m, are estimable. 
If there exists a group divisible PBIBD(m = 2) 
with 22 = 2 i -h 1, then it is £- and A-optimal. 

K. Design for Two-Way Elimination of 
Heterogeneity 

Consider a design with v treatments in a « x w 
rectangular block. The row effect and the 
column effect of this block are denoted by t] 
and V, respectively. Thus the observation vec- 
tor Y is of the form 
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where the definitions of F, <D, and fl are 
similar to those for block designs. Set £ = 0'n, 
M = 0)''P, D.-cD'O), F = D,-w-^LL'-u~^MM' 
-I- (uw)“^ ££„„£'. The matrix £ plays a role 
similar to that of the matrix C in (6). When the 
rank of £ equals r — 1 , the design is said to be 
connected. If 

£-T(/,-r-i£J, (11) 

then the design satisfies the optimum criteria 
(I), (II), and (III) given in Seetion E. When u — 
w = v and (11) holds, the design is called the 
Latin square. When u — v and (11) holds, the 
design is called the Youden square. When u>v 
and (11) holds, the design is called the Shrik- 
hande square. In a Youden square design treat- 
ments are grouped into replications in two 
different ways, i.e., rows and columns, where 
rows constitute a BIBD, whereas columns are 
complete blocks. The existence of a Youden 
square design is equivalent to that of a sym- 
metric BIBD. 

If the associations among v treatments are 
defined in terms of association matrices, the 
partially balanced design for two-way elimi- 
nation of heterogeneity can be defined in a way 
similar to the PBIBD. In this case, the equa- 
tion F = '^iXiAf holds, and if j?/ = E^^, — I^}, 

then (11) holds, hence the optimum criteria 
are fulfilled. The definition of the relationship 
algebra 0 ^ of a partially balanced design for 
two-way elimination of heterogeneity is similar 
to that used for the PBIBD. ^ is isomorphic 
to the algebra of matrices shown in Fig. 2. 




Fig. 2 

The analysis of variance of this design is 
given by a decomposition of /„ into mutually 
orthogonal idempotent elements of 

L. Balanced Array and Orthogonal Array 

Suppose that T is an n x h matrix with entries 
from a set /I of s ( 5= 2) distinct elements. Con- 
sider the s' 1 X t matrices X =(xi,X2, ...,x,) 
that can be formed by giving different values 
to the X;(g T), / = 1 , 2, . . . , f . Suppose that asso- 
ciated with each 1 x t matrix X there is a non- 
negative integer 2(xi,X2, ...,x,) which is invar- 
iant under any permutation of a given set 
{xi,X2, ...,xj. If, for every t-columned sub- 
matrix of T, the s' 1 X f matrices X occur as 



Y = Fy + (hc> -H n V + W, 
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rows Mx^,X 2 , ■■■,x^) times, then the matrix T 
is called the balanced array (BA) of size n, h 
constraints, s levels, and strength t having the 
index set {/{x, , . . . , x,)}. Such an array is de- 
noted by BA(n, h, s, t); n is also called the num- 
ber of treatment combinations. In particular, 
when /l(x,, ...,x,) = / for every Xj, ...,x„ such 
an array is called an orthogonal array (OA) of 
size n, h constraints, s levels, strength t, and 
index A, which is denoted by OA(«, h,s, t). 

We have n = As'. Balanced arrays have the 
advantage that they can be constructed 
with fewer treatment combinations than 
the orthogonal arrays for given h, s, t para- 
meters. The transpose of the ineidence ma- 
trix of a BIBD(i!, b, r, k. A) is, for A == {0, 1}, a 
BA{b,v,2, 2) with A(0,0) = h-2r + A, A(1,0)- 
A(0, l) = r — A, and A(l, 1) = A. There is a close 
relation between the existence of an OA and 
that of a BIBD. For example, the existence 
of an OA(s^, h, s, 2) is equivalent to that of 
h — 2 mutually ^orthogonal Latin squares of 
order s that is for h = 5 -t- 1 equivalent to the 
existence of a BIBD(s^,s(s-l- l),s+l,s, 1). The 
existence of an OA(4t, 4t — 1, 2, 2) is equivalent 
to that of a symmetric BIBD(4t — l,4t — 1, 

2f — 1, 2t — 1 , t — 1 ) which is also equivalent to 
the existence of an OA(8r, 4t, 2, 3). 

Balanced arrays and orthogonal arrays play 
a vital role in the construction of symmetric 
and asymmetric confounded factorial experi- 
ments and fractional factorial designs. In an 
OA{n, h, s, t), regarding columns and entries as 
h factors and levels of a factor of the column, 
respectively, each row corresponds to a treat- 
ment combination. In this case, this ortho- 
gonal array gives an orthogonal fractional s' 
factorial design of resolution r-l- 1 with n treat- 
ment combinations. A necessary and sufficient 
condition for a fractional 5 * factorial design 
of resolution 21 -t- 1 to be orthogonal (resp. 
balanced) is that the design be an orthogonal 
(resp. balanced) array of strength 21, provided 
that the information matrix of the design is 
nonsingular. 

M. Response Surface 

If all the factors represent quantitative vari- 
ables, such as time, temperature, amount of 
ingredients, etc., it is natural to think of the 
yield or response Y of the experimental results 
as a continuous function of the levels of these 
factors. We can write a functional relationship 

Y^=f(xi^,X 2 ^,...,xJ+W^, at = l,2,...,n, 

where represents the ath observation in the 
designed experiment and x^^ represents the 
level of the ith factor in the ath observation. 
The function / or the surface defined in (h+ 1)- 



dimensional tEuclidean space by / is called the 
response surface. The residual measures the 
experimental error of the ath observation. A 
knowledge of the function / gives a complete 
summary of experimental results and also 
enables us to predict the response for values of 
the Xjj, that were not tested in the experiment, 
or to determine such combinations of the 
values of variables X;j that give the max- 
imum (or minimum) value of f. When the 
mathematical form of / is not known, the 
function can sometimes be approximated 
satisfactorily within the experimental region, 
which is relatively small, by flexible graduating 
functions, such as polynomials of degrees 1 
and 2 in the variables Xj^: 

I — 1 

= ft + Z fti^?« + E PijXio^^ja + K- 

1=1 i=l i<j 

The coefficients ... are parameters to be 
estimated from the data. Thus, (i) when the 
form of the true / is assumed known, the ob- 
ject is to estimate the parameters; (ii) when the 
form of the true / is unknown, the object is to 
approximate the / by some graduating func- 
tion- Designs appropriate for (i) and (ii) are 
called designs for estimating parameters and 
designs for exploring a response surface, respec- 
tively. Some experimental designs that have 
been developed for fitting polynomials of the 
first and second degrees are called first-order 
designs and second-order designs, respectively. 
The problem here is to increase the precision 
of fitting response surfaces by appropriately 
choosing n points in a given experimental 
region S of variable x. That is, how do we find 
the experimental region that interests us in the 
sense of finding optimum conditions and, 
having found it, how do we design experiments 
to map / over the region? As optimum criteria 
of allocations, corresponding to (I) and (II) 
given in Section E, consider the following; 

(IV) the t generalized variance of the estimates 
of coefficients is minimal; (V) the supremum of 
the variance of estimates of the expectation 
E{Y) of response Y is minimal for all variables 
xeS. For example, when an observation Y is 
given by a polynomial regression of a variable 
xas Y^ = Po + piX^ + p 2 X^ + ...+PhXa + K,if 
for the t Legendre polynomial /^(x) we allocate 
an experiment replicated at x = ± 1 and h — 1 
roots of Pft(x) = 0 the same number of times, 
then it is an optimum design in the sense of 
(IV) and (V). 

As a design for fitting first-order response 
planes, we can use a 2'' factorial design (or its 
fraction) of resolution III, since it is sufficient 
to consider only linear effects for each of h 
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factors. In comparing first-order designs, we 
take as an optimum criterion the minimum of 
the average variance of the estimates of the 
coefficients, corresponding to (III) of Section 
E. For a design for fitting second-order re- 
sponse surfaces, we need at least three levels 
of each factor for estimating all effects up to 
quadratic ones; then we can use a 3’' factorial 
design (or its fraction) of resolution V for esti- 
mating the linear x linear component of inter- 
actions of two factors. In this case, treatment 
combinations are too large in number, so we 
can adopt central composite designs, which are 
constructed by adding further treatment com- 
binations to those obtained from a 2'* factorial 
design (or its fraction). If the coded levels of 
each factor are —1 and -hi in the 2^ factorial 
design, the {2h+ 1) additional combinations 
are (0, 0, . . . , 0), ( + d, 0, . . . , 0), (0, ± d, 0, . . . , 0), 

. . . , (0, . . . , 0, ± d). The total number of treat- 
ment combinations to be tested is 2'" -f 2/j -h 1 
( < 3'' for /i ^ 3). The value of d can be chosen 
to make the coefficients in the quadratic poly- 
nomials as orthogonal as possible to each 
other or to minimize the bias that is created 
if the true form of the response surface is not 
quadratic or to give the design with the prop- 
erty of being rotatable. If Tq is the estimated 
response at (xjo,^ 2 o> the response 

surface design is said to be rotatable if and 
only if the variance of Tq is a function only 
of the distance p = (xj 0 + ^ 20 + ••• + of 

(xio,X 2 o, from the origin, so that the 

variance contours in the experimental region 
of the variables are circles, spheres, or hyper- 
spheres centered at the origin. Now, when Y— 

^0 + ^ 1 X 1 + ^2^2+ (322xj + 2^l2^lX2 + 

W with for 1, 2, an optimum experi- 

ment in the sense of (V) is given by allocating 
0.0960 at the origin, 0.0802 at experimental 
points (1,0), (-1,0), (0, 1), (0, -1), and 0.1458 
at experimental points (1, 1), (1, —1), (—1, 1), 

( — 1, —1), respectively. However, this design is 
not rotatable. Instead of an optimum criterion 
(V), we can consider a region S which is not 
always equal to the original experimental 
region S and a criterion: (V') The supremum 
of the variance of estimatesof the expectation 
E{Y) of response Y is minimal for all variables 
xeS. Within xf-l-xl^c^, an optimum alloca- 
tion depends on the value of c, and it is shown 
that a rotatable design is obtained only if c 
is in a certain range. 

There are many topics in the theory of ex- 
perimental design besides the ones mentioned 
in this article (^ [4] for multiple comparison, 

[9] for confounding designs that are factorial 
experiments in which the block size is reduced 
and in which for each block a fractional of all 
the treatment combinations are tested, [10] for 
split-plot designs in which certain main effects 



are confounded with blocks, [15] for weighing 
designs). 
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Determinants 

A. Definition 

The determinant of an n x n ^matrix A — (ajJ 
in a tcommutative ring R is defined to be the 
following element of R: 

Yisgn P)a,p a 2p^...a„p^, 

where 

p=(‘ " ■ 

\Pi P 2 Pj 

is a permutation of the numbers 1 , 2, . . . , «, 
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sgn P denotes the sign of the permutation P 
(that is, sgnP= 1 if P is an even permutation 
and sgnP= —1 if P is an odd permutation), 
and the summation extends over all n! per- 
mutations of 1,2, The determinant of A 
is denoted by 

di2 ••• 

Qii U22 ••• « 2 n 

^nl ^m2 

It is written la,*) or j/11 and is also denoted by 
del A. Usually we suppose that R is the field R 
of real numbers or the field C of complex 
numbers, but the following theorems are also 
valid for cases in which R is any commutative 
ring, unless otherwise stated. 



B. Relation to Exterior Algebras 

Consider an texterior algebra (+Grassmann 
algebra) of a linear space (Tree module) of 
dimension n over R with a basis (e 1 , ^2 , . . . , e„). 
Set 

e; = aaei + a,.2e2 + ... + a,„e„, 

where a^sR. Then we have e\ /\e'2 ^ /\e'„ 

— \aii^\ei A62 A ... Ae„. Conversely, we can 
define the determinant |a,7j by this relation. 
The properties of determinants can be easily 
deduced from those of exterior algebras. 



C. Fundamental Properties of Determinants 

(1) The determinant of the Transposed matrix 
'A of a matrix A is equal to the determinant of 
A. Hence the theorems stated for rows are also 
valid for columns. 

(2) If the elements of one row (column) of a 
matrix are multiplied by a factor c, the deter- 
minant of the matrix is also multiplied by c. If 
the elements of one row (column) of a matrix 
are zero, its determinant is equal to zero. 

(3) If from a matrix A =(o;j), we obtain two 

matrices A' and A" by replacing one row, for 
instance the ith row, by ,-..., a •„ and by 

Uii +n-i, -I- a-„, respectively, then \A"\ = 

|A| + |A'|. This relation is equally valid for a 
column. 

(4) If we obtain Aq by a permutation Q on 
the rows of a matrix A, then |AqI = (sgnQ)|A|. 
In particular, if two rows (columns) of a ma- 
trix are interchanged, then the determinant 
changes sign. 

(5) The determinant of a matrix is zero if 
two rows (or columns) are identical. 

(6) The determinant of a matrix is not 
changed if the elements of any row (column), 
each multiplied by the same factor, are added 



to the corresponding elements of another row 
(column). 

(7) Suppose that R has unity element. Let x,* 

variables in R, and denote 
a function (having its values in R) of these 
variables by (p{X), A = (x,J. Assume that (p{X) 
has the following properties: (i) if the elements 
of one row of X are multiplied by a factor A, 
the value of cp is also multiplied by A; (ii) if we 
obtain two matrices X' and X" by replacing 
one row of X, for instance the ith row, by 
Xn, ...,x;„ and by x,i +x;-i, ...,x,„ + x';„, respec- 
tively, then (p{X") = (p{X) + (p{X’}; and (iii) if 
two rows of X are equal, q){X) = 0 . Then (p{X) 
= c|A| for some constant c (in R). 

(8) Suppose now that R is a field K. Assume 
that a function <p(X) (in K) has the following 
properties; (i) if the elements of one row in 

X are multiplied by A, the value of (p is also 
multiplied by A; and (ii) the value of <p is not 
changed if the elements of any row are added 
to the corresponding elements of another row. 
Then (p{X) — c\X\ for some constant c (in K). 



D. The Laplace Expansion Theorem 

Let A = (n,J be an n x n matrix. Take r-tuples 
(/,,..., ir) and (fci , . . . , k,), where L and k^ be- 
long to {1, ...,n} and ij < ... <i^, k, < ...<k,. 
Let (L+i , • • • , ij and {k^^ ,k„)he {n — r)- 
tuples such that L+i < . . . < i„, k,.+i < . . . < k„ and 
{ ii , . . . , i,., i,.+j , . . . , i„} {kj , . . . , k,., k^+ j , . . . , k„} 

= { 1 , . . . , n} . Let a,i^ , *,) be the deter- 

minant of an r X r matrix whose {p, q)- 
component is the (ip, k,^)-component of A 
for each p and q. We call this determinant 
a minor of degree r of the matrix A. (The 
corresponding submatrix of A is sometimes 
also called a minor of A.) In particular, if 
(/j ,...,(,.) = (k, , ... , kr), then it is called a princi- 
pal minor. Furthermore, we define the cofactor 
of the minor a,; ; ^ of A to be 

‘rH*i V) ~ ^(^r+i ‘n^(^r+i 

where A = ii -I- ... + k and p = ki-t ...+k^. In 
the particular case r=l, the cofactor of a^_ is 
fl,^ = ( — l)''^'‘A;j, where A, 7, is the determinant 
of the (n — 1) X (n — 1) matrix obtained from A 
by eliminating its ith row and kth column. For 
simplicity, we abbreviate (ij, ...,k), (k,, ...,kj, 
and (y'l, ... ,yV) as (i), (k), and (y), respectively. 



Then we have 






1 

U) 1 


\\A\ 


if (i)-(k), 


( 0 


if (iV(k), 


1 

(./■) 1 


\\A\ 


if (i) = (k). 


to 


if (i)^(k). 



where Sy) means that the sum is taken over all 
combinations (y). This is called the Laplace 
expansion theorem. If a matrix A has the form 
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B 0\ 


{B A 




or 


U c 


VO c 



and B and C are square matrices, then by this 
theorem we have 1 4 1 = 1 6| | C|. If we number 
the combinations (i) — (i,, and (/c) = 

(fc appropriately (for instance, in lexi- 
cographical order) and regard the numbers 
assigned to them as row numbers and column 
numbers, respectively, to form a matrix 
then the Laplace theorem can be expressed as 

/Ml 0 1 

' 0 ' Ml/ 



In the particular case r= 1, we have 
A\ if i = /c, 



j=i 

^ dji^ dji 

j=i 



0 if iV k, 

\A\ if i = /c, 
0 



E. Product of Determinants 

Let A — ia^) and B = {bnJ be two nxn matrices. 
For the product AB = C = (c,J, where C;^ = 
a/jbjk (i, /c= 1, ...,n), we have \AB\ = 

|/1| |B|. The tinverse matrix A~^ exists for an 
nxn matrix A — (u,j) if and only if 1 /1 1 7^ 0, and 
then A~^ =(^ik) with elements bj,^ = ai^i/\A\. 
Moreover, we have |/4~‘ | = (In the case 
where the elements are in the commutative 
ring R with unity element, A exists if and 
only if I /4 1 is a ^regular element of R.) 



F. Theorems on Determinants 

(1) Let (3;^ be the cofactor of a, 7, in the deter- 
minant of an nxn matrix A = {aik)- Then the 
determinant \ai^\ is equal to In general, 

t^(i, ip(k, » 

(”;■) 

l^(i, ~ Ml 

(2) The determinant of a submatrix of the 
matrix (a.-J, composed of the ijth, .. . , i,th rows 
and ^ith, . . . , /c^th columns of is equal to 

Mr k^)- 

(3) Let A ( ' ’ ’ M be the determinant 

\k,,...,kj 

of the (n — r) X (n — r) matrix obtained from an 
nxn matrix A by eliminating its ijth, ... , j,.th 
rows and /cjth, . . . , /c,.th columns. Then 
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(4) Sylvester’s theorem. Let (i, /c = 1, . . . , 

n — r) denote the minor a,i r,r+i)(i,...,r,r+fc) 

of an « X « matrix A = (aji). Then 



\bik\-\A\ 



Cl 1 1 • . • a i 



a^i ... 



(5) Let A he an nxm matrix and B an m x n 
matrix. Then AB is an n x n matrix. If n>m, 
then |v4B| =0. If n^m, let (i) = (i, , ... , ij (i, < .. . 
< i„) be a combination of 1 , 2, . . . , m, taken n at 
a time. Let be the nxn matrix composed 
of the (’ith, ..., i„th columns of A, and the 
nxn matrix composed of the ijth, ..., i„th rows 
of B. Then M^^l — 2(i)M(i)l M(i)l’ where the 
summation extends over all possible combi- 
nations (/). 

(6) Determinant of a '•'Kronecker product. If 
4 is an m X m matrix and B is an n x n matrix, 
then l/l(x)B| = |/inBr. 

(7) Let H he an nxn '•'Hermitian matrix, and 
let denote the matrix composed of its first k 
rows and columns. Then H is positive definite 
if and only if |//^|>0 for all /c= 1, •••,«■ 



G. Special Determinants 

(1) A determinant of the form 



1 


1 


... 1 


-^1 


X 2 


••• 






••• X^ 


n -1 

•^1 


v-n ^ 1 
^2 





is called a Vandermonde determinant. It is 
equal to the ^simplest alternating function 

(2) A cyclic determinant is one of the form 



■'^0 




X 2 • 


• ^«-l 




Xo 


Xi • 


• ^«-2 




X 2 


^3 • 


• ^0 



= nMo + C'xi+C^'x, + ...+C''’-'A-i), 

1 = 0 

where C is a '•'primitive nth root of unity. 

(3) Consider the vectors ,a(2, ••• , «,„) 

((— 1,2, ...,n), and let i'Xi,ocj) denote the dinner 
product of a,- and ctj. Then the following deter- 
minant is called the Gramian of these vectors: 





(ai,a2) • 


•• (ai.aJ 


(a2,ai) 


(X2,a2) • 


•• (a2.«J 




(a«,«2) • 





1 1 ^12 ■ ■ ■ 

^^21 ^22 ■■■ ^2ii 

^n2 ^nn 



i <j, k< 1. 
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(4) For an talternating matrix (namely, a 
square matrix X such that ‘X = —X), we have 
the identity 



0 


Xi2 


^13 




1 

X 


0 


^23 






~^23 


0 


^3« 


-JCln 




••• 


0 



...)^ if a is even, 
[0 if n is odd, 

where , . . . ) is a polynomial of vari- 

ables X;j, which (equipped with appropriate 
sign) is called the Pfaffian of these variables. 
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Difference Equations 

A. General Remarks 

Let y be a function of a real variable x defined 
on an interval / and let Ax be a fixed quantity. 
When two points x and x + Ax are in /, we 
define the difference Ay(x) of y at x by Ay(x) = 
y(x-i-Ax)— y(x) and the difference quotient 
by Ay(x)/Ax; Ax is called the difference of x. 
Without loss of generality, we can take Ax = 

1, for otherwise there is a constant b such 
that Ax'= 1 for the new independent vari- 
able bx — x'. If Ax = 1, the second difference 
A^y(x) = A(Ay(x)) is given by 

A"y(x)-Ay(x+1)-Ay(x) 

y(x -b 2) - 2y(x +\) + y(x). 

Similarly, the difference of the nth order is 

defined by A"y(x) = A(A" *y(x)), and 



Conversely, y(x + n) is expressed by differences 
as 

y(x + n)= A'‘y(x) 

(^ 223 Interpolation). 



B. Summation 



Given a function g{x) and Ax, a function y(x) 
that satisfies Ay(x)/Ax = 3 (x) is called a sum 
of g{x). Summation of gi(x) is to find a sum of 
^/(x). Given a sum y(x) of g(x), an indefinite 
sum of ^(x), written as S< 5 i(x)Ax, is given by 
Sg'(x)Ax = y(x)-|-c(x), where c{x) is an arbitrary 
periodic function of period Ax. In many cases, 
c(x), which corresponds to an arbitrary con- 
stant in an indefinite integral, is omitted. For 
example, a sum of ^(x) = nx"“^ for Ax = 1 is the 
rtth-ordcr tBernoulli polynomial B„(x) for 
n#0; a sum of x"‘ is \jj{x), given by i//(x) = 
JlogF(x)/dx (^ 174 Gamma Function). 

When the series — Ax2®=o.i?(x-l-/cAx) or 
Ax£^i g{x — kAx) converges, both can be 
sums of g{x). Since the requirement of conver- 
gence for these series was found to be too 
strict, the following requirement was given by 
N. E. Norlund instead: Let x be a real variable 
and g{x) be continuous for x^h. Define A(x) 
by 2(x) = x^(logx)'' (p> \,q^0). Then if for a 
positive r] 



F(x, Ax, t]) — 




g{z)e 



-Ax ^ 3 (x + kAx)c“"^<^+'‘'^^' 



is convergent for a>b, F satisfies AF(x, Ax, 
ri)/Ax — g{x)exp{~t]?>.{x)). Accordingly, if 
F(x, Ax, rj) approaches a limit F(x) as rj^O, 
F{x) is a solution of AF(x)/Ax = g(x). F(x) 
is called the principal solution of AF(x)/Ax = 
g(x). 



C. Difference Equations 

Let Ax= 1. An equation F(x, y(x), Ay(x), 

. . . , A"y(x)) = 0 in X and differences of an un- 
known function y(x) is called a difference 
equation. If the substitution y = <p(x) satisfies 
the equation for x in some interval, (p{x) is a 
solution of the equation. Because of the rela- 
tion between y(x), y(x -b 1), . . . , y(x -b n) and the 
differences of y at .x, we can transform the 
given difference equation in the form G(x,y(x), 
y(x -b 1), . . . , y(x -b n)) = 0. This form appears 
more often in applications and is called the 
standard form of a difference equation. 




